BlogVibe Annotation: We’re building “Auta” — AI-powered data annotation with prompts

Vibe Annotation: We’re building “Auta” — AI-powered data annotation with prompts

Introducing Auta: Annotate Data by Vibe Data annotation is one of the most important — and often most frustrating — parts of building machine learning systems. Whether you're worki...

Perceptron Team

December 3, 2025

6 min read

Introducing Auta: Annotate Data by Vibe

Data annotation is one of the most important — and often most frustrating — parts of building machine learning systems. Whether you're working in computer vision, robotics, medicine, or sports analytics, high-quality labeled data is essential. But producing that data usually means spending hours drawing bounding boxes, tracing segmentation masks, assigning labels, and organizing datasets.

The process is repetitive, time-consuming, and expensive.

At the same time, the way we interact with AI is rapidly changing. Instead of manually writing every line of code, developers can now describe what they want and let AI assist with the implementation. Tools like Github Copilot and Cursor have shown how powerful intent-driven workflows can be.

This raised an interesting question for us:

What if annotation could work the same way?

Instead of manually drawing boxes and masks, what if you could simply describe what you want — and let AI handle the rest?

That idea led us to build Auta.

What is Auta?

Auta is an AI-powered data annotation tool inspired by the concept of vibe coding.

In vibe coding, you describe the intent of your code in natural language, and the system helps generate or modify the implementation.

With Auta, the same idea applies to annotation.

Instead of manually labeling objects, you simply write a prompt describing what needs to be annotated.

For example:

“Annotate all the monkeys in these images.”

From that single instruction, Auta can automatically:

Detect relevant objects
Generate labels
Assign consistent colors
Create unique object IDs
Draw bounding boxes
Produce segmentation masks

The goal is simple:

Turn annotation from a manual task into an intent-driven workflow.

Why Annotation Needs to Change

Modern AI systems depend on massive datasets. But creating these datasets remains one of the biggest bottlenecks in machine learning pipelines.

Traditional annotation tools require users to:

Draw bounding boxes around every object
Manually outline segmentation masks
Assign labels and categories
Maintain consistency across large datasets
Repeat the same actions thousands of times

Even with semi-automated tools, the workflow is still heavily manual.

This slows down experimentation, increases costs, and makes it harder for smaller teams or independent researchers to build high-quality datasets.

Auta aims to reduce this friction by allowing users to describe what they want instead of manually performing each step.

How Auta Works

Auta converts natural language instructions into annotation workflows.

When a user provides a prompt, the system first plans the annotation task automatically. It determines what labels need to be created, how objects should be identified, what annotation types should be used, and how results should be organized.

Then the AI performs the annotation across the dataset.

For example, if a user writes:

“Annotate all the monkeys in these images.”

Auta will automatically:

Identify objects matching the description
Generate bounding boxes or segmentation masks
Create labels for the detected objects
Assign consistent IDs and colors
Apply the annotations across the entire dataset

Instead of manually labeling hundreds of images, the user simply provides the intent, and the system handles the execution.

Features Implemented So Far

Although Auta is still in an early stage, several core features are already implemented.

Automatic Annotation Planning

When a prompt is given, Auta automatically creates a structured plan for the annotation task.

This includes:

Label creation
Color assignment
Object ID generation
Annotation type selection

This removes the need for manual configuration before starting annotation.

Bounding Box Detection

Auta can automatically generate bounding boxes around objects based on the user's instruction.

This is useful for tasks such as:

Object detection datasets
Autonomous driving models
Wildlife monitoring
Sports analytics

Segmentation Masks

In addition to bounding boxes, Auta can generate pixel-level segmentation masks, enabling more precise annotations for tasks like medical imaging, object segmentation, and robotics perception.

Batch Annotation

One of the most powerful aspects of Auta is the ability to annotate large batches of images at once.

Instead of labeling images one by one, a single prompt can be applied across an entire dataset.

This dramatically reduces the time required to create training data.

What We're Building Next (Phase 2)

We're currently working on expanding Auta with more advanced capabilities.

Object Tracking Across Video Frames

Many real-world datasets involve video rather than static images.

Auta will soon be able to track objects across frames, automatically assigning consistent IDs as objects move through the video.

This is especially useful for:

Sports analytics
Autonomous driving datasets
Surveillance systems
Motion analysis

Automatic Dataset Creation

Another feature we’re exploring is automatic dataset generation from prompts.

For example, a user could write:

“Create a dataset of 1,000 images with segmentation masks for cats.”

The system would then:

Collect relevant images
Generate annotations
Organize them into a structured dataset
Export them for training

This would allow developers to build datasets with minimal manual involvement.

Our Vision

Our long-term vision for Auta is simple:

Make dataset creation as easy as describing what you want.

Instead of spending days or weeks labeling data, researchers and developers should be able to:

Describe their dataset
Generate annotations automatically
Iterate quickly on new ideas

We believe this could significantly accelerate development in fields like:

Computer vision
Robotics
Medical AI
Sports analytics
Wildlife monitoring
Autonomous systems

We’d Love Your Feedback

Auta is still in an early stage, and we’re sharing it now because tools like this become far better when shaped by the community.

If you work with machine learning, data labeling, or computer vision, we’d love to hear your thoughts.

Some questions we’re especially interested in:

What would make a tool like this genuinely useful for you?
What features are missing?
What parts of annotation workflows slow you down the most?

Your feedback will help shape the direction of Auta and the features we build next.

Thanks for reading, and we’d really appreciate hearing your thoughts.

Back to all articles