Data Annotation
The process of adding labels, tags, or other metadata to raw data so that AI models can learn from it: the essential human step that makes supervised learning possible.
Data annotation is the process of attaching meaningful labels, tags, or other metadata to raw data (text, images, audio, or video) so that AI models can learn from it. It is the human-powered step that transforms unlabelled data into the training material that supervised learning requires.
Why annotation matters
AI models do not learn from raw data alone. They need to know what each piece of data represents:
- An image needs a label saying "this is a cat" or bounding boxes showing where objects are
- A text needs a label saying "this is positive sentiment" or "this is a spam email"
- An audio clip needs a transcription saying what words were spoken
- A medical scan needs an expert marking regions of interest
Without these annotations, the model has no ground truth to learn from.
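In code, ground truth is simply paired inputs and labels. A minimal sketch (the texts and labels below are invented examples, not a real dataset):

```python
# A minimal sketch of annotated data as (input, label) pairs.
# The texts and labels are invented for illustration, not a real dataset.
annotated = [
    ("Win a free prize now!!!", "spam"),
    ("Meeting moved to 3pm", "not_spam"),
    ("Claim your reward today", "spam"),
]

# Supervised learning consumes exactly this pairing: the label is the
# ground truth the model is trained to reproduce for each input.
inputs = [text for text, _ in annotated]
labels = [label for _, label in annotated]
```

Strip away any given tool or format, and every supervised dataset reduces to this shape.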
Types of annotation
- Classification labels: Assigning a category to each data item. "Spam" or "not spam." "Positive," "neutral," or "negative."
- Bounding boxes: Drawing rectangles around objects in images. Used for object detection in autonomous vehicles, security systems, and retail analytics.
- Segmentation masks: Labelling every pixel in an image with its class. More precise than bounding boxes but far more expensive to create.
- Named entity recognition: Marking words in text as person names, organisation names, locations, dates, etc.
- Relationship annotation: Marking how entities relate to each other. "Company X acquired Company Y."
- Transcription: Converting audio speech to text with timing information.
- Preference ranking: Comparing two AI outputs and indicating which is better. Used for RLHF.
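In practice, each of these annotation types is serialised as a structured record. A sketch of what a few might look like (the field names are illustrative, not any specific tool's export schema):

```python
# Illustrative records for several annotation types.
# Field names are invented for this sketch, not a specific tool's format.

# Classification: one category per item.
classification = {"item_id": 1, "label": "positive"}

# Bounding box: a rectangle around one object in an image.
bounding_box = {
    "image_id": 7,
    "class": "pedestrian",
    # (x, y) of the top-left corner, plus width and height in pixels
    "box": {"x": 104, "y": 58, "w": 42, "h": 110},
}

# Named entity recognition: character-offset spans with entity types.
named_entity = {
    "text": "Company X acquired Company Y",
    "spans": [
        {"start": 0, "end": 9, "type": "ORG"},
        {"start": 19, "end": 28, "type": "ORG"},
    ],
}

# Preference ranking: which of two model outputs the annotator preferred.
preference = {"prompt_id": 3, "chosen": "output_a", "rejected": "output_b"}
```

Whatever the tool, the delivery step of the pipeline ultimately exports records like these for the training code to consume.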
The annotation pipeline
- Task design: Define exactly what annotators should label, with clear guidelines and examples.
- Annotator training: Ensure annotators understand the task and can produce consistent labels.
- Annotation: Annotators label the data using specialised tools.
- Quality assurance: Review annotations for accuracy and consistency. Common approaches include double-annotation (two people label each item) and expert review of samples.
- Adjudication: Resolve disagreements between annotators through discussion or expert ruling.
- Delivery: Export annotations in the format required by the training pipeline.
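The quality-assurance step on double-annotated data is often quantified with an inter-annotator agreement statistic; Cohen's kappa is a common choice because it corrects raw agreement for agreement expected by chance. A self-contained sketch:

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Agreement between two annotators, corrected for chance agreement."""
    assert len(labels_a) == len(labels_b) and labels_a
    n = len(labels_a)

    # Observed agreement: fraction of items where both annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected agreement: chance of agreeing if each annotator labelled
    # at random according to their own label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b.get(c, 0) for c in freq_a) / (n * n)

    if expected == 1:  # degenerate case: only one label ever used
        return 1.0
    return (observed - expected) / (1 - expected)
```

A kappa near 1 indicates strong agreement; values near 0 mean the annotators agree no more than chance would predict, a signal that the task guidelines need revision.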
The scale of annotation
The annotation industry is enormous. Companies like Scale AI, Labelbox, and Appen coordinate distributed workforces of hundreds of thousands of annotators globally. Training a single large AI model may require millions of annotated examples.
Challenges in annotation
- Cost: High-quality annotation is expensive. Medical image annotation requires doctors. Legal document annotation requires lawyers.
- Consistency: Different annotators may interpret guidelines differently, introducing noise into the training data.
- Subjectivity: Some tasks have inherently subjective labels. Is this restaurant review "positive" or "neutral"?
- Scale: The volume of annotation needed for modern AI far exceeds what any small team can produce.
- Annotator welfare: Annotation of harmful content (violence, abuse, misinformation) has raised serious concerns about the psychological impact on workers.
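A common mitigation for the consistency and subjectivity challenges is to collect several independent labels per item and aggregate them by majority vote, escalating ties to the adjudication step. A minimal sketch:

```python
from collections import Counter

def majority_vote(labels):
    """Aggregate multiple annotators' labels for one item.

    Returns the most common label, or None on a tie, signalling that
    the item should go to adjudication (discussion or expert ruling).
    """
    counts = Counter(labels).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None  # tie: no clear majority
    return counts[0][0]
```

This trades extra annotation cost (multiple labels per item) for noise reduction in the final dataset.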
Why this matters
Data annotation is the hidden labour behind every AI model. Understanding this process helps you estimate realistic costs and timelines for AI projects, evaluate the quality of training data, and appreciate why building high-quality labelled datasets is often the hardest part of an AI initiative.
Continue learning in Essentials
This topic is covered in our lesson: The AI Landscape – Models, Tools, and Players
Training your team on AI? Enigmatica offers structured enterprise training built on this curriculum. Explore enterprise AI training →