Named Entity Recognition (NER)

Last reviewed: April 2026

An NLP task that identifies and classifies proper nouns and specific terms in text into predefined categories like person, organisation, location, and date.

Named entity recognition (NER) is a natural language processing task that scans text and identifies mentions of specific entities — people, organisations, locations, dates, monetary values, and other defined categories. It is one of the foundational building blocks of text analytics.

How NER works

Given the text "Satya Nadella announced that Microsoft will invest $10 billion in OpenAI's San Francisco headquarters in January 2025", an NER system identifies:

"Satya Nadella" → Person
"Microsoft" → Organisation
"$10 billion" → Money
"OpenAI" → Organisation
"San Francisco" → Location
"January 2025" → Date
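
In practice, output like this is usually represented as labelled character spans rather than bare strings, so that each entity can be traced back to its position in the text. The sketch below is only an illustration of that shape: the `Entity` record and `to_spans` helper are hypothetical names, and the hand-written `PREDICTIONS` list stands in for what a real model would return.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    text: str   # the mention as it appears in the source text
    label: str  # the assigned category
    start: int  # character offset where the mention begins
    end: int    # character offset just past the mention


TEXT = ("Satya Nadella announced that Microsoft will invest $10 billion "
        "in OpenAI's San Francisco headquarters in January 2025")

# Hand-written labels standing in for a model's predictions (assumption).
PREDICTIONS = [
    ("Satya Nadella", "Person"),
    ("Microsoft", "Organisation"),
    ("$10 billion", "Money"),
    ("OpenAI", "Organisation"),
    ("San Francisco", "Location"),
    ("January 2025", "Date"),
]


def to_spans(text, predictions):
    """Attach character offsets to each (mention, label) pair."""
    spans = []
    for mention, label in predictions:
        start = text.find(mention)
        if start == -1:
            continue  # skip mentions that do not occur in the text
        spans.append(Entity(mention, label, start, start + len(mention)))
    return spans


entities = to_spans(TEXT, PREDICTIONS)
for e in entities:
    print(f"{e.start:>3}-{e.end:<3} {e.label:<12} {e.text}")
```

Keeping the offsets makes it possible to highlight entities in the original document or to compare predictions against annotations.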

NER approaches

Rule-based — handcrafted patterns and dictionaries. Precise but inflexible and expensive to maintain.
Statistical models — trained on annotated text using algorithms like conditional random fields (CRFs). The traditional ML approach.
Deep learning models — transformer-based models fine-tuned on NER datasets. Current state of the art, especially for complex and ambiguous text.
LLM-based — using large language models with prompting for entity extraction. Most flexible but slower and more expensive per query.
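
To make the first approach concrete, here is a minimal rule-based sketch that combines a dictionary lookup with two handcrafted regular expressions. The `GAZETTEER` contents, the patterns, and the `rule_based_ner` function are all illustrative assumptions, not a production rule set.

```python
import re

# Hypothetical gazetteer: a small dictionary of known names and their types.
GAZETTEER = {
    "Satya Nadella": "Person",
    "Microsoft": "Organisation",
    "OpenAI": "Organisation",
    "San Francisco": "Location",
}

MONTHS = ("January|February|March|April|May|June|July|August|"
          "September|October|November|December")

# Handcrafted patterns for entity types with a predictable surface form.
PATTERNS = [
    ("Money", re.compile(r"\$\d+(?:\.\d+)?(?: (?:million|billion))?")),
    ("Date", re.compile(rf"(?:{MONTHS}) \d{{4}}")),
]


def rule_based_ner(text):
    """Return (start, end, label, mention) tuples sorted by position."""
    found = []
    # Dictionary lookup: exact matches against the gazetteer.
    for name, label in GAZETTEER.items():
        for m in re.finditer(re.escape(name), text):
            found.append((m.start(), m.end(), label, m.group()))
    # Pattern matching: regular expressions for money and dates.
    for label, pattern in PATTERNS:
        for m in pattern.finditer(text):
            found.append((m.start(), m.end(), label, m.group()))
    return sorted(found)


sentence = ("Satya Nadella announced that Microsoft will invest $10 billion "
            "in OpenAI's San Francisco headquarters in January 2025")
for start, end, label, mention in rule_based_ner(sentence):
    print(f"{mention!r} -> {label}")
```

The sketch also shows why rules are inflexible: any name missing from the gazetteer, or any date written in a different format, is silently missed until someone adds a new rule.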

Standard entity types

The most widely used NER categories come from the OntoNotes 5.0 annotation scheme, which defines 18 entity types. They include Person, Organisation, Location, Date, Time, Money, Percentage, Product, Event, and Law.

Custom NER

Many business applications require custom entity types:

Insurance: policy numbers, claim types, coverage categories
Healthcare: drug names, symptoms, procedures, anatomical terms
Legal: case citations, statute references, party names
Finance: ticker symbols, fund names, regulatory references

Training a custom NER model requires annotated text from your own domain, labelled with the entity types you want to extract.
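
Annotated data for NER is commonly stored as one tag per token, for example in the BIO scheme (B marks the beginning of an entity, I marks a token inside one, O marks a token outside any entity). The sketch below converts character-level annotations into BIO tags. It assumes simple whitespace tokenisation, and the `to_bio` helper, the `POLICY_NUMBER` and `CLAIM_TYPE` labels, and the policy number itself are made up for illustration.

```python
def to_bio(text, spans):
    """Convert character-level spans into one BIO tag per whitespace token.

    spans: list of (start, end, label) character offsets into text.
    """
    tagged = []
    pos = 0
    for token in text.split():
        # Locate this token in the original text to recover its offsets.
        start = text.index(token, pos)
        end = start + len(token)
        pos = end
        tag = "O"
        for s, e, label in spans:
            if start >= s and end <= e:
                # First token of a span gets B-, later tokens get I-.
                tag = ("B-" if start == s else "I-") + label
                break
        tagged.append((token, tag))
    return tagged


text = "Policy HX-20931 covers a water damage claim"
# Hand-made annotations: (start, end, label) character offsets.
spans = [(7, 15, "POLICY_NUMBER"), (25, 37, "CLAIM_TYPE")]

for token, tag in to_bio(text, spans):
    print(f"{token:<10} {tag}")
```

Real annotation pipelines normally use the same tokeniser as the model being trained, since whitespace splitting leaves punctuation attached to words.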

NER in the LLM era

Large language models have made NER more accessible. Instead of training a custom model, you can prompt an LLM to extract entities from text. This works well for prototyping and low-volume use cases, but dedicated NER models are faster and cheaper at scale.
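
A prompt-based extractor might look like the sketch below. It deliberately makes no API call: `build_prompt` and `parse_entities` are hypothetical helpers, the prompt wording is just one possible phrasing, and the `reply` string is a hand-written stand-in for a model response. The parsing step is included because LLM output can be malformed or can contain entities that do not appear in the text.

```python
import json

ALLOWED_LABELS = ["Person", "Organisation", "Location", "Date", "Money"]


def build_prompt(text, labels=ALLOWED_LABELS):
    """Build an extraction prompt that asks for a JSON list of entities."""
    return (
        "Extract the named entities from the text below.\n"
        f"Allowed labels: {', '.join(labels)}.\n"
        "Reply with only a JSON list of objects, "
        'each with the keys "text" and "label".\n\n'
        f"Text: {text}"
    )


def parse_entities(reply, source_text, labels=ALLOWED_LABELS):
    """Keep only well-formed entities that actually occur in the source text."""
    try:
        items = json.loads(reply)
    except json.JSONDecodeError:
        return []  # the reply was not valid JSON
    if not isinstance(items, list):
        return []
    entities = []
    for item in items:
        if not isinstance(item, dict):
            continue
        mention, label = item.get("text"), item.get("label")
        if (isinstance(mention, str) and mention
                and label in labels and mention in source_text):
            entities.append((mention, label))
    return entities


source = "Microsoft will invest $10 billion in OpenAI"
# Hand-written stand-in for a model reply (assumption, not real output).
reply = ('[{"text": "Microsoft", "label": "Organisation"}, '
         '{"text": "Google", "label": "Organisation"}, '
         '{"text": "$10 billion", "label": "Currency"}]')

print(build_prompt(source))
print(parse_entities(reply, source))
```

Here the second item is dropped because "Google" does not occur in the source text, and the third because "Currency" is not an allowed label.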

Why This Matters

NER transforms unstructured text into structured, actionable data. For organisations processing large volumes of documents, emails, or customer communications, NER automates information extraction that would otherwise require hours of manual work. It is often the first step in building intelligent document processing systems.

Related Terms

Natural Language Processing (NLP)

The branch of AI focused on enabling computers to understand, interpret, and generate human language in useful ways.

Entity Extraction

An NLP technique that automatically identifies and classifies named entities — people, organisations, locations, dates, amounts — from unstructured text.

Large Language Model (LLM)

A type of AI trained on vast amounts of text to understand and generate human language. ChatGPT, Claude, and Gemini are all LLMs.

Machine Learning (ML)

A type of AI where systems learn patterns from data instead of following explicitly programmed rules. The system improves its performance through experience.
