Skip to main content
Early access β€” new tools and guides added regularly
Practical

Image Recognition

Last reviewed: April 2026

AI's ability to identify and categorise objects, people, scenes, and other elements within images, powering applications from photo organisation to medical diagnosis.

Image recognition is the ability of AI systems to identify and categorise the contents of images β€” objects, people, scenes, text, activities, and more. It is one of the most mature and widely deployed applications of deep learning.

How image recognition works

Modern image recognition uses convolutional neural networks (CNNs) or vision transformers (ViTs) trained on millions of labelled images. During training, the model learns to identify visual features β€” edges, textures, shapes, patterns β€” that are associated with different categories.

Types of image recognition tasks

  • Image classification β€” assigns a label to an entire image ("this is a cat")
  • Object detection β€” identifies and locates multiple objects within an image with bounding boxes ("there is a cat at coordinates X,Y and a dog at coordinates A,B")
  • Segmentation β€” classifies every pixel ("these pixels are cat, these are background")
  • Face recognition β€” identifies specific individuals from facial features
  • Optical character recognition (OCR) β€” reads text from images
  • Scene understanding β€” identifies the context ("this is a kitchen," "this is a highway")

Business applications

  • Manufacturing β€” visual quality inspection on production lines
  • Healthcare β€” detecting tumours in medical scans, analysing pathology slides
  • Retail β€” visual search ("find products that look like this photo"), inventory management
  • Security β€” surveillance analysis, access control
  • Agriculture β€” crop disease detection, yield estimation from aerial images
  • Insurance β€” damage assessment from photos of vehicles or property

Accuracy and limitations

Modern image recognition exceeds human accuracy on many benchmarks but can fail in unexpected ways:

  • Adversarial examples β€” subtle image modifications invisible to humans can cause misclassification
  • Domain shift β€” a model trained on professional photos may fail on smartphone photos taken in poor lighting
  • Bias β€” models may perform worse on underrepresented demographics or unusual contexts
  • Context dependence β€” a model might correctly identify a stop sign in daylight but miss it when partially obscured by snow
Want to go deeper?
This topic is covered in our Practitioner level. Access all 60+ lessons free.

Why This Matters

Image recognition is one of the most practically valuable AI capabilities, with proven ROI in manufacturing, healthcare, retail, and security. Understanding its capabilities and limitations helps you identify high-value opportunities in your organisation and set realistic expectations for accuracy in your specific operating conditions.

Related Terms

Learn More

Continue learning in Practitioner

This topic is covered in our lesson: Building Your First AI Workflow