Machine Learning Operations (MLOps)
The set of practices and tools for deploying, monitoring, and maintaining machine learning models in production reliably and at scale.
MLOps (Machine Learning Operations) applies DevOps principles to machine learning, bridging the gap between building a model and running it reliably in production at scale.
Why MLOps exists
Most machine learning models never make it to production; industry surveys have suggested that as many as 87 per cent of data science projects fail to deploy. The challenge is not building models – it is operationalising them. MLOps addresses this "last mile" problem.
Core MLOps practices
- Version control – tracking not just code but also data, model weights, hyperparameters, and experiments
- CI/CD for ML – automated pipelines that test, validate, and deploy models
- Model registry – a catalogue of trained models with metadata, lineage, and approval status
- Feature stores – centralised repositories of engineered features that ensure consistency between training and serving
- Monitoring – tracking model performance, data drift, and system health in production
- Automated retraining – triggering model retraining when performance degrades below acceptable thresholds
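The model-registry idea above can be sketched in a few lines of plain Python. This is a hypothetical illustration of the metadata a registry tracks (versions, metrics, data lineage, approval status), not any particular tool's API:

```python
from dataclasses import dataclass

@dataclass
class ModelVersion:
    name: str
    version: int
    metrics: dict
    training_data: str       # lineage: which dataset produced this model
    status: str = "pending"  # pending -> approved -> archived

class ModelRegistry:
    """Minimal in-memory registry; real registries persist this catalogue."""

    def __init__(self):
        self._models = {}

    def register(self, name, metrics, training_data):
        versions = self._models.setdefault(name, [])
        mv = ModelVersion(name, len(versions) + 1, metrics, training_data)
        versions.append(mv)
        return mv

    def approve(self, name, version):
        self._models[name][version - 1].status = "approved"

    def latest_approved(self, name):
        approved = [m for m in self._models.get(name, []) if m.status == "approved"]
        return approved[-1] if approved else None

# Hypothetical usage: register two versions, approve the better one.
registry = ModelRegistry()
registry.register("churn", {"auc": 0.81}, "churn_2024_01.parquet")
registry.register("churn", {"auc": 0.84}, "churn_2024_02.parquet")
registry.approve("churn", 2)
best = registry.latest_approved("churn")
```

Serving code can then ask the registry for the latest approved version rather than hard-coding a model file, which is what makes controlled rollouts and rollbacks possible.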
The MLOps lifecycle
- Development – experiment with data, features, and architectures
- Training – train and validate models in reproducible pipelines
- Evaluation – test against benchmarks and business metrics
- Deployment – serve the model via APIs, batch jobs, or edge devices
- Monitoring – track predictions, latency, and data quality
- Retraining – update the model as data and requirements change
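The monitoring and retraining steps can be sketched as a rolling check on live accuracy that flags the model for retraining once it drops below a floor. The class name, threshold, and window size here are illustrative assumptions, not a standard:

```python
from collections import deque
from statistics import mean

class ModelMonitor:
    """Tracks a rolling window of prediction outcomes in production."""

    def __init__(self, accuracy_floor=0.80, window=50):
        self.accuracy_floor = accuracy_floor
        self.outcomes = deque(maxlen=window)  # 1 = correct, 0 = wrong

    def record(self, correct: bool):
        self.outcomes.append(1 if correct else 0)

    def needs_retraining(self) -> bool:
        # Wait for a full window before judging, then compare rolling
        # accuracy against the agreed floor.
        if len(self.outcomes) < self.outcomes.maxlen:
            return False
        return mean(self.outcomes) < self.accuracy_floor

# Simulate degraded live traffic: roughly two in three predictions correct.
monitor = ModelMonitor(accuracy_floor=0.80, window=50)
for i in range(50):
    monitor.record(correct=(i % 3 != 0))
```

In a real pipeline the `needs_retraining` signal would trigger the automated retraining stage rather than a manual investigation; drift checks on input features usually sit alongside this accuracy check, since labels often arrive late.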
Common tools
- MLflow – experiment tracking and model registry
- Kubeflow – ML workflows on Kubernetes
- Weights & Biases – experiment tracking and visualisation
- Seldon / BentoML – model serving
- Great Expectations – data quality validation
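In the spirit of Great Expectations, a minimal hand-rolled version of batch validation might look like this. The function names below are hypothetical illustrations of the idea – declared expectations checked against incoming data – not the library's actual API:

```python
# Each expectation is a predicate over a batch of records (dicts).
def expect_not_null(rows, column):
    return all(r.get(column) is not None for r in rows)

def expect_between(rows, column, lo, hi):
    return all(lo <= r[column] <= hi for r in rows if r.get(column) is not None)

def validate(rows, expectations):
    """Run every named expectation and report pass/fail per check."""
    return {name: check(rows) for name, check in expectations.items()}

# Hypothetical batch arriving at a training or serving pipeline.
batch = [
    {"age": 34, "income": 52_000},
    {"age": 29, "income": 61_000},
]
report = validate(batch, {
    "age_not_null": lambda rows: expect_not_null(rows, "age"),
    "age_in_range": lambda rows: expect_between(rows, "age", 0, 120),
})
```

Running checks like these before training and before serving is what catches silently broken upstream data – the failure mode MLOps monitoring exists to surface.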
MLOps maturity levels
- Level 0 – manual, ad-hoc process. Models deployed once and rarely updated.
- Level 1 – automated training pipelines. Models retrained regularly.
- Level 2 – full CI/CD for ML. Automated testing, deployment, and monitoring.
Why This Matters
MLOps is what separates a successful AI demo from a successful AI product. Without it, models degrade silently, data pipelines break without anyone noticing, and the promising proof-of-concept becomes a liability. Investing in MLOps from the start is far cheaper than retrofitting it after problems emerge.