Core AI

Federated Learning

Last reviewed: April 2026

A training technique where multiple devices or organisations collaboratively train a shared AI model without sharing their raw data, preserving privacy.

Federated learning is a machine learning approach where a model is trained across multiple devices or organisations without any raw data leaving its source. Instead of collecting all data in one place, the algorithm sends the model to the data.

How federated learning works

A central server sends the current model to all participating devices or organisations
Each participant trains the model on their local data
Only the model updates (not the data) are sent back to the server
The server aggregates these updates into an improved global model
The cycle repeats until the model converges

The raw data never leaves each participant's control.

Why this matters for privacy

Traditional machine learning requires centralising data — a major barrier in regulated industries. Hospitals cannot share patient records. Banks cannot share customer transactions. Competing companies will not share proprietary data. Federated learning lets these organisations benefit from collective intelligence without exposing sensitive data.

Real-world applications

Mobile keyboards — Google's Gboard learns from millions of users' typing patterns without uploading what they type
Healthcare — hospitals train diagnostic models collaboratively without sharing patient data
Finance — banks improve fraud detection by learning from industry-wide patterns without revealing customer information
Manufacturing — factories share quality improvement insights without exposing trade secrets

Challenges

Communication cost — sending model updates back and forth requires significant bandwidth
Data heterogeneity — different participants have different types and distributions of data, which can make training difficult
Security — while raw data stays local, model updates can potentially be reverse-engineered to infer information about the training data. Differential privacy techniques help mitigate this.
Coordination — managing training across thousands of devices with varying availability, connectivity, and compute power is complex
Free riders — some participants may benefit from the shared model without contributing useful updates

Want to go deeper?

This topic is covered in our Expert level. Access all 100+ lessons free.

Why This Matters

Federated learning is the key technology enabling AI in privacy-sensitive industries. If your organisation handles sensitive data — healthcare records, financial information, personal communications — federated learning may be the path to benefiting from AI without compromising data governance obligations.

Related Terms

Machine Learning (ML)

A type of AI where systems learn patterns from data instead of following explicitly programmed rules. The system improves its performance through experience.

Deep Learning

A subset of machine learning that uses neural networks with many layers to learn complex patterns. The 'deep' refers to the number of layers, not the depth of understanding.

AI Governance

The policies, processes, and frameworks that guide how an organisation develops, deploys, and manages AI systems — covering risk, ethics, compliance, and accountability.

Training Data

The dataset used to teach an AI model. The quality, size, and composition of training data directly determines what the AI can and cannot do well.

Learn More

Continue learning in Expert

This topic is covered in our lesson: Deploying AI Across Your Organisation

← Back to Glossary