Federated Learning
A training technique where multiple devices or organisations collaboratively train a shared AI model without sharing their raw data, preserving privacy.
Federated learning is a machine learning approach where a model is trained across multiple devices or organisations without any raw data leaving its source. Instead of collecting all data in one place, the algorithm sends the model to the data.
How federated learning works
- A central server sends the current model to all participating devices or organisations
- Each participant trains the model on their local data
- Only the model updates (not the data) are sent back to the server
- The server aggregates these updates into an improved global model
- The cycle repeats until the model converges
The raw data never leaves each participant's control.
Why this matters for privacy
Traditional machine learning requires centralising data β a major barrier in regulated industries. Hospitals cannot share patient records. Banks cannot share customer transactions. Competing companies will not share proprietary data. Federated learning lets these organisations benefit from collective intelligence without exposing sensitive data.
Real-world applications
- Mobile keyboards β Google's Gboard learns from millions of users' typing patterns without uploading what they type
- Healthcare β hospitals train diagnostic models collaboratively without sharing patient data
- Finance β banks improve fraud detection by learning from industry-wide patterns without revealing customer information
- Manufacturing β factories share quality improvement insights without exposing trade secrets
Challenges
- Communication cost β sending model updates back and forth requires significant bandwidth
- Data heterogeneity β different participants have different types and distributions of data, which can make training difficult
- Security β while raw data stays local, model updates can potentially be reverse-engineered to infer information about the training data. Differential privacy techniques help mitigate this.
- Coordination β managing training across thousands of devices with varying availability, connectivity, and compute power is complex
- Free riders β some participants may benefit from the shared model without contributing useful updates
Why This Matters
Federated learning is the key technology enabling AI in privacy-sensitive industries. If your organisation handles sensitive data β healthcare records, financial information, personal communications β federated learning may be the path to benefiting from AI without compromising data governance obligations.
Related Terms
Continue learning in Expert
This topic is covered in our lesson: Deploying AI Across Your Organisation