Model Fine-Tuning
The process of further training a pre-trained AI model on your own data so it performs better on your specific tasks.
Model fine-tuning is the process of taking an AI model that has already been trained on general data and continuing its training on a smaller, specialised dataset. The goal is to adapt the model's behaviour for specific tasks, domains, or styles.
Why fine-tune instead of just prompting?
Prompt engineering can get you far, but it has limits. If you need a model to consistently follow a specific format, use domain terminology correctly, adopt a particular tone, or handle niche tasks reliably, fine-tuning encodes these requirements directly into the model's weights rather than relying on instructions in every prompt.
When fine-tuning makes sense
- You need consistent behaviour that prompt engineering cannot reliably achieve.
- You have a specific, repeatable task with clear quality criteria.
- You have enough high-quality training examples (typically hundreds to thousands).
- The task is important enough to justify the setup cost and ongoing maintenance.
When fine-tuning does not make sense
- Your needs change frequently; prompting is more flexible.
- You do not have enough quality training data.
- The task is one-off or experimental.
- RAG (retrieval-augmented generation) can solve the problem by providing context at query time.
The fine-tuning process
- Prepare your data: Create examples in the format the model expects, typically input-output pairs showing the desired behaviour.
- Choose your approach: Full fine-tuning updates all of the model's parameters (expensive, powerful). Parameter-efficient methods such as LoRA and QLoRA update only a small fraction of them (cheaper, and usually sufficient).
- Train the model: Upload your data to the provider's fine-tuning service or run training on your own hardware.
- Evaluate results: Test the fine-tuned model against held-out examples to measure improvement.
- Iterate: Adjust your training data, hyperparameters, or approach based on results.
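The data-preparation step above often means producing a JSONL file of input-output pairs. A minimal sketch follows; the chat-style field names (`messages`, `role`, `content`) are the convention several providers use, but the exact schema varies, so check your provider's documentation.

```python
import json

# Hypothetical training examples: input-output pairs in a chat-style
# format. Field names and structure vary by provider.
examples = [
    {"messages": [
        {"role": "user", "content": "Summarise: Q3 revenue rose 12% on strong subscription growth."},
        {"role": "assistant", "content": "Revenue: +12% QoQ. Driver: subscription growth."},
    ]},
]

# Write one JSON object per line (JSONL), the upload format most
# fine-tuning services expect.
with open("train.jsonl", "w") as f:
    for example in examples:
        f.write(json.dumps(example) + "\n")
```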
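Why LoRA is so much cheaper than full fine-tuning comes down to simple arithmetic: instead of training a full weight matrix, LoRA trains two low-rank factors. The sketch below uses illustrative layer sizes (not taken from the text) to show the parameter savings.

```python
# For a weight matrix of shape (d_out, d_in), LoRA trains two low-rank
# factors A (d_out x r) and B (r x d_in) instead of the full matrix.
d_out, d_in, r = 4096, 4096, 8   # illustrative sizes, not from any specific model

full_params = d_out * d_in            # parameters full fine-tuning would update
lora_params = r * (d_out + d_in)      # parameters LoRA actually trains
fraction = lora_params / full_params

print(f"LoRA trains {fraction:.2%} of this layer's parameters")
```

With a rank of 8 on a 4096x4096 layer, LoRA touches well under 1% of the weights, which is why it is the default choice when full fine-tuning is not clearly needed.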
Cost considerations
Fine-tuning costs include compute time for training, higher per-token inference costs for custom models (on some providers), and the ongoing cost of maintaining and updating the fine-tuned model as your needs evolve.
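The training component of that cost is easy to estimate up front: providers typically bill per training token, multiplied by the number of epochs. The price below is a made-up placeholder, not any provider's actual rate.

```python
# Back-of-envelope training-cost estimate. The per-token price is a
# hypothetical placeholder; substitute your provider's published rate.
training_tokens = 2_000_000        # total tokens across all examples
epochs = 3                         # passes over the dataset
price_per_million = 8.00           # hypothetical USD per 1M training tokens

cost = (training_tokens / 1_000_000) * epochs * price_per_million
print(f"Estimated training cost: ${cost:.2f}")  # $48.00
```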
Providers offering fine-tuning
OpenAI, Anthropic, Google, and most open-source model providers support fine-tuning. For open-source models, platforms like Hugging Face, Together AI, and Replicate provide managed fine-tuning infrastructure.
Why This Matters
Fine-tuning represents the middle ground between off-the-shelf AI and building from scratch. Understanding when it is worth the investment versus when prompting or RAG suffice helps you allocate AI budgets wisely and avoid over-engineering solutions to problems that have simpler answers.
Continue learning in Advanced
This topic is covered in our lesson: Fine-Tuning and Customisation Strategies