Soft Prompt
A learned sequence of continuous vectors prepended to an AI model's input that steers its behaviour, as an alternative to traditional text prompts.
A soft prompt is a sequence of learned, continuous vectors that are prepended to a model's input to steer its behaviour. Unlike a regular (hard) prompt made of human-readable words, a soft prompt exists only as numbers in the model's embedding space; it is not text you can read.
Hard prompts vs soft prompts
A hard prompt is text you write: "You are a helpful customer service agent. Answer the following question politely and concisely." You craft it manually, iterating on wording until the model behaves as desired.
A soft prompt achieves the same goal differently. Instead of finding the right words, you learn a sequence of embedding vectors through training. These vectors are optimised directly to make the model produce the desired behaviour. The result is often more effective than any hard prompt, because the optimisation can explore the entire continuous embedding space rather than being restricted to the model's discrete vocabulary.
How soft prompting works
- A sequence of virtual tokens (typically 10-100) is initialised randomly.
- These tokens are prepended to the model's input.
- The model processes the combined sequence normally.
- The model's output is compared to the desired output, and the soft prompt vectors are adjusted by gradient descent to reduce the error.
- This process repeats until the soft prompt reliably produces the desired behaviour.
Crucially, the model's weights are frozen during this process; only the soft prompt vectors are updated. This makes soft prompting a very lightweight form of adaptation.
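The loop above can be sketched in miniature. Everything here is illustrative: the "model" is a fixed linear map over mean-pooled embeddings standing in for a frozen transformer, and the gradient is the analytic gradient of a squared-error loss rather than backpropagation through a real network. The structure, though, mirrors the steps: random initialisation, prepending, forward pass, comparison, and updates that touch only the prompt vectors.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n_prompt, n_input = 8, 4, 6

# Frozen "model": a fixed linear map over the mean of the input embeddings,
# standing in for a real frozen transformer. W is never updated.
W = rng.normal(size=(d, d))
W0 = W.copy()  # snapshot, to verify the model really stays frozen

def model(embeddings):
    return W @ embeddings.mean(axis=0)

input_embeds = rng.normal(size=(n_input, d))  # embeddings of the real input tokens
target = rng.normal(size=d)                   # the desired model output

soft_prompt = 0.1 * rng.normal(size=(n_prompt, d))  # step 1: random initialisation

def loss(prompt):
    y = model(np.concatenate([prompt, input_embeds]))
    return float(np.sum((y - target) ** 2))

initial_loss = loss(soft_prompt)
lr = 0.3
for _ in range(2000):
    seq = np.concatenate([soft_prompt, input_embeds])  # step 2: prepend virtual tokens
    err = model(seq) - target                          # steps 3-4: forward pass, compare
    # Analytic gradient of the squared error w.r.t. each prompt vector.
    # Only soft_prompt is updated; W receives no gradient at all.
    grad = 2.0 * (W.T @ err) / len(seq)
    soft_prompt -= lr * grad                           # step 5: adjust and repeat

final_loss = loss(soft_prompt)
```

In a real prompt-tuning setup the same division of labour holds: the prompt vectors are the only trainable parameters, and the error signal flows back through the frozen model to reach them.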
Advantages
- Extremely parameter-efficient: A soft prompt contains only (number of virtual tokens) × (embedding dimension) trainable parameters, typically thousands to tens of thousands, versus billions in the full model.
- Task switching: You can swap different soft prompts in and out to change the model's behaviour without reloading it.
- No readable instructions: The steering mechanism is opaque, which can be an advantage for proprietary prompting strategies.
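The first two advantages can be made concrete with a short sketch. The token count, embedding dimension, model size, and task names below are illustrative assumptions, not figures from any particular model:

```python
import numpy as np

# Parameter count: a soft prompt is just (virtual tokens x embedding dimension).
n_tokens, d_model = 20, 4096             # assumed sizes, for illustration only
soft_prompt_params = n_tokens * d_model  # 81,920 trainable parameters
full_model_params = 7_000_000_000        # a hypothetical 7B-parameter model
fraction = soft_prompt_params / full_model_params  # roughly 0.001 % of the model

# Task switching: keep one frozen model and a bank of trained prompts,
# swapping the prepended vectors to change behaviour without reloading.
prompt_bank = {
    "polite_support": np.zeros((n_tokens, d_model)),  # stand-ins for trained prompts
    "terse_summary": np.ones((n_tokens, d_model)),
}

def assemble(task, input_embeds):
    # Prepend the chosen task's soft prompt to the input embeddings.
    return np.concatenate([prompt_bank[task], input_embeds], axis=0)
```

Because each prompt is only a small array, an entire bank of task behaviours costs a negligible fraction of the memory the frozen model itself occupies.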
Limitations
- Not human-readable: You cannot inspect a soft prompt to understand what it "says." It is a black box.
- Model-specific: A soft prompt is trained for a particular model's embedding space and generally does not transfer to another.
- Limited capability: For complex behavioural changes, fine-tuning or LoRA (low-rank adaptation) typically outperforms soft prompting.
- Requires training: Unlike hard prompts, you need a dataset and training pipeline.
Soft prompting vs other adaptation methods
Soft prompting sits between prompt engineering (no training, text-based) and fine-tuning (extensive training, weight changes). It requires some training but changes far fewer parameters than fine-tuning. LoRA has largely overtaken soft prompting in popularity because it offers more capability at a similar cost.
Why This Matters
Soft prompting represents an important concept in AI customisation: the idea that you can steer model behaviour through learned representations rather than human language. While LoRA has become more popular in practice, understanding soft prompts helps you appreciate the spectrum of model adaptation techniques available.