These are three ways to make an LLM do what you want, working at different layers: prompting shapes behavior, RAG injects knowledge, and fine-tuning changes the model.
These are three ways to make an LLM do what you want, working at different layers: prompting shapes behavior, RAG injects knowledge, and fine-tuning changes the model.
| Prompt engineering | RAG | Fine-tuning | |
|---|---|---|---|
| Changes | The prompt | The prompt (+ retrieval) | The model weights |
| Best for | Behavior, format, tone | Up-to-date / private facts | Consistent style, narrow tasks |
| Knowledge freshness | N/A | Live (re-index data) | Frozen at train time |
| Cost / effort | Lowest | Medium (infra) | Highest (training + data) |
| Updating | Edit text | Update the index | Re-train |
Reaching for the wrong tool is expensive: people often try to fine-tune to add knowledge (which RAG does better and cheaper) or to fix behavior (which prompting handles). Knowing that prompting shapes behavior, RAG supplies knowledge, and fine-tuning changes the model lets you pick the cheapest approach that works — and combine them deliberately rather than by accident.