Fine-tuning

Continuing the training of a pretrained LLM on a task- or domain-specific dataset.

Common approaches: LoRA, full fine-tuning, RLHF, DPO. Useful for enforcing a style, a strict output format, or highly specific domains (legal, medical). Cost: GPU time, a labeled dataset, and evaluation. Often, a solid RAG + prompting setup gets the job done without fine-tuning.
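The core LoRA idea can be sketched in a few lines of NumPy (a minimal sketch, not a production implementation): the pretrained weight matrix W stays frozen, and only a low-rank update B·A (rank r much smaller than the model dimension) is trained, which drastically cuts the number of trainable parameters.

```python
import numpy as np

# LoRA sketch: keep W frozen, learn only a low-rank correction B @ A.
d, r = 768, 8                        # model dimension, adapter rank (r << d)
rng = np.random.default_rng(0)

W = rng.standard_normal((d, d))      # frozen pretrained weight
A = rng.standard_normal((r, d)) * 0.01
B = np.zeros((d, r))                 # B starts at zero: adapter is a no-op at first

def forward(x):
    # Base output plus the trainable low-rank correction.
    return x @ W.T + x @ (B @ A).T

x = rng.standard_normal((1, d))
# With B = 0, the adapted model matches the frozen base exactly.
assert np.allclose(forward(x), x @ W.T)

# Trainable parameters: 2*d*r for the adapter vs d*d for full fine-tuning.
trainable, full = 2 * d * r, d * d
print(trainable, full)               # here the adapter is ~2% of the full matrix
```

During training, only A and B receive gradients; at inference, B·A can be merged back into W so the adapter adds no latency.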
