Fine-Tuning & Alignment Terms
Terms and explanations from the Agentic AI Glossary.

Adapter Tuning
Definition
A fine-tuning method that adds small trainable adapter layers while keeping most original model weights frozen.
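For illustration only (not part of the glossary entry), here is a minimal PyTorch sketch of one common adapter design, a bottleneck module with a residual connection; the class name and dimensions are assumptions.

import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    # Small trainable module inserted into an otherwise frozen transformer block.
    def __init__(self, hidden_dim=768, bottleneck_dim=64):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)
        self.act = nn.GELU()

    def forward(self, hidden_states):
        # Residual connection keeps the frozen model's behavior recoverable.
        return hidden_states + self.up(self.act(self.down(hidden_states)))

# Only the adapter's parameters are trained; the base model stays frozen.
adapter = BottleneckAdapter()
x = torch.randn(2, 16, 768)      # (batch, sequence, hidden)
print(adapter(x).shape)          # torch.Size([2, 16, 768])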
Batch Size
Definition
The number of training examples processed together before updating model parameters.
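A minimal sketch of where batch size appears in practice, assuming a PyTorch DataLoader; the toy dataset and sizes are illustrative.

import torch
from torch.utils.data import DataLoader, TensorDataset

# Hypothetical toy dataset: 128 examples with 10 features each.
dataset = TensorDataset(torch.randn(128, 10), torch.randint(0, 2, (128,)))

# batch_size=32 means each parameter update is computed from 32 examples.
loader = DataLoader(dataset, batch_size=32, shuffle=True)
print(len(loader))  # 4 batches per epoch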
Catastrophic Forgetting
Definition
When fine-tuning causes a model to lose useful abilities it learned during earlier training.
Checkpoint
Definition
A saved model state that can be reused, evaluated, restored, or continued during training.
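A minimal sketch of saving and restoring a checkpoint with PyTorch; the filename and the tiny model are assumptions made for the example.

import torch
import torch.nn as nn

model = nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# Save a checkpoint: model weights, optimizer state, and training progress.
torch.save({"model": model.state_dict(),
            "optimizer": optimizer.state_dict(),
            "step": 1000}, "checkpoint.pt")

# Restore it later to evaluate, roll back, or continue training from the same point.
state = torch.load("checkpoint.pt")
model.load_state_dict(state["model"])
optimizer.load_state_dict(state["optimizer"])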
Data Cleaning
Definition
Removing errors, duplicates, unsafe content, or irrelevant examples from training or evaluation data.
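A small illustrative sketch of two such checks, dropping duplicates and empty responses; real pipelines would also apply safety and relevance filters. The records are invented for the example.

raw = [
    {"prompt": "Summarize this article", "response": "Here is a summary..."},
    {"prompt": "Summarize this article", "response": "Here is a summary..."},  # duplicate
    {"prompt": "Translate to French", "response": ""},                          # empty output
]

seen, cleaned = set(), []
for example in raw:
    key = (example["prompt"], example["response"])
    if example["response"].strip() and key not in seen:  # drop empties and duplicates
        seen.add(key)
        cleaned.append(example)

print(len(cleaned))  # 1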
Data Curation
Definition
Selecting and organizing high-quality examples that teach the model the desired behavior.
DPO
Definition
Direct Preference Optimization, a preference-tuning method that trains a model to favor preferred responses without a separate reward model.
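A minimal sketch of the DPO objective, assuming you already have sequence-level log-probabilities for the preferred and rejected responses under the trained policy and a frozen reference model; beta and the toy values are illustrative.

import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    # How much more the policy favors each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    # Push the policy to rank the preferred response above the rejected one.
    return -F.logsigmoid(beta * (chosen_margin - rejected_margin)).mean()

# Toy tensors standing in for per-example log-probabilities.
loss = dpo_loss(torch.tensor([-12.0]), torch.tensor([-15.0]),
                torch.tensor([-13.0]), torch.tensor([-14.0]))
print(loss)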
Epoch
Definition
One full pass through the training dataset during model training.
Evaluation
Definition
Testing a tuned model against base-model behavior, target tasks, safety cases, and regression benchmarks.
Full Fine-Tuning
Definition
Updating all or most model weights during fine-tuning instead of using small adapters.
Human Feedback
Definition
Examples where people compare or rate outputs to teach the model preferred behavior.
Hyperparameter
Definition
A training configuration value, such as learning rate or batch size, chosen before training begins.
Instruction Tuning
Definition
Fine-tuning a model on instruction-response examples so it follows user requests better.
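An illustrative sketch of the instruction-response format such data often takes; the field names and records are assumptions, not a fixed standard.

# Hypothetical instruction-response records as they might appear in a JSONL file.
examples = [
    {"instruction": "Rewrite this sentence in a formal tone.",
     "input": "gonna send the report tmrw",
     "output": "I will send the report tomorrow."},
    {"instruction": "List three uses of a checkpoint during training.",
     "input": "",
     "output": "Resuming training, evaluating intermediate models, rolling back regressions."},
]

# During fine-tuning, the instruction and input form the prompt; the output is the target.
for ex in examples:
    prompt = ex["instruction"] + ("\n" + ex["input"] if ex["input"] else "")
    target = ex["output"]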
Learning Rate
Definition
A training hyperparameter that controls how large each model-weight update is during optimization.
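A minimal sketch of a single gradient step, showing how the learning rate scales the weight update; the toy loss and value 0.1 are illustrative.

import torch

# One gradient step: the learning rate scales how far the weights move.
weights = torch.tensor([1.0, -2.0], requires_grad=True)
loss = (weights ** 2).sum()
loss.backward()

learning_rate = 0.1
with torch.no_grad():
    weights -= learning_rate * weights.grad   # w <- w - lr * dL/dw
print(weights)  # tensor([0.8000, -1.6000], requires_grad=True)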
LoRA
Definition
Low-Rank Adaptation, a parameter-efficient fine-tuning method that trains small adapter matrices instead of all model weights.
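To make the idea concrete, a small sketch of the low-rank update for one weight matrix; the shapes, rank, and scaling factor are illustrative assumptions.

import torch

d, k, r = 512, 512, 8            # layer shape and a small rank r << d, k
W = torch.randn(d, k)            # frozen pretrained weight
A = torch.randn(r, k) * 0.01     # trainable low-rank factors
B = torch.zeros(d, r)            # B starts at zero, so the adapted model initially matches the base
alpha = 16

# Effective weight during fine-tuning: only A and B receive gradient updates.
W_adapted = W + (alpha / r) * (B @ A)
print(W_adapted.shape)           # torch.Size([512, 512])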
Model Merging
Definition
Combining weights or adapters from multiple models to create a new model variant.
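One simple merging strategy is linear interpolation of matching weights; the tiny state dicts and mixing coefficient below are illustrative, and real merges assume both variants share the same base architecture.

import torch

# Hypothetical state dicts from two fine-tuned variants of the same base model.
model_a = {"layer.weight": torch.tensor([[1.0, 2.0], [3.0, 4.0]])}
model_b = {"layer.weight": torch.tensor([[2.0, 0.0], [1.0, 4.0]])}

# Weight averaging with mixing coefficient alpha.
alpha = 0.5
merged = {name: alpha * model_a[name] + (1 - alpha) * model_b[name]
          for name in model_a}
print(merged["layer.weight"])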
PEFT
Definition
Parameter-efficient fine-tuning, a family of methods that adapt models by training only a small number of parameters.
PPO
Definition
Proximal Policy Optimization, a reinforcement learning algorithm commonly used in RLHF pipelines to update the policy model against reward-model scores.
Preference Dataset
Definition
A dataset containing preferred and rejected outputs used to train alignment behavior.
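An illustrative sketch of the record structure such a dataset often uses; the field names and example text are assumptions.

# Hypothetical preference records: each pairs a prompt with a preferred and a rejected output.
preference_data = [
    {"prompt": "Explain what an epoch is.",
     "chosen": "An epoch is one full pass through the training dataset.",
     "rejected": "An epoch is when the model finishes learning."},
]

# Preference-tuning methods such as DPO consume this (prompt, chosen, rejected) structure.
for record in preference_data:
    assert {"prompt", "chosen", "rejected"} <= record.keys()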
Prefix Tuning
Definition
A parameter-efficient method that trains continuous prefix vectors prepended to each layer's attention inputs, leaving the original model weights frozen.
Prompt Tuning
Definition
A parameter-efficient method that learns soft prompt embeddings instead of changing the full model.
QLoRA
Definition
Quantized LoRA, a memory-efficient fine-tuning method that combines quantization with LoRA adapters.
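In practice this is often set up with the Hugging Face transformers and peft libraries; the model id and exact settings below are illustrative assumptions, and running it requires a GPU with bitsandbytes installed.

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# Load the frozen base model in 4-bit precision to cut memory use.
bnb_config = BitsAndBytesConfig(load_in_4bit=True,
                                bnb_4bit_quant_type="nf4",
                                bnb_4bit_compute_dtype=torch.bfloat16)
base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf",  # illustrative model id
                                            quantization_config=bnb_config)

# Attach small LoRA adapters; they stay in higher precision and are the only trained weights.
lora = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")
model = get_peft_model(base, lora)
model.print_trainable_parameters()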
Reward Model
Definition
A model trained to score outputs so a fine-tuning or alignment process can prefer better responses.
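A rough stand-in sketch of the idea: an encoder followed by a scalar scoring head. Real reward models are typically fine-tuned language models; the sizes and names here are invented for illustration.

import torch
import torch.nn as nn

class TinyRewardModel(nn.Module):
    # Hypothetical stand-in: an encoder followed by a scalar "reward" head.
    def __init__(self, hidden_dim=64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(128, hidden_dim), nn.ReLU())
        self.score_head = nn.Linear(hidden_dim, 1)

    def forward(self, features):
        return self.score_head(self.encoder(features)).squeeze(-1)

# Higher scores mean "preferred"; alignment training then favors higher-scoring outputs.
rm = TinyRewardModel()
scores = rm(torch.randn(4, 128))   # one scalar score per candidate response
print(scores.shape)                # torch.Size([4])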
RLAIF
Definition
Reinforcement Learning from AI Feedback, where AI-generated preferences help guide model alignment.
RLHF
Definition
Reinforcement Learning from Human Feedback, where human preferences guide model behavior after pretraining.
SFT
Definition
Supervised Fine-Tuning, where a model is trained on labeled examples of desired instructions and responses.
Supervised Fine-Tuning
Definition
Training a pretrained model on labeled prompt-response examples.
Synthetic Data
Definition
Artificially generated examples used for testing, evaluation, training, simulation, or privacy-preserving development.
Test Set
Definition
Held-out data used for final evaluation after training decisions are complete.
Training Data
Definition
The examples used to update model parameters during training.
Validation Set
Definition
Held-out data used during development to tune settings and detect overfitting.
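A minimal sketch of how the training, validation, and test splits relate; the ratios and toy data are illustrative assumptions.

import random

# Hypothetical labeled examples; the 80/10/10 split below is illustrative.
examples = list(range(1000))
random.seed(0)
random.shuffle(examples)

train_set = examples[:800]          # used to update model parameters
validation_set = examples[800:900]  # used during development to tune settings and detect overfitting
test_set = examples[900:]           # held out until all training decisions are final

print(len(train_set), len(validation_set), len(test_set))  # 800 100 100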