Model Context Protocol
In-context learning
Reflection agents
Top-k and top-p sampling
Reasoning Models
Group Relative Policy Optimization
Proximal Policy Optimization
RLHF
ReAct Agent Model
Knowledge Distillation
Mixture of Experts