CRITERIA FOR TRAINING AI MODELS 

Supervised Fine Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF)