Supervised fine-tuning (SFT) is one step in the process of aligning AI models with human preferences, by training them on a dataset of examples showing desired behaviour.
What is supervised fine-tuning?
Supervised fine-tuning (SFT) is one step in the process of aligning AI models with human preferences, by training them on a dataset of examples showing desired behaviour.