Skip to content

Add support for the Training Method for finetuning, and for Direct-Preference Optimization (DPO) #317

Add support for the Training Method for finetuning, and for Direct-Preference Optimization (DPO)

Add support for the Training Method for finetuning, and for Direct-Preference Optimization (DPO) #317