Framework for synthetic data generation with AI feedback
python ai evaluation synthetic-data finetuning dpo huggingface synthetic-data-generation llm rlhf rlaif llm-evaluation ai-feedback
-
Updated
Dec 26, 2024 - Python