GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
qa knowledge-graph data-generation question-answering data-synthesis sft pretrain pretraining graphgen ai4science llm llm-training qwen xtuner llama-factory sft-data
-
Updated
Nov 26, 2025 - Python