[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
-
Updated
Oct 20, 2024 - Jupyter Notebook
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Official Repository of "LLM × DATA" Survey Paper
Official Repository of "LLM × DATA" Survey Paper
DSIR large-scale data selection framework for language model training
DSIR large-scale data selection framework for language model training
A Survey on Data Selection for Language Models
A Survey on Data Selection for Language Models
⛔ [DEPRECATED] Adapt Transformer-based language models to new text domains
⛔ [DEPRECATED] Adapt Transformer-based language models to new text domains
🐂 🔥Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".
🐂 🔥Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".
InstructionGPT-4
InstructionGPT-4
Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning".
Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning".
[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Models
[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Models
[ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
[ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
Add a description, image, and links to the data-selection topic page so that developers can more easily learn about it.
To associate your repository with the data-selection topic, visit your repo's landing page and select "manage topics."