This project focuses on distilling large language models (LLMs) into smaller versions and fine-tuning them on domain-specific tasks. The distillation process uses a teacher model (gpt-neo-1.3B) to train a student model (distilgpt2). The distilled student is then evaluated on several NLP benchmarks. Finally, it is fine-tuned on the Yahoo News Financial dataset and evaluated on FinQA.
- Teacher Model: EleutherAI/gpt-neo-1.3B
- Student Model: distilgpt2
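The repository's actual training loop isn't shown here, but as a rough illustration of the teacher-student setup described above, a single distillation step with logit matching (the temperature value and loss form are assumptions, following standard soft-target KD) might look like:

```python
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

# Teacher is frozen; only the student receives gradient updates.
teacher = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-neo-1.3B").eval()
student = AutoModelForCausalLM.from_pretrained("distilgpt2")
# Both models use the GPT-2 BPE vocabulary, so their logits align per token.
tokenizer = AutoTokenizer.from_pretrained("distilgpt2")

def distill_loss(input_ids, attention_mask, temperature=2.0):
    # Teacher forward pass runs without gradients.
    with torch.no_grad():
        t_logits = teacher(input_ids=input_ids, attention_mask=attention_mask).logits
    s_logits = student(input_ids=input_ids, attention_mask=attention_mask).logits
    # KL divergence between temperature-softened distributions,
    # scaled by T^2 as in standard knowledge distillation.
    return F.kl_div(
        F.log_softmax(s_logits / temperature, dim=-1),
        F.softmax(t_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature**2
```

In practice this soft-target loss is usually combined with the ordinary next-token cross-entropy on the training data; the mixing weight is a tunable hyperparameter.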
Dependencies:
- transformers
- datasets
- torch
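If these are not already available, something like `pip install transformers datasets torch` should pull them in (exact versions aren't pinned here).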
Evaluation results will be saved in:
- benchmark_results.csv (benchmark tasks)
- finqa_benchmark_results.csv (FinQA evaluation)
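As a quick way to inspect these outputs (assuming pandas is available; it isn't listed among the dependencies above), the CSV files can be loaded directly:

```python
import pandas as pd

# Preview the benchmark results; the column layout depends on what
# the evaluation script writes, so this just prints whatever is there.
results = pd.read_csv("benchmark_results.csv")
print(results.head())
```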