Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training Code #2

Open
vgoklani opened this issue Apr 26, 2023 · 2 comments
Open

Training Code #2

vgoklani opened this issue Apr 26, 2023 · 2 comments

Comments

@vgoklani
Copy link

Are you planning to release the training and model files?

Thanks!

@macabdul9
Copy link
Collaborator

Hi @vgoklani,

All trained models are available here: https://huggingface.co/MBZUAI. For code, currently, we are a little occupied with training and evaluating more models but one can use Standford's Alpaca to train models on LaMini Instructions as we are following the same. For more details please look into our paper: https://arxiv.org/abs/2304.14402

Thanks again.

@haiduo
Copy link

haiduo commented Mar 8, 2024

Hi @vgoklani,

All trained models are available here: https://huggingface.co/MBZUAI. For code, currently, we are a little occupied with training and evaluating more models but one can use Standford's Alpaca to train models on LaMini Instructions as we are following the same. For more details please look into our paper: https://arxiv.org/abs/2304.14402

Thanks again.

Hello, could you please provide the code for distilling the student model from the teacher model? How is sequence-level distillation done here?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants