See the blog post for motivation.
This is a clean fork of nanoGPT; see the original author's README for setup instructions.
The training setup is single-GPU: nanoGPT fits on a single V100/A100 32GB GPU with a batch size of up to 32 during fine-tuning.
We have enabled 4 different tasks for prompt-tuning. These tasks are toy datasets generated from the following 3 files:
arithmetics_task.py
-- asks the LLM to add 2 large positive numbers (a data-format sketch follows this list)
symbolic_task.py
-- asks the LLM to perform symbol substitution
word_manipulate.py
-- asks the LLM to do letter reversal and last-letter concatenation
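For reference, here is a minimal sketch of what an arithmetic example could look like. The prompt/answer format and the generate_arithmetic_example name are assumptions for illustration only; the actual format is whatever arithmetics_task.py produces.

    # Illustrative sketch only -- the real data format is defined in arithmetics_task.py.
    import random

    def generate_arithmetic_example(num_digits=5):
        # Sample two large positive numbers and build a prompt/answer pair.
        a = random.randint(10 ** (num_digits - 1), 10 ** num_digits - 1)
        b = random.randint(10 ** (num_digits - 1), 10 ** num_digits - 1)
        prompt = f"{a} + {b} ="
        answer = str(a + b)
        return prompt, answer

    if __name__ == "__main__":
        for _ in range(3):
            print(generate_arithmetic_example())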
Train:
python finetune.py --task arithmetics --use-mlp --lr 0.01 --decay-lr --min-lr 1e-3 --ckpt-name xxx
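As a rough illustration of what prompt-tuning with --use-mlp could mean here, below is a generic sketch of soft-prompt embeddings re-parameterized through a small MLP and prepended to a frozen GPT's input embeddings. All class and parameter names are assumptions; this is not the exact module in finetune.py.

    # Generic prompt-tuning sketch: trainable soft-prompt embeddings, optionally
    # passed through a small MLP, prepended to the frozen model's input embeddings.
    import torch
    import torch.nn as nn

    class SoftPrompt(nn.Module):
        def __init__(self, n_tokens=20, n_embd=768, use_mlp=True):
            super().__init__()
            self.prompt = nn.Parameter(torch.randn(n_tokens, n_embd) * 0.02)
            # Optional MLP re-parameterization, as a flag like --use-mlp might toggle.
            self.mlp = (
                nn.Sequential(nn.Linear(n_embd, n_embd), nn.Tanh(), nn.Linear(n_embd, n_embd))
                if use_mlp else nn.Identity()
            )

        def forward(self, input_embeds):
            # input_embeds: (batch, seq_len, n_embd) from the frozen GPT's token embeddings.
            batch = input_embeds.size(0)
            soft = self.mlp(self.prompt).unsqueeze(0).expand(batch, -1, -1)
            return torch.cat([soft, input_embeds], dim=1)

Only the soft-prompt parameters (and the MLP, if enabled) would be optimized; the GPT weights stay frozen.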
Eval:
python finetune.py --eval --task arithmetics --use-mlp --ckpt-name xxx
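For these toy tasks, the natural metric is exact-match accuracy on the generated answer. A minimal sketch of that idea is below; the helper name and interface are illustrative, not the metric code in finetune.py.

    # Illustrative exact-match accuracy over (prompt, answer) pairs.
    def exact_match_accuracy(pairs, generate_fn):
        # generate_fn(prompt) -> model completion string
        correct = sum(generate_fn(p).strip() == a.strip() for p, a in pairs)
        return correct / max(len(pairs), 1)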