Socratic Question Generation

Requirements

This codebase requires PyTorch and the Hugging Face libraries, both of which can be installed with pip.

Generate Invalid Questions

python generate_bad_questions.py
python classifiy_gen_questions.py
python filter_bad_questions.py
python create_preference_dataset.py

The generated invalid questions are stored in bad_questions.json. The classification results are stored in classifiy_gen_questions.json. The questions confirmed to be invalid are stored in valid_bad_questions.json. The preference dataset is built by create_preference_dataset.py; the repository's preference_data directory appears to be where it is written.
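To make the last step more concrete, here is a minimal sketch of how preference pairs might be assembled for preference optimization. It is not the logic of create_preference_dataset.py: the function name, the "prompt"/"chosen"/"rejected" field names, and the toy data are all assumptions chosen for illustration, and the real JSON files may be structured differently.

```python
import json


def build_preference_pairs(good_by_prompt, bad_by_prompt):
    """Pair each preferred question with each invalid question for the same prompt.

    Both arguments are hypothetical dicts mapping a prompt id to a list of questions.
    """
    pairs = []
    for prompt, good_questions in good_by_prompt.items():
        for chosen in good_questions:
            for rejected in bad_by_prompt.get(prompt, []):
                pairs.append(
                    {"prompt": prompt, "chosen": chosen, "rejected": rejected}
                )
    return pairs


# Toy, made-up data (not taken from the repository's files).
good = {"dialogue_1": ["What does your loop condition check?"]}
bad = {"dialogue_1": ["The bug is on line 3, right?", "What is Python?"]}

pairs = build_preference_pairs(good, bad)
print(json.dumps(pairs, indent=2))
```

Each record holds one preferred and one dispreferred response to the same prompt, which is the shape of data that preference-optimization methods such as DPO typically consume.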

Preference Optimization

python finetune/sft_dpo.py

The script accepts several arguments; run it with the -h flag to list them. The --sft flag selects standard fine-tuning (SFT), and the --dpo flag selects direct preference optimization (DPO).
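For readers unfamiliar with DPO, the per-example objective can be sketched in plain Python. This illustrates the standard DPO loss and is not the code in finetune/sft_dpo.py; the function name, the argument names, and the beta value of 0.1 are assumptions made for the example.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for a single (chosen, rejected) pair.

    Inputs are sequence log-probabilities under the policy being trained
    and under a frozen reference model (for example, the SFT checkpoint).
    """
    # How much more the policy favors "chosen" over "rejected" than the reference does.
    margin = (policy_chosen_logp - ref_chosen_logp) - (
        policy_rejected_logp - ref_rejected_logp
    )
    x = beta * margin
    # -log(sigmoid(x)), computed in a numerically stable way.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


# When the policy equals the reference, the margin is zero and the loss is log(2).
print(dpo_loss(-5.0, -7.0, -5.0, -7.0))
# When the policy prefers the chosen response more than the reference does, the loss is lower.
print(dpo_loss(-4.0, -9.0, -5.0, -7.0))
```

The loss decreases as the policy assigns relatively more probability to the chosen response and less to the rejected one, compared with the reference model.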

Some example commands include:

Standard Fine-Tuning

python finetune/sft_dpo.py --sft --base_model codellama/CodeLlama-7b-Instruct-hf --model_name codellama_sft_b2 --batch_size 2 --grad_accum_steps 32 --epochs 5

DPO

python finetune/sft_dpo.py --dpo --base_model codellama/CodeLlama-7b-Instruct-hf --model_name codellama_sft_b2 --pt_model_name codellama_sft_b2 --batch_size 1 --grad_accum_steps 64 --epochs 2
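The two example commands use different --batch_size and --grad_accum_steps values, but they work out to the same effective batch size. The short calculation below assumes that --grad_accum_steps means conventional gradient accumulation and that training runs on a single device; the helper function is hypothetical and not part of the repository.

```python
def effective_batch_size(batch_size, grad_accum_steps, num_devices=1):
    """Number of examples contributing to each optimizer step."""
    return batch_size * grad_accum_steps * num_devices


# Values taken from the example commands above.
sft = effective_batch_size(batch_size=2, grad_accum_steps=32)
dpo = effective_batch_size(batch_size=1, grad_accum_steps=64)
print(sft, dpo)  # 64 64
```

A smaller per-step batch for DPO is a common choice because DPO keeps a reference model in memory alongside the policy; whether that is the reason here is not stated in this README.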

Generate

python finetune/sft_dpo.py --generate --model_name codellama_sft_b2_dpo --pt_model_name codellama_sft_b2 --decoding greedy

Llama Zero-Shot Experiments

python zero-shot_llama_prompt.py

Pass --prompt cot to use chain-of-thought prompting.

Evaluate

python evaluate.py --result_file <path_to_results_file>

The script also accepts two optional arguments, --zero True and --cot True, for evaluating the Llama zero-shot and chain-of-thought (CoT) results, respectively.

umass-ml4ed/socratic-quest-gen
