Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models
This repository contains the code for our paper "Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models" by Wanyong Feng, Jaewook Lee, Hunter McNichols, Alexander Scarlatos, Digory Smith, Simon Woodhead, Nancy Otero Ornelas, and Andrew Lan. We explore a variety of approaches to distractor generation, including in-context learning (kNN), fine-tuning (FT), and chain-of-thought prompting (CoT), together with rule-based (RB) and sampling-based (SB) baselines. The paper was accepted to Findings of NAACL 2024.
For any questions, please email us or raise an issue.
Setup

Install the dependencies with pip:

pip install -r requirements.txt

Or create a conda environment:

conda env create --file enviornment.yml
In-context learning (kNN)

Retrieve nearest-neighbor in-context examples (encoding the question, answer, and feedback) and prompt GPT-3.5:

python run.py 'openAI.model=gpt-3.5-turbo-1106' 'prompt.type=distractor_and_answer_with_feedback' 'retriever.type=KNN' 'retriever.encodingPattern=q+a+f'
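As a rough illustration of the kNN retrieval step, the sketch below ranks a pool of candidate in-context examples by cosine similarity to the query encoding. The toy `vec` values stand in for real sentence embeddings of the question, answer, and feedback (the `q+a+f` pattern); all names here are hypothetical and do not reflect the repository's actual API.

```python
from math import sqrt

def cosine(u, v):
    # Cosine similarity between two equal-length embedding vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def knn_examples(query_vec, pool, k=3):
    """Return the k pool items whose embeddings are most similar to the query."""
    ranked = sorted(pool, key=lambda item: cosine(query_vec, item["vec"]), reverse=True)
    return ranked[:k]

# Toy embeddings standing in for encodings of question + answer + feedback.
pool = [
    {"id": "ex1", "vec": [1.0, 0.0]},
    {"id": "ex2", "vec": [0.7, 0.7]},
    {"id": "ex3", "vec": [0.0, 1.0]},
]
top = knn_examples([1.0, 0.1], pool, k=2)  # most similar examples first
```

The retrieved examples would then be placed into the few-shot prompt ahead of the target question.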
Zero-shot prompting

Prompt GPT-4 directly, without retrieved examples:

python run.py 'openAI.model=gpt-4-1106-preview' 'prompt.type=zero_shot' 'retriever.type=none'
Rule-based baseline (RB)

Select misconceptions, then generate distractors from them:

python misconception_selection.py
python run.py 'openAI.model=gpt-4-1106-preview' 'prompt.type=rule_based_selection' 'retriever.type=misconception_selection' 'retriever.encodingPattern=q+a+f' 'data.testFilepath=data/eedi_test_20_cleaned_4_18_misconceptions.csv'
Fine-tuning (FT) with OpenAI models

Prepare the fine-tuning data, run the fine-tuning job, then generate with the fine-tuned model:

python openai_finetune/data_processing.py
python openai_finetune/openai_finetune.py
python run.py 'dir_finetune_result.model_name=gpt_finetune' 'prompt.type=zero_shot' 'retriever.type=none'
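For reference, OpenAI chat fine-tuning expects a `.jsonl` file with one `{"messages": [...]}` object per line. The sketch below shows one plausible way to build such a record; the prompt wording and field layout are assumptions for illustration, not necessarily what `openai_finetune/data_processing.py` produces.

```python
import json

def to_finetune_record(question, answer, distractors):
    # Hypothetical prompt/response layout; the real one is defined in the repo.
    return {
        "messages": [
            {
                "role": "user",
                "content": f"Question: {question}\nCorrect answer: {answer}\nGenerate three distractors.",
            },
            {"role": "assistant", "content": "\n".join(distractors)},
        ]
    }

record = to_finetune_record("What is 3/4 + 1/8?", "7/8", ["4/12", "4/8", "1/2"])
line = json.dumps(record)  # one JSON object per line of the .jsonl training file
```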
Fine-tuning (FT) with Mistral

Prepare the data, train, run inference from a saved checkpoint, then generate with the fine-tuned model:

python mistral_finetune/data_processing.py
python mistral_finetune/train.py
python mistral_finetune/test.py --model_checkpoint <model_checkpoint_folder_path>
python run.py 'dir_finetune_result.model_name=mistral_finetune' 'prompt.type=zero_shot' 'retriever.type=none'
Sampling-based baseline (SB) with OpenAI models

Prepare the sampling data, fine-tune, post-process the sampled outputs, then generate:

python openai_finetune/sb_data_processing.py
python openai_finetune/openai_finetune.py --sb
python openai_finetune/post_sb_processing.py
python run.py 'dir_finetune_result.model_name=SB_sampling' 'prompt.type=zero_shot' 'retriever.type=none'
Sampling-based baseline (SB) with Mistral

Prepare the sampling data, train, run inference from a checkpoint, post-process the sampled outputs, then generate:

python mistral_finetune/sb_data_processing.py
python mistral_finetune/train.py --sb
python mistral_finetune/test.py --sb --model_checkpoint <model_checkpoint_folder_path>
python mistral_finetune/post_sb_processing.py <num_distractors>
python run.py 'dir_finetune_result.model_name=mistral_SB' 'prompt.type=zero_shot' 'retriever.type=none'
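The sampling-based post-processing presumably reduces many sampled generations to a fixed number of distractors. One simple way to do that, shown purely as an assumed sketch (the repository's actual logic lives in `post_sb_processing.py`), is to keep the most frequently sampled candidates:

```python
from collections import Counter

def select_top_distractors(samples, num_distractors=3):
    """Keep the num_distractors most frequently sampled distractor strings.

    This frequency-based selection is an assumption for illustration only.
    """
    counts = Counter(d.strip() for d in samples)
    return [d for d, _ in counts.most_common(num_distractors)]

# Six sampled generations collapse to the two most common candidates.
picked = select_top_distractors(["1/2", "4/8", "1/2", "3/8", "1/2", "4/8"], 2)
```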
Evaluation

Score a result file (pass the filename under analysis/ and the number of distractors to evaluate):

python evaluation.py analysis/<result_filename> <num_distractors>
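As a hedged illustration of what distractor evaluation can look like (the actual metrics are computed by `evaluation.py`), the sketch below measures how many gold distractors the generated set recovers, ignoring case and surrounding whitespace; the function name and the partial/exact split are assumptions, not the paper's exact metric definitions.

```python
def distractor_match(generated, gold):
    """Return (fraction of gold distractors recovered, exact-match flag)."""
    gen = {g.strip().lower() for g in generated}
    hits = sum(1 for d in gold if d.strip().lower() in gen)
    return hits / len(gold), hits == len(gold)

# Two of the three gold distractors appear among the generated ones.
partial, exact = distractor_match(["1/2", "4/8", "5/8"], ["1/2", "4/8", "3/8"])
```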
Citation

If you used our code or found this work useful, please cite us!
@inproceedings{feng-etal-2024-exploring,
    title = "Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models",
    author = "Feng, Wanyong  and
      Lee, Jaewook  and
      McNichols, Hunter  and
      Scarlatos, Alexander  and
      Smith, Digory  and
      Woodhead, Simon  and
      Ornelas, Nancy  and
      Lan, Andrew",
    editor = "Duh, Kevin  and
      Gomez, Helena  and
      Bethard, Steven",
    booktitle = "Findings of the Association for Computational Linguistics: NAACL 2024",
    month = jun,
    year = "2024",
    address = "Mexico City, Mexico",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-naacl.193",
    pages = "3067--3082",
}