This is official codebase for Ktrl+F Dataset construction pipeline.
- GPT-3.5
python ktrlf_2_1_target_selection_openai.py
--entity_evidence_cache_path ./dump/entity_evidence_dict.pickle
--input_path ./dump/1_query_generation.pickle
--openai_model_name gpt-3.5-turbo-0613
--openai_request_url https://api.openai.com/v1/chat/completions
--openai_api_key <openai-api-key>
--output_path ./dump/2_target_selection_chatgpt.pickle
- GPT-4
python ktrlf_2_1_target_selection_openai.py
--entity_evidence_cache_path ./dump/entity_evidence_dict.pickle
--input_path ./dump/2_target_selection_chatgpt.pickle
--openai_model_name gpt-4-0613
--openai_request_url https://api.openai.com/v1/chat/completions
--openai_api_key <openai-api-key>
--output_path ./dump/2_target_selection_gpt4.pickle
- Merge all results