- notebooks_data_prep: Contains notebooks used to clean and prepare the data
- notebooks_llm: Contains notebooks used for LLM experiments
- data: Contains the raw data, processed data, and data dictionary
- (Optional) Paper pdfs are put into this gdrive folder.
- Each paper is named using its
id
- If using rclone, download with
rclone ls gdrive:data/public_data/mdcalc-papers --drive-shared-with-me
- To download a single paper,
rclone copy gdrive:data/public_data/mdcalc-papers/10129.pdf . --drive-shared-with-me
- Each paper is named using its
- (Optional) Paper pdfs are put into this gdrive folder.
- Raw data is contained as a dataframe in the data/data_clean.pkl file.
- A semi-complete data dictionary is given in data/data_clean_dictionary.json.