A project to automatically extract relevant information from abstracts of clinical studies about psychedelic treatments of psychiatric conditions. This includes named entity recognition and single-/multilabel classification via fine-tuned BERT-based models.
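The difference between single-label and multilabel classification comes down to how the model's output logits are turned into labels. The following is a minimal sketch in plain Python, not the code used in this repository: the label names, the logits and the 0.5 threshold are hypothetical, and the function names are made up for illustration.

```python
import math


def sigmoid(x: float) -> float:
    """Map a logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_single_label(logits: list[float], labels: list[str]) -> str:
    """Single-label: exactly one class wins, the one with the highest logit."""
    best = max(range(len(logits)), key=lambda i: logits[i])
    return labels[best]


def decode_multilabel(
    logits: list[float], labels: list[str], threshold: float = 0.5
) -> list[str]:
    """Multilabel: each class is scored independently with a sigmoid;
    every class at or above the threshold is kept (possibly none)."""
    return [
        label
        for logit, label in zip(logits, labels)
        if sigmoid(logit) >= threshold
    ]


# Hypothetical labels and logits, for illustration only.
labels = ["LSD", "Psilocybin", "MDMA"]
logits = [2.0, 0.3, -1.5]

single = decode_single_label(logits, labels)
multi = decode_multilabel(logits, labels)
```

With these made-up values, the single-label decoder returns one class, while the multilabel decoder can return several (or none) for the same abstract.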
This repository includes two submodules:
|-- PsyNamic-Webapp: the webapp written in Dash
|-- PsyNamic-Prodigy: the dockerized Prodigy setup used for annotating data
Either clone this repository and its submodules in one go:
git clone --recurse-submodules git@github.com:Ineichen-Group/PsyNamic.git
or clone the repositories separately:
git clone git@github.com:Ineichen-Group/PsyNamic.git
git clone git@github.com:Ineichen-Group/PsyNamic-Prodigy.git
git clone git@github.com:Ineichen-Group/PsyNamic-Webapp.git
Check out the README of the PsyNamic-Webapp repo: https://github.com/Ineichen-Group/PsyNamic-Webapp/README.md
Check out the Makefile.
Keeping the reference up to date
- Work within the submodules, commit and push
- Update the references in the parent repository PsyNamic
git submodule update --remote
git add PsyNamic-Prodigy PsyNamic-Webapp
git commit -m "Updated submodule references to the latest commit"
git push origin main
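Before committing updated references, it can help to see which submodules actually point at a different commit than the one recorded in the parent repository. The sketch below parses the output of `git submodule status`, where a leading `+` marks a submodule whose checked-out commit differs from the recorded one, `-` an uninitialized submodule, and `U` a merge conflict. The helper names are hypothetical and not part of this repository; paths containing spaces are not handled.

```python
import subprocess

# Meaning of the one-character prefix in `git submodule status` output.
_STATES = {
    " ": "up to date",
    "+": "changed",
    "-": "not initialized",
    "U": "merge conflict",
}


def parse_submodule_status(output: str) -> dict[str, str]:
    """Map each submodule path to a human-readable state.

    Each line looks like '<prefix><sha> <path> (<describe>)'.
    The output must not be stripped, since the prefix may be a space.
    """
    result: dict[str, str] = {}
    for line in output.splitlines():
        if not line.strip():
            continue
        prefix, rest = line[0], line[1:]
        parts = rest.split()
        if len(parts) < 2:
            continue  # not a status line we understand
        result[parts[1]] = _STATES.get(prefix, "unknown")
    return result


def submodule_status(repo_dir: str = ".") -> dict[str, str]:
    """Run `git submodule status` in repo_dir and parse its output."""
    completed = subprocess.run(
        ["git", "submodule", "status"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_submodule_status(completed.stdout)


# Example with made-up commit hashes.
sample = (
    " 1111111 PsyNamic-Webapp (heads/main)\n"
    "+2222222 PsyNamic-Prodigy (heads/main)\n"
)
states = parse_submodule_status(sample)
```

Calling `submodule_status()` from the root of the parent repository after `git submodule update --remote` should list as "changed" exactly those submodules that need to be added and committed.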
- Anything related to the data and the annotation process is in the data folder
  - data/raw_data contains the raw data from the literature search with ASReview and David's relevant/irrelevant classification
  - data/prepared_data contains the annotated data, cleaned and ready for training
  - data/prodigy_inputs and data/prodigy_exports contain the unannotated Prodigy input and the annotated Prodigy output
  - data/iaa contains the inter-annotator agreement data
  - data/prediction_data contains the unannotated data for prediction and the whole pipeline for pulling the newest research from PubMed
  - it also contains all scripts to process the data #TODO: adjust paths; I moved some scripts around
- models contains
  - the scripts for training and feeding data while training
  - performance evaluation and plots --> the actual trained models are on the server
- PsyNamic-Webapp contains the webapp for the living systematic review
- PsyNamic-Prodigy contains the Prodigy setup for the annotation process
- test contains some out-of-date test cases for the data processing
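As an illustration of what inter-annotator agreement data can be used for, here is a minimal sketch of Cohen's kappa for two annotators. Which agreement metric this project actually uses is not stated here, so the choice of Cohen's kappa, the function name and the example labels are all assumptions.

```python
from collections import Counter


def cohens_kappa(annotator_a: list[str], annotator_b: list[str]) -> float:
    """Cohen's kappa: agreement between two annotators, corrected for chance.

    kappa = (p_observed - p_expected) / (1 - p_expected)
    """
    if len(annotator_a) != len(annotator_b) or not annotator_a:
        raise ValueError("Both annotators must label the same non-empty set of items")

    n = len(annotator_a)
    # Fraction of items on which both annotators chose the same label.
    p_observed = sum(a == b for a, b in zip(annotator_a, annotator_b)) / n

    # Agreement expected by chance, from each annotator's label frequencies.
    counts_a = Counter(annotator_a)
    counts_b = Counter(annotator_b)
    p_expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)

    if p_expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical annotations for four abstracts.
a = ["relevant", "relevant", "irrelevant", "irrelevant"]
b = ["relevant", "irrelevant", "irrelevant", "irrelevant"]
kappa = cohens_kappa(a, b)
```

In this made-up example the annotators agree on 3 of 4 items (0.75) while chance agreement is 0.5, which gives a kappa of 0.5.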