This program trains a model that identifies biological relations between entities using distant supervision. The pubtator annotated files are loaded from https://zenodo.org/record/1243969#.W7_xsRNKiu4 . The HIV-1 Human Interaction database is used for the distant supervision. https://www.ncbi.nlm.nih.gov/genome/viruses/retroviruses/hiv-1/interactions/
Virtual Environment is recommended to be built using requirements.txt
git clone (Link to this repository)
virtualenv <VE_PATH_NAME>
source <VE_PATH_NAME>/bin/activate
cd <PATH TO THIS REPOSITORY ON LOCAL MACHINE>
pip install -r requirements.txt
Most important libraries
TensorFlow 1.5
NumPy 1.14
python relation_extraction_pubtator.py <MODE> <PUBTATOR_FILE> <DIRECTIONAL_DIRECTORY> <SYMMETRIC_DIRECTORY> <ENTITY_1_COLUMN> <ENTITY_2_COLUMN> <RELATION_COLUMN> <ENTITY_1> <ENTITY_2>
python relation_extraction_pubtator.py PREDICT <MODEL_DIRECTORY> <PUBTATOR_FILE> <ENTITY_1> <ENTITY_2> <RECURRENT_BOOL>