Commit

Extract drug half-life and other attributes (#3)
Merges #3

Extract half-life info of drugs.
Add curated half-life of drugs.
Try models for predicting half-life
yhao-compbio authored and dhimmel committed May 17, 2019
1 parent 7b94454 commit 6b9ae38
Showing 7 changed files with 10,139 additions and 0 deletions.
6 changes: 6 additions & 0 deletions README.md
@@ -14,6 +14,12 @@ This repository contains several code and data components:

+ `pubchem-map.ipynb` -- DrugBank compounds were mapped to [PubChem](https://pubchem.ncbi.nlm.nih.gov/search/) based on exact InChI string matches. The mapping is available as a [tsv file](data/pubchem-mapping.tsv).

+ `parse-halflife.ipynb` -- extracts half-life and structural information from the DrugBank XML download into a [tsv file](data/drugbank_halflife.tsv) where each row represents a drug. DrugBank lists half-life as free text, so the numeric values were manually extracted into an [xlsx file](data/drugbank_halflife_curated.xlsx). All values were converted to hours. If DrugBank listed a time range (e.g. a ~ b), the average was used (e.g. (a + b)/2).

+ `extract-curated-halflife.ipynb` -- extracts the subset of drugs with a curated half-life into a [tsv file](data/drugbank_subset_halflife_curated.tsv) where each row represents a drug.

+ `predict-halflife.ipynb` -- builds supervised learning models to predict half-life based on structural properties of drugs.

## License

DrugBank content and derivatives are licensed under [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/ "Creative Commons Attribution-NonCommercial 4.0 International"). Original content is released as [CC0 1.0](https://creativecommons.org/publicdomain/zero/1.0/ "CC0 1.0 Universal: Public Domain Dedication").
7,848 changes: 7,848 additions & 0 deletions data/drugbank_halflife.tsv

Large diffs are not rendered by default.

Binary file added data/drugbank_halflife_curated.xlsx
Binary file not shown.
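
The README change above describes the curation rule only in prose: convert every half-life to hours, and replace a range `a ~ b` with `(a + b)/2`. A minimal sketch of that rule follows. The curation in this commit was done by hand in a spreadsheet, so this is not the authors' code; the function name `halflife_to_hours`, the accepted unit names, and the accepted range separators are all assumptions made for illustration.

```python
import re

# Conversion factors to hours. The set of supported units is an assumption;
# the commit does not list which units appear in DrugBank's free text.
HOURS_PER_UNIT = {
    "second": 1 / 3600,
    "minute": 1 / 60,
    "hour": 1.0,
    "day": 24.0,
    "week": 168.0,
}

# Matches "<low> [~|-|to <high>] <unit>[s]", e.g. "2 ~ 4 hours" or "30 minutes".
# This simple grammar is hypothetical; real DrugBank text is far messier.
_PATTERN = re.compile(
    r"^\s*(?P<low>\d+(?:\.\d+)?)"
    r"(?:\s*(?:~|-|to)\s*(?P<high>\d+(?:\.\d+)?))?"
    r"\s*(?P<unit>[a-z]+?)s?\s*$",
    re.IGNORECASE,
)


def halflife_to_hours(text):
    """Return the half-life in hours, averaging the endpoints of a range."""
    match = _PATTERN.match(text)
    if match is None:
        raise ValueError(f"unrecognized half-life text: {text!r}")
    unit = match.group("unit").lower()
    if unit not in HOURS_PER_UNIT:
        raise ValueError(f"unsupported unit: {unit!r}")
    low = float(match.group("low"))
    # A single value is treated as a range whose endpoints coincide.
    high = float(match.group("high")) if match.group("high") else low
    return (low + high) / 2 * HOURS_PER_UNIT[unit]


if __name__ == "__main__":
    for example in ["2 ~ 4 hours", "30 minutes", "1 - 3 days"]:
        print(example, "->", halflife_to_hours(example))
```

Text that does not fit the pattern raises `ValueError` rather than returning a guess, which keeps entries that still need manual review visible.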