Skip to content

The goal of this project is to generate a linear regression model that accepts ChEMBL inhibitor data for a target of interest as input and produces inhibitor bioactivity predictions with respect to the specified target as output.

Notifications You must be signed in to change notification settings

cmk323/computational-drug-discovery-project

Repository files navigation

computational-drug-discovery-project

The goal of this project is to create a linear regression model that utilizes ChEMBL bioactivity data to generate inhibitor bioactivity predictions with respect to a specified target of interest. The test case shown here uses epidermal growth factor receptor (EGFR) as a target. This protein was selected as a target of interest due to its applications in cancer drug development.

to do

  • automate the target selection process (select ChEMBL ID with the most hits for IC50 activity data from search results)
  • test different regressors (random forest used as default)
  • implement a third "intermediate" bioactivity classifer in addition to active or inactive

About

The goal of this project is to generate a linear regression model that accepts ChEMBL inhibitor data for a target of interest as input and produces inhibitor bioactivity predictions with respect to the specified target as output.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published