This repository serves as a template for machine learning on high-throughput expression and solubility data.
The machine learning models have been implemented in Python in the form of IPython notebooks. The workflow is run in the following order:
- create_feature_matrix.ipynb
- classification_workflow.ipynb
- retrospective_analysis.ipynb
A reduced workflow was implemented for solubility data in the solubility subdirectory.
All information presented in this Git Repository is strictly for academic purposes.