This project adopts the technique from the paper "Deep Reinforcement Learning for Multiparameter Optimization in de novo Drug Design" to optimise the pIC50 value of molecules.
Outputs of some of the experiments are in the folder "past outputs"
To run the main program on the same data used for the best outputs (in the folder "past outputs/7July/clean_good_manual/"):
python Main.py
It is also possible to run the program on a custom set of lead molecules and/or fragments.
python Main.py fragment_molecules.smi lead_file.smi
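A .smi file is plain text with one SMILES string per line. The snippet below is a hypothetical sketch that writes a tiny lead file in that layout (the molecule choices are illustrative, not taken from the repository's data):

```python
# Hypothetical example: a .smi file holds one SMILES string per line.
lead_smiles = ["CCO", "c1ccccc1O", "CC(=O)Oc1ccccc1C(=O)O"]

with open("lead_file.smi", "w") as f:
    f.write("\n".join(lead_smiles) + "\n")
```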
Molecules that are generated during the process can be viewed by running:
python viewing_outputs.py -epoch epoch
where epoch is the number of the epoch to be viewed.
New molecules can also be generated from a saved generation model. For this, run:
python viewing_outputs.py -gen 1
In the run above, actions are sampled from the discrete distribution output by the actor (stochastic output). To take the maximum-probability action instead:
python viewing_outputs.py -gen 1 -stoch 0
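The difference between the two modes can be sketched as follows. This is a simplified stand-in (the function name is hypothetical; the actual actor is a Keras network defined in Modules/models.py):

```python
import numpy as np

rng = np.random.default_rng(0)

def choose_action(action_probs, stochastic=True):
    """Pick an action from the actor's discrete output distribution.

    stochastic=True  -> sample from the distribution (the default behaviour)
    stochastic=False -> take the maximum-probability action (-stoch 0)
    """
    action_probs = np.asarray(action_probs, dtype=float)
    if stochastic:
        return int(rng.choice(len(action_probs), p=action_probs))
    return int(np.argmax(action_probs))

probs = [0.1, 0.6, 0.3]
choose_action(probs, stochastic=False)  # always action 1
```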
Also, remember to set the appropriate file_path in the code. Please note that the best generation model has NOT been saved, but an equivalent model is present and can very well be used.
Either way, the output is as follows:
- Displays two columns of molecules as a PNG file. The first column contains the original lead molecules, while the second column contains the modified molecules.
- Displays a histogram of the pIC50 distributions of the lead molecules and the final output.
- Saves two CSV files: one containing a table of all the changed molecules, and one containing a table of all the molecules that were turned from inactive to active. These files are saved in the folder "past outputs".
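The two CSV files can be pictured with a small pandas sketch. The column names and the pIC50 activity threshold below are assumptions for illustration, not values taken from the code:

```python
import pandas as pd

ACTIVE_THRESHOLD = 6.0  # hypothetical pIC50 cutoff for "active"

# Hypothetical results table: lead molecules and their modified versions.
results = pd.DataFrame({
    "lead_smiles":     ["CCO", "c1ccccc1O", "CCN"],
    "modified_smiles": ["CCOC", "c1ccccc1OC", "CCNC"],
    "lead_pic50":      [5.2, 6.4, 4.9],
    "modified_pic50":  [6.8, 6.9, 5.1],
})

# All molecules that were changed by the agent.
changed = results[results["lead_smiles"] != results["modified_smiles"]]

# Molecules that crossed the activity threshold: inactive lead, active output.
activated = results[(results["lead_pic50"] < ACTIVE_THRESHOLD)
                    & (results["modified_pic50"] >= ACTIVE_THRESHOLD)]

changed.to_csv("changed_molecules.csv", index=False)
activated.to_csv("inactive_to_active.csv", index=False)
```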
Any global parameter can be changed by editing the file "Modules/global_parameters.py".
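Such a parameters file typically follows the module-level-constants pattern. A hypothetical sketch (the names below are illustrative, not the actual parameter names in Modules/global_parameters.py):

```python
# Hypothetical sketch of a global-parameters module: plain module-level
# constants that the rest of the code imports and reads.
EPOCHS = 100          # number of training epochs
BATCH_SIZE = 64       # molecules per training batch
LEARNING_RATE = 1e-4  # optimiser step size
```

Other modules would then use it via, e.g., `from Modules import global_parameters as gp` and read `gp.EPOCHS`.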
- Main.py: The main file. This has to be run for training.
- viewing_outputs.py: File to view outputs as described above.
- Show_Epoch.py: Reads and decodes generated molecules, used by viewing_outputs.py
- FMPO-Visualising the outputs.ipynb: Jupyter notebook used for testing parts of the code, as well as viewing outputs
- Files inside "Modules":
- build_encoding.py: Contains functions involved in building and saving encodings
- file_reader.py: Contains functions involved in reading .smi and .csv input files
- global_parameters.py: All global parameters can be set here
- models.py: Contains the architectures of the actor and critic networks
- mol_utils.py: Utility functions for handling molecules (e.g. breaking molecules into fragments)
- rewards.py: The predictive model is deployed here. Contains all functions pertaining to generating the rewards.
- similarity.py: Contains functions to calculate similarity coefficients: Tanimoto and Levenshtein (edit) distance
- training.py: Calculates the initial distribution and trains the actor and critic networks.
- tree.py: Implements the tree class along with btl (the "build tree from list" function)
- Padel.txt: Contains the output of the PaDEL descriptor software
- descriptors.csv: Stores the initial descriptors
- uneval_desc.csv: Rows of descriptors.csv that contain NaN values are re-evaluated and stored here
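The two coefficients computed in similarity.py can be sketched in pure Python. This is a simplified stand-in: the actual module likely works on RDKit fingerprints, whereas the Tanimoto function below takes plain sets of fingerprint bits:

```python
def tanimoto(a, b):
    """Tanimoto coefficient between two sets of fingerprint bits:
    |A intersect B| / |A union B|."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

def edit_distance(s, t):
    """Levenshtein (edit) distance via dynamic programming,
    keeping only the previous row of the DP table."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, 1):
        cur = [i]
        for j, ct in enumerate(t, 1):
            cur.append(min(prev[j] + 1,                  # deletion
                           cur[j - 1] + 1,               # insertion
                           prev[j - 1] + (cs != ct)))    # substitution
        prev = cur
    return prev[-1]

tanimoto({1, 2, 3}, {2, 3, 4})  # 0.5
edit_distance("CCO", "CCN")     # 1
```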
The following Python libraries are required to run it:
- rdkit
- numpy
- sklearn
- keras
- pandas
- bisect (part of the Python standard library)
- Levenshtein
- A backend for Keras, such as Theano, TensorFlow or CNTK
- xgboost