# Does protein pretrained language model facilitate the prediction of protein-ligand interaction?

A novel method that quantitatively assesses the contribution of protein pretrained language models (PLMs) to protein-ligand interaction (PLI) prediction.
## Project structure

```
├── AttentiveFP/       # GAT model for extracting drug features
├── data/              # PLI task datasets
├── models/            # PLMs
├── args.yaml          # Drug molecule parameters
├── config.py          # Configuration file for parameter settings
├── data_handler.py    # PLI data-processing utilities
├── main.py            # Main entry point
├── ot_metric          # Quantitative transfer metrics based on optimal transport (OT)
├── OTFRM              # OTFRM analysis
├── plotter.py         # Plotting utilities
├── README.md          # Readme file
├── requirements.txt   # Environment dependencies
├── train_test.py      # Engine for training and testing the model
└── utils.py           # Collection of utility functions
```
## Installation

```shell
conda create -n PLMPLI python==3.10.11
conda activate PLMPLI
cd PLM-PLI
pip install -r requirements.txt
```
## Data

Place the processed datasets for PDBbind, Kinase, and DUD-E in the `data/` directory. An example entry from the processed PDBbind dataset is shown below:

| PDB-ID | seq | rdkit_smiles | label | set |
|---|---|---|---|---|
| 11gs | PYTVV...GKQ | CC[C@@H](CSC[C@H]...C(=O)c1ccc(OCC(=O)O)c(Cl)c1Cl | 5.82 | train |
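As a minimal sketch of working with this schema (assuming the processed dataset is stored as a CSV with exactly the columns shown; the truncated `seq` and SMILES strings below are placeholders copied from the example row, not real values), the `set` column can be used to split entries into partitions:

```python
import pandas as pd
from io import StringIO

# Inline stand-in for a processed dataset file; in practice you would call
# pd.read_csv on the actual file placed under data/.
csv_text = """PDB-ID,seq,rdkit_smiles,label,set
11gs,PYTVV...GKQ,CC[C@@H](CSC[C@H]...C(=O)c1ccc(OCC(=O)O)c(Cl)c1Cl,5.82,train
"""
df = pd.read_csv(StringIO(csv_text))

# Partition rows by the `set` column (train/valid/test).
train_df = df[df["set"] == "train"]
print(train_df[["PDB-ID", "label"]])
```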
## Usage

Run `main.py` to fine-tune a pre-trained PLM on a downstream PLI prediction task. For example, to fine-tune with ProtTrans as the PLM on the PDBBind task:

```shell
python main.py --model_name=prottrans --task=PDBBind
```

For more input parameter settings, please refer to `config.py`.
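To sweep all three datasets with one PLM, a simple shell loop suffices. The task names below are assumptions taken from this README; check `config.py` for the exact accepted values. The loop only prints the commands as a dry run:

```shell
# Dry run: print one fine-tuning command per task.
# Task names (PDBBind, Kinase, DUD-E) are assumed to match config.py.
for task in PDBBind Kinase DUD-E; do
    echo "python main.py --model_name=prottrans --task=$task"
done
```

Remove the `echo` to actually launch each run sequentially.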
## License

The software may be used for teaching or not-for-profit research purposes only. Permission is required for any commercial use of the software.
## Citation

If you use our method in your research, please cite our paper:

```bibtex
@article{zhang2023protein,
  author  = {Zhang, Weihong and Hu, Fan and Li, Wang and Yin, Peng},
  title   = {Does protein pretrained language model facilitate the prediction of protein-ligand interaction?},
  journal = {Methods},
  year    = {2023},
}
```