Skip to content

contains files to SU KB8024 project on transmembrane protein topology predictor

Notifications You must be signed in to change notification settings

amribeiror/protein_topol_predictor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Welcome to TopoPRED, a membrane protein topology predictor.

The main script for the predictor (TopoPRED.sh) contains the main commands.

Once run from the command line, TopoPRED.sh (1) generates the required folder structure, (2) runs a PSIBLAST to extract positional information for each amino acid residue in the query protein sequence(s) and (3) runs a linear SVM classifier using the Substitution Matrix information with a window size = 15; these are the parameters that yield more accurate results with a test dataset (confusion matrices can be found in the folder 'output').

For TopoPRED.sh to work, the query Fasta file **AND** the model SM_SVM_PSSMw15.pkl **AND** the python script SM_SVM_PSSMpredictor.py must be in the folder 'input'.

Additional scripts used to generate models for Random Forests (SM_RF_PSSM.py), Decision Trees (SM_DT_PSSM.py) and SVM using FM matrices (FM_SVM_PSSM.py) can be found in the folder 'scripts', but are not required for TopoPRED to work.

About

contains files to SU KB8024 project on transmembrane protein topology predictor

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published