Skip to content

How to Use NucleicNet with Command Line

NucleicNet edited this page May 29, 2019 · 2 revisions

How to Use NucleicNet with Command Line

The NucleicNet works on protein atomic model(s) written in PDB file format. Further specification on the input PDB file can be found in Specification on PDB input files. Users can put PDB file(s) into the "GridData" Folder for their analysis. After which, run the following:

# Generate features for protein atomic models

bash command_GenerateFeature.sh

# Analyse on features by deep learning module

bash command_DeepLearningModule.sh

# Organise deep learning predictions into visualisable forms

bash command_AnalysePrediction.sh

The purpose of each python script called within the bash script are annotated.

Output

Major results are stored in the "Out" folder. Supposed our input PDB file of protein is called "GridData/0000.pdb", below outlines the purpose of the resultant output files:

  • "Out/0000_pymol.pse": This is a pymol session that reveal binding pockets of each RNA constituent (e.g. The 4 bases A/U/C/G and the backbone constituent P/R for phosphate and ribose). Users can open this file by "pymol Out/0000_pymol.pse" (See Fig. 3a-c)
  • "Out/0000_R_logo_RNACColor.png": Optional. If binding sites had been ascertained before as a RNA-protein complex PDB file, we can also call "NucleicNet_SequenceLogo_RNACcolor.py" to retrieve NucleicNet-predicted RNA binding specificity on each base location in form of a Sequence Logo diagram. Supposed the corresponding RNA-protein Complex is stored in "Control/0000.pdb" with RNA chain R, our "Out/0000_R_logo_RNACColor.png" then refers to NucleicNet-predicted Sequence Logo indexed by RNA residue on chain R. (See Fig. 3-4)

We also include scripts and data to reproduce our study on Argonautes (See "command_AnalyseGridPrediction.sh"):

  • "ExperimentalSequencing/RipSeq_HMMlogPDifference.png": Using the NucleicNet to score miRNA sequence for Ago Binding. The result is compared with IP-Seq data (*.txt) stored in the "ExperimentalSequencing" Folder. (See Fig 5a)
  • "ExperimentalSequencing/Knockdown_Relation_All_Positive_publication.png" and "ExperimentalSequencing/Knockdown_Relation_All_Negative_publication.png" : Using the NucleicNet to evaluate miRNA loading efficiency. The result is compared with experimental Knockdown level (*.csv) stored in the "ExperimentalSequencing" Folder. (See Fig 5b)