Skip to content

Pipeline and shiny app for the downstreaming analysis of quantitative proteomic data

License

Notifications You must be signed in to change notification settings

SamueldelaCamaraFuentes/Downstreaming-analysis-workflow

Repository files navigation

Alt text

Shotgun Proteomics Downstreaming Analysis

In this repository, the downstreaming analysis code for quantitative proteomics analysis tool can be found.

In the present repository, we can find the quantitative proteomic data analysis software focused on analyzing both DDA and DIA label free and labelled TMT data from MaxQuant, MSFragger, Proteome Discoverer and DIA-NN computational platforms. Its functionalities include preprocessing of data, differential expression analysis, functional and protein interaction analysis, along with visualization of each of the aforementioned stages.

Additionally, in this repository, there is a pipeline for analyzing label-free and labelled quantitative proteomic data from several computational platforms such as MaxQuant, MSFragger, Proteome Discoverer and DIA-NN, whose files can be found in the "Pipeline" folder containing the main.R file, which drives the developed functionalities, and the files dataprocessing.R, quality_metrics.R, statisticalAnalysis.R, overviewfigures.R, enrichment.R, and Network.R, which contain the set of implemented functions. The set of files in the "Pipeline" folder serves the functions of the application in RStudio.


Requisites

  • "ProteinGroups.txt" file from MaxQuant. a test dataset can be found on "data" folder.
  • "combined_protein.tsv" file from MSFragger.
  • "report.pg.matrix.tsv" file along with "report.tsv" file from DIA-NN in the same directory.
  • Output file from Proteome Discoverer.

Outputs

  1. Quality metrics:
  • Boxplots
  • Dispersion plot
  • Scatter plots
  • Pre/post imputation plots
  • Histograms
  • Principal Component Analysis
  • Correlation plot
  • Q-Q plot
  1. Overview figures:
  • Volcano plot
  • Heatmap
  • Protein Intensity plot
  1. Enrichment:
  • Dotplot
  • Barplot
  • Manhattan plot
  1. Interactions:
  • Interaction networks
  1. Data:
  • data set after preprocessing
  • data set after statistical analysis
  • data set after functional analsysis.

Credits

Packages that make it work: dplyr, ggplot2, ggvenn, VIM, gplots, gprofiler2, igraph, plotly, limma, clusterprofiler, enrichplot, stringdb.

About

Pipeline and shiny app for the downstreaming analysis of quantitative proteomic data

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages