Cassiopeia LFQ is an R script that uses knitr/LaTeX to automatically execute and report standard data analysis steps for label-free proteomics data, such as quality control, filtering and normalization, differential expression testing (using limma-trend), GSEA (for human and mouse proteomes), and other insightful & exploratory output (PCA, k-means clustering, etc.): Cassiopeia LFQ was developed to automatize the standard label-free quantification (LFQ) data analysis workflow at the Max Perutz Labs Mass Spectrometry Facility (Vienna). The script’s input is a “proteinGroups.txt” file as produced by database searching using the proteomic software platform MaxQuant, as well as manually specified parameters that dictate what Cassiopeia is doing. To get an overview of all features, check out the "example.pdf" which showcases a standard Cassiopeia analysis output. For more information and instructions on how to use it, please refer to the user guide in this repository.
- Markus Hartl
- Weiqiang Chen
- Changed colors in heatmaps to a gradient ranging from blue-yellow-red. Added additional heatmaps that show log2-transformed and centered intensities (i.e., row-mean=0)
- Highlighting of proteins in volcano and MA plots is now achieved via text. Figure legends were consequently removed.
- Added optional GSEA analysis for mouse and human on the basis of the limma DE-testing results. GSEA result tables will be stored in "GSEA folder" alongside "dotplots"". Due to space constraints, the pdf report only shows enrichment maps.
- Cassiopeia_LFQ was changed to read "contaminants.fasta" instead of previous con_table.txt.
- con_table.txt was removed. A contaminants.fasta was included. User can now use their own contaminant database named "contaminants.fasta"
- If batch is not NULL, a PCA plot after imputation and batch correction is now plotted to illustrate the effect of batch correction.