A comprehensive survey of computational methods for multi-omic integration in cancer research.
Unsupervised multi-omic integration algorithms benchmarked across 5 data modalities on TCGA breast cancer, validated on TCAG holdout set and the transNEO cohort.
# Clone and restore environment
git clone https://github.com/sionaris/MultiOmicsSurvey.git
renv::restore()Study data can be found in the paper supplement and Zenodo
⚠️ renv notes — Packages not in lockfile
The following packages are not included in renv.lock and must be installed manually if needed:
| Package | Reason | Required for |
|---|---|---|
peakRAM |
HPC-only dependency | Benchmark memory profiling |
Rmosek |
Requires MOSEK license | wMKL, KLIC |
wMKL |
Custom Bioconductor install with C++ code edit |
wMKL algorithm |
devtools |
Development tool | Package installation from GitHub |
📁 Scripts — Analysis code
automated_scripts/— Helper functions (sourced by other scripts)Download_TCGA_data.R— Downloads and preprocesses TCGA-BRCAtransNEO_pseudocount_determination.R— RNA-seq normalisation for transNEOMOVICS/— Baseline MOVICS analysissingle_algorithm/— Individual method runs (+HPC scripts)method_comparisons/— Cross-method comparisons & benchmarksSurvival analysis/— Kaplan-Meier and Cox regressionConsensus/— Consensus clustering pipelines
📁 Python — Python-based methods
MOFA/— Multi-Omics Factor AnalysisMONET/— Multi Omic clustering by Non-Exhaustive TypesMSNE/— Multiple Similarity Network Embedding
📁 Resources — Supporting data
HPC output/— Results from cluster computing runs (individual runs)Pathways/— Gene sets for enrichment analysisTCGA/— Clinical data for TCGAtransNEO/— Validation cohort (paper)Performance/— Benchmark input data
📁 Results — Output files
single_algorithm/— Individual method outputsComparisons/— Cross-method comparison plotsPerformance_benchmarks/— Benchmark experiments (robustness, stability, scalability)Consensus/— Consensus clustering resultsSurvival_evaluations/— Survival analysis outputs
📁 docs — Documentation
R/function_documentation.pdf— Core function referenceR/benchmark_function_documentation.pdf— Benchmark analysis functionsPython/function_documentation.pdf— Custom Python functions for MONET and MSNE
| Category | Algorithms |
|---|---|
| Similarity Network | ab-SNF, ANF, MDICC, MSNE, NEMO, RWR-F, RWR-NF, SNF, Spectrum |
| Multiple Kernel Learning | CIMLR, KLIC, wMKL |
| Matrix Factorisation | MOFA, LRAcluster, MFA |
| Graph-based | MONET |
| Bayesian | iClusterBayes |
| Consensus | COCA |
Publication forthcoming
Apache 2.0