Copy number variation heterogeneity reveals biological inconsistency in hierarchical cancer classifications

This research combines a large number of copy number variation (CNV) profiles with the hierarchical NCIt cancer classification system. It introduces several CNV-based distance and similarity measures and reveals biological inconsistencies between CNV patterns and the cancer classification system.

Installation

First, make sure that all dependencies are in place. The simplest way to do so is to use Anaconda.

You can create and activate an Anaconda environment called cnv_heterogeneity:

conda env create -f requirements.txt
conda activate cnv_heterogeneity

Running

python get_group_clusters.py

The script get_group_clusters.py takes the CNA profiles as input and outputs "group clusters": samples whose CNV profile distance is below a threshold are assigned to the same "group cluster".
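The threshold-based grouping described above can be illustrated with a minimal sketch. It is not the code in get_group_clusters.py: the Euclidean distance, the transitive (connected-component) grouping rule, the function names, the sample ids and the profile values are all assumptions made for illustration.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two equally long CNV feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def group_by_threshold(profiles, threshold, distance=euclidean):
    """Group sample ids whose profile distance is below `threshold`.

    Assumption: grouping is transitive, so two samples end up in the same
    group if a chain of below-threshold pairs connects them.
    """
    parent = {sid: sid for sid in profiles}

    def find(sid):
        # Follow parent links to the representative, compressing the path.
        while parent[sid] != sid:
            parent[sid] = parent[parent[sid]]
            sid = parent[sid]
        return sid

    for a, b in combinations(profiles, 2):
        if distance(profiles[a], profiles[b]) < threshold:
            parent[find(a)] = find(b)

    groups = {}
    for sid in profiles:
        groups.setdefault(find(sid), []).append(sid)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical profiles: one value per genomic bin.
profiles = {
    "s1": [0.0, 1.0, 0.0],
    "s2": [0.1, 0.9, 0.0],
    "s3": [1.0, 0.0, 1.0],
}
print(group_by_threshold(profiles, threshold=0.5))
# [['s1', 's2'], ['s3']]
```

With a stricter rule (for example, requiring every pair inside a group to be below the threshold) the groups can come out smaller, so the grouping rule is worth checking against the script itself.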

python group_clusters_analysis.py

The script group_clusters_analysis.py takes the "group clusters" produced in the previous step, applies a second clustering to the CNV feature matrix, and outputs the "group clusters" that share a similar CNV pattern.
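As a rough illustration of comparing "group clusters" by CNV pattern, the sketch below builds one feature vector per group cluster (the mean profile of its members) and reports pairs with high cosine similarity. This is a stand-in under stated assumptions, not the clustering used by group_clusters_analysis.py: the mean-profile features, the cosine measure, the cutoff and all names and values are hypothetical.

```python
import math
from itertools import combinations


def mean_profile(rows):
    """Column-wise mean of equally long feature vectors."""
    return [sum(col) / len(rows) for col in zip(*rows)]


def cosine_similarity(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def similar_group_clusters(group_clusters, profiles, min_similarity):
    """Return (cluster_a, cluster_b, similarity) for pairs at or above the cutoff."""
    # One feature-matrix row per group cluster: the mean of its members.
    features = {
        name: mean_profile([profiles[s] for s in members])
        for name, members in group_clusters.items()
    }
    pairs = []
    for a, b in combinations(sorted(features), 2):
        sim = cosine_similarity(features[a], features[b])
        if sim >= min_similarity:
            pairs.append((a, b, round(sim, 3)))
    return pairs


# Hypothetical sample profiles and group clusters.
profiles = {
    "s1": [1.0, 0.0],
    "s2": [1.0, 0.0],
    "s3": [2.0, 0.0],
    "s4": [0.0, 1.0],
}
group_clusters = {"g1": ["s1", "s2"], "g2": ["s3"], "g3": ["s4"]}
print(similar_group_clusters(group_clusters, profiles, min_similarity=0.9))
# [('g1', 'g2', 1.0)]
```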

By applying further analyses, such as examining the distribution of the original NCIt entities, we can systematically analyse the relationship between the biological facts of CNV and the NCIt cancer classification system.
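One way to examine the distribution of NCIt entities is to count, for each group cluster, what fraction of its samples carries each NCIt label. The sketch below assumes a hypothetical mapping from sample id to NCIt entity; the labels "entity_A" and "entity_B" are placeholders rather than real NCIt codes, and the function name is not taken from the repository.

```python
from collections import Counter


def ncit_distribution(group_clusters, sample_to_ncit):
    """For each group cluster, return the fraction of samples per NCIt entity."""
    distribution = {}
    for name, members in group_clusters.items():
        counts = Counter(sample_to_ncit[s] for s in members)
        total = sum(counts.values())
        distribution[name] = {entity: n / total for entity, n in counts.items()}
    return distribution


# Hypothetical group clusters and NCIt annotations.
group_clusters = {"g1": ["s1", "s2", "s3", "s4"], "g2": ["s5"]}
sample_to_ncit = {
    "s1": "entity_A",
    "s2": "entity_A",
    "s3": "entity_A",
    "s4": "entity_B",
    "s5": "entity_B",
}
print(ncit_distribution(group_clusters, sample_to_ncit))
# {'g1': {'entity_A': 0.75, 'entity_B': 0.25}, 'g2': {'entity_B': 1.0}}
```

A group cluster dominated by a single entity agrees with the classification, while one that mixes several entities points to samples that share a CNV pattern despite being classified differently.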

Visualization

Users can visualize the CNV profiles of specific biosamples in the Progenetix database by simply typing in their biosample ids.


ziyingyang96/cnv-heterogeneity
