Skip to content

tjcadd2020/Diagnosis-for-CD

Repository files navigation

Binder

Diagnosis-for-CD

Microbial genes outperform species and SNVs as diagnostic markers for Crohn’s disease according to artificial intelligence analyses of multicohort fecal metagenomes

Our global metagenomic analysis unravels the multi-dimensional alterations of the microbial communities in CD, and identified microbial genes as robust diagnostic biomarkers across cohorts. These genes are functionally related to CD pathology. Future research on these genes may lead to an effective non-invasive diagnostic tool for CD.

Descriptions for scripts

1.Raw data process

Metagenomic sequencing data:

(1) Quality control  

(2) Taxonomic annotation and abundance estimation  

(3) Gene prediction and abundance estimation  

2.SNV Calling

MIDAS was used to perform microbial SNV annotation.
The WMS reads were mapped to the database for SNV calling.

3.Differential analysis with MMUPHin

Identifying differential signatures from multi-cohorts by correcting batch effects.

4.Alpha diversity

Calculating alpha diversity of microbiome in CD patients and controls.

5.Beta diversity

Calculating beta diversity of microbiome in CD patients and controls.

6.Model contruction with FNN

Feedforward neural network (FNN) was employed to construct the diagnostic model:

(1) Neuron network construction  

(2) Ten fold cross-validation  

(3) Model trainning   

(4) Model testing  

7.Feature evaluation with SHAP

The feature importance was evaluated with SHapley Additive exPlanations (SHAP) to explain the output of machine learning model.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published