LuckyMicrobe

Introduction

A pipeline to analyse 16S rDNA sequencing amplicon data.
Three softwares were used in this pipeline (Qiime2, Usearch and LEfSe) to investigate your pair-end 16S rDNA sequencing data, you can process taxonomy composition analysis , microbialcommunity diversity analysis and differential abundance analysis through it.
Before using this pipeline, we assume that you are very familiar with the principles of these softwares and clearly know what you really want to get. Reading the following guide carefully and start the journey of data analysis !

Guide for use

A flow diagram for this pipeline

Note: As is shown above, there are totally four algorithms to get the feature table and representative sequences in the Qiime2 and Usearch, we provide three batches of them here except the vsearch methods in the dotted box. It is worth mentioning that the "dada2" file in version 0.1 only contains dada2 algorithm path.

Environment configuration

We make and test the pipeline in operation system: CentOS Linux release 7.6.1810.
You must install these three softwares mentioned above, of course, you needn't install the LEfSe if you don't want to process a differential abundance analysis.
The specific installation methods are recorded in the file "Installation.txt".

Getting started

You can enter the "example" directory and run the "run.batch" file by entering the command "sh run.batch" to test if this pipeline is compatible with your environment, if not any mistake shows, congratulations, you successfully build an appropriate environment to run this pipeline!(It will cost about 100 minutes when use 40 cores.)
After testing the environment, what you just need to do is to prepare the data and metadata files and change the parameters in "Configuration.txt" file for your analysis according to the example, such as preparing pair-end sequences data which contain a suffix "_1.fq.gz" and its counterpartner file with a suffix "_2.fq.gz". Another point you must take care of is the value in absolute-filepath colum of the metadata file must be under the directory "prefix_merged" with a suffix ".MG.fq" just like the metadata file in "example" directory.
Note: To save your time, the diversity analysis and differential abundance analysis won't run automatically. The methods to run them:

For differential abundance analysis, you just need to delete the "#" before "diffAbunAnalysis" at the end of the file "run.batch" and make some small changes in the "lefse.batch".
For diversity analysis, firstly you should ensure the level you rarefy by viewing the visualized results on the "Qiime 2 View" website(https://view.qiime2.org/), then change the parameters in "Configuration.txt", finally delete the "#" before "diversityAnalysis" and add "#" before other two functions at the end of the file "run.batch".

The parameters in "configuration.txt" are very important that you need to understand them thoroughly and change them according your own need.

Citation

Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, Alexander H, Alm EJ, Arumugam M, Asnicar F, Bai Y, Bisanz JE, Bittinger K, Brejnrod A, Brislawn CJ, Brown CT, Callahan BJ, Caraballo-Rodríguez AM, Chase J, Cope EK, Da Silva R, Diener C, Dorrestein PC, Douglas GM, Durall DM, Duvallet C, Edwardson CF, Ernst M, Estaki M, Fouquier J, Gauglitz JM, Gibbons SM, Gibson DL, Gonzalez A, Gorlick K, Guo J, Hillmann B, Holmes S, Holste H, Huttenhower C, Huttley GA, Janssen S, Jarmusch AK, Jiang L, Kaehler BD, Kang KB, Keefe CR, Keim P, Kelley ST, Knights D, Koester I, Kosciolek T, Kreps J, Langille MGI, Lee J, Ley R, Liu YX, Loftfield E, Lozupone C, Maher M, Marotz C, Martin BD, McDonald D, McIver LJ, Melnik AV, Metcalf JL, Morgan SC, Morton JT, Naimey AT, Navas-Molina JA, Nothias LF, Orchanian SB, Pearson T, Peoples SL, Petras D, Preuss ML, Pruesse E, Rasmussen LB, Rivers A, Robeson MS, Rosenthal P, Segata N, Shaffer M, Shiffer A, Sinha R, Song SJ, Spear JR, Swafford AD, Thompson LR, Torres PJ, Trinh P, Tripathi A, Turnbaugh PJ, Ul-Hasan S, van der Hooft JJJ, Vargas F, Vázquez-Baeza Y, Vogtmann E, von Hippel M, Walters W, Wan Y, Wang M, Warren J, Weber KC, Williamson CHD, Willis AD, Xu ZZ, Zaneveld JR, Zhang Y, Zhu Q, Knight R, and Caporaso JG. 2019. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nature Biotechnology 37: 852–857. https://doi.org/10.1038/s41587-019-0209-9
R.C. Edgar (2010), Search and clustering orders of magnitude faster than BLAST, Bioinformatics 26(19) 2460-2461
Nicola Segata, Jacques Izard, Levi Walron, Dirk Gevers, Larisa Miropolsky, Wendy Garrett, Curtis Huttenhower."Metagenomic Biomarker Discovery and Explanation" Genome Biology, 2011 Jun 24;12(6):R60

Cantact us

If you encounter any question during the use of this pipeline, please contact us by email ouyjh6@mail2.sysu.edu.cn

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Help		Help
example		example
ref_data		ref_data
README.md		README.md
configuration.txt		configuration.txt
lefse.batch		lefse.batch
q2a_usearchMergePairs.batch		q2a_usearchMergePairs.batch
q2b_dada2.batch		q2b_dada2.batch
q2b_unoise3.batch		q2b_unoise3.batch
q2c_uparse.batch		q2c_uparse.batch
q2d_taxaClassify.batch		q2d_taxaClassify.batch
q2e_taxaFilterMitoChlo.batch		q2e_taxaFilterMitoChlo.batch
q2f_treeBuildFilter.batch		q2f_treeBuildFilter.batch
q2x_alphaDiversity.batch		q2x_alphaDiversity.batch
q2x_ancom.batch		q2x_ancom.batch
q2x_betaDiversity.batch		q2x_betaDiversity.batch
q2x_dataGroup.batch		q2x_dataGroup.batch
q2x_featuresFilter.batch		q2x_featuresFilter.batch
q2x_rarefy.batch		q2x_rarefy.batch
run.batch		run.batch

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LuckyMicrobe

Introduction

Guide for use

A flow diagram for this pipeline

Environment configuration

Getting started

Citation

Cantact us

About

Releases 4

Packages

Languages

Learnerhua/LuckyMicrobe-testing-

Folders and files

Latest commit

History

Repository files navigation

LuckyMicrobe

Introduction

Guide for use

A flow diagram for this pipeline

Environment configuration

Getting started

Citation

Cantact us

About

Resources

Stars

Watchers

Forks

Releases 4

Packages 0

Languages

Packages