Skip to content

jlab/qiita

 
 

Qiita (canonically pronounced cheetah)

Build Status Coverage Status

Advances in sequencing, proteomics, transcriptomics, metabolomics, and others are giving us new insights into the microbial world and dramatically improving our ability to understand their community composition and function at high resolution. These new technologies are generating vast amounts of data, even from a single study or sample, leading to challenges in storage, representation, analysis, and integration of the disparate data types. Qiita was designed to allow users address these new challenges by keeping track of multiple studies with multiple 'omics data. Additionally, Qiita is capable of supporting multiple analytical pipelines through a 3rd-party plugin system, allowing the user to have a single entry point for all their analyses. Qiita's main site provides database and compute resources to the global community, alleviating the technical burdens, such as familiarity with the command line or access to compute power, that are typically limiting for researchers studying microbial ecology.

Qiita is currently in production/stable status. We are very open to community contributions and feedback. If you're interested in contributing to Qiita, see CONTRIBUTING.md. If you'd like to report bugs or request features, you can do that in the Qiita issue tracker.

To install and configure your own Qiita server, see INSTALL.md. However, Qiita is not designed to be used locally but rather on a server, we therefore advise against installing your own version on a personal computer. Nevertheless, it can run just fine on a laptop or small computer for development and educational purposes. For example, for every single PR and release, we install Qiita from scratch as GitHub Actions, you can follow these steps.

For more specific details about Qiita's philosophy and design visit the Qiita main site tutorial.

Current features

  • Full study management: Create, delete, update samples in the sample and multiple preparation information files.
  • Upload files via direct drag & drop from the web interface or via scp from any server that allows these connections.
  • Study privacy management: Sandboxed -> Private -> Public.
  • Easy long-term sequence data deposition to the European Nucleotide Archive (ENA), part of the European Bioinformatics Institute (EBI) for private and public studies.
  • Raw data processing for Target Gene, Metagenomic, Metabolomic, Genome Isolates and BIOM files. NOTE: BIOM files can be added as new preparation files for downstream analyses; however, this cannot be made public in the system.
  • Basic downstream analyses using QIIME 2. Note that Qiita produces qza/qzv in the analytical steps but you can also convert non QIIME 2 artifacts.
  • Bulk download of studies and artifacts.
  • Basic study search in the study listing page.
  • Complex metadata search via redbiom.

For more detailed information visit the Qiita tutorial and the Qiita help.

Accepted raw files

  • Multiplexed SFF
  • Multiplexed FASTQ: forward, reverse (optional), and barcodes
  • Per sample FASTQ: forward and reverse (optional)
  • Multiplexed FASTA/qual files
  • Per sample FASTA, only for "Full Length Operon"

Packages

No packages published

Languages

  • Python 47.1%
  • Jupyter Notebook 32.3%
  • HTML 17.2%
  • JavaScript 1.7%
  • PLpgSQL 1.5%
  • Makefile 0.1%
  • Other 0.1%