tazro inutano ohta edited this page Jul 3, 2013 · 4 revisions

Quality control

DBCLS SRA sequencing quality control

The DBCLS SRA Metadata toolkit project provides the results of FastQC, one of the most popular quality control tools for high-throughput sequencing data.

We built this for the following reasons:

  • Failed sequencing data are also submitted
    • Because of grant restrictions or other reasons
    • Downloading data takes a long time, even if the file turns out to be broken
  • The number of reads, GC content, and other information are necessary for further analysis
    • Users perform QC before starting analysis anyway
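
As a rough illustration of how the read count and GC content mentioned above might be pulled out of a FastQC report, the sketch below parses the "Basic Statistics" module of a `fastqc_data.txt` file, the tab-separated text report that FastQC writes. The function name `parse_basic_statistics` and the sample report are hypothetical and made up for this example. The page does not state in which form the project delivers its FastQC results, so the use of `fastqc_data.txt` here is an assumption.

```python
def parse_basic_statistics(report_text):
    """Return the 'Basic Statistics' module of a fastqc_data.txt report as a dict."""
    stats = {}
    in_module = False
    for line in report_text.splitlines():
        if line.startswith(">>Basic Statistics"):
            in_module = True   # the module header carries the pass/warn/fail flag
            continue
        if not in_module:
            continue
        if line.startswith(">>END_MODULE"):
            break              # stop at the end of the Basic Statistics module
        if line.startswith("#") or not line.strip():
            continue           # skip the column header and blank lines
        key, _, value = line.partition("\t")
        stats[key] = value
    return stats


# Hypothetical report excerpt, invented for illustration (not real results).
SAMPLE_REPORT = "\n".join([
    "##FastQC\t0.10.1",
    ">>Basic Statistics\tpass",
    "#Measure\tValue",
    "Filename\texample.fastq",
    "Total Sequences\t250000",
    "Sequence length\t36",
    "%GC\t47",
    ">>END_MODULE",
    ">>Per base sequence quality\tpass",
    ">>END_MODULE",
])

stats = parse_basic_statistics(SAMPLE_REPORT)
total_reads = int(stats["Total Sequences"])  # number of reads
gc_percent = float(stats["%GC"])             # overall GC content in percent
print(total_reads, gc_percent)
```

Checking these two values before downloading a run is one way to spot an empty or obviously broken submission early. Which thresholds count as "failed" depends on the experiment and is left to the user.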

Analysis pipeline

Users will perform further analysis on their own analysis pipelines and computing platforms. ENA and DRA also provide analysis platforms for public data.

DDBJ Read Annotation Pipeline

"DDBJ Read Annotation Pipeline is a cloud-computing based analytical platform for next-generation sequencing data."

Users can upload local data files for analysis on the DDBJ supercomputing platform, and can also use DRA data. DDBJ also provides a Galaxy platform for secondary analysis. The primary analysis tools provided are as follows:

Reference Genome Mapping Tools

  • BLAT
  • Maq
  • bwa
  • SOAP
  • Bowtie
  • TopHat
  • Bowtie2

de novo Assembly

  • SOAPdenovo
  • ABySS
  • Velvet
  • Trinity

Mapping de novo Assembled Contigs to Reference Sequences

  • BLAT

EBI Galaxy

EBI provides a feature for importing files into a Galaxy pipeline. Click "Fastq files (galaxy)" in the table on the search result page to import the files and do further analysis.
