Sequence Data

Robert J. Gifford edited this page Oct 23, 2024 · 2 revisions

The sequence data in this project are organized into multiple distinct sources. Each source contains data in either GenBank XML or plain FASTA format. The name of a source indicates its data format: all GenBank XML sources contain 'ncbi' in their names.
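
The naming convention can be checked mechanically. The following is a minimal Python sketch, not part of GLUE itself: the function name `source_format` and the source name `fasta-example` are hypothetical, and it assumes that any source without 'ncbi' in its name holds plain FASTA data.

```python
def source_format(source_name: str) -> str:
    """Guess a source's data format from its name.

    Assumption: sources containing 'ncbi' hold GenBank XML,
    and every other source holds plain FASTA.
    """
    if "ncbi" in source_name:
        return "GenBank XML"
    return "FASTA"


# 'ncbi-refseqs' is a source name from this project;
# 'fasta-example' is a hypothetical name used for illustration only.
for name in ("ncbi-refseqs", "fasta-example"):
    print(name, "->", source_format(name))
```

A helper like this could be useful when scripting over the project's source directories, but the mapping should be confirmed against the actual sources before relying on it.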

GenBank XML files are imported into this project directly from NCBI GenBank using an appropriately configured version of GLUE's GenBank importer module. The core Hepadnavirus-GLUE project contains a single NCBI-derived source, ncbi-refseqs, which holds the 'master reference' genome sequence for each hepadnavirus species included in this project.