Schemas for the Data Working Group

The Global Alliance for Genomics and Health is an international coalition, formed to enable the sharing of genomic and clinical data.

The Data Working Group concentrates on data representation, storage, and analysis, including working with platform development partners and industry leaders to develop standards that will facilitate interoperability.

Each area of genomics and health has a dedicated team working to define those standards.

Reads Task Team

The Reads Task Team is focused on standards for accessing genomic read data -- collections of primary data collected from sequencing machines.

The team will deliver:

Data model. An abstract, mathematically complete and precise model of the data that is manipulated by the API. See the Avro directory for our in-progress work on defining v0.5 of the data model.
API Specification. A human-readable document introducing and defining the API, accompanied by a formal specification. See the documentation page for the published v0.1 API.
Reference Implementation. Open source working code demonstrating the API, ideally which can underpin real world working implementations.

Reference Variation Task Team

The Reference Variation Task Team is focused on standards for storing and accessing reference genome and variant data -- the results of analysis of primary data collected from sequencing machines.

File Formats Task Team

One small but essential part of this effort is the definition, standardisation, and improvement of basic file formats for sequence and variation data, and for associated infrastructure such as index formats.

These format specifications can be found in the samtools/hts-specs repository.

Metadata Task Team

The Metadata Task Team (MTT) concerns itself with data structures, attributes and values used to describe everything but the sequence. This includes metadata for individuals, samples, analyses, instrumentation a well as ontology representations for metadata. Naturally, the group interacts heavily with members of most other task teams and working groups.

MTT Wiki

Build Status

How to contribute changes

See the CONTRIBUTING.md documement.

License

See the LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 546 Commits
contrib		contrib
doc		doc
scripts		scripts
src/main/resources/avro		src/main/resources/avro
tests		tests
.gitignore		.gitignore
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Schemas for the Data Working Group

Reads Task Team

Reference Variation Task Team

File Formats Task Team

Metadata Task Team

Build Status

How to contribute changes

License

About

Releases

Packages

Languages

License

nlwashington/schemas

Folders and files

Latest commit

History

Repository files navigation

Schemas for the Data Working Group

Reads Task Team

Reference Variation Task Team

File Formats Task Team

Metadata Task Team

Build Status

How to contribute changes

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages