Skip to content

Latest commit

 

History

History
80 lines (41 loc) · 4.54 KB

README.md

File metadata and controls

80 lines (41 loc) · 4.54 KB

Parvovirus-GLUE

Welcome to Parvovirus-GLUE, a resource for comparative genomic analysis of parvoviruses, developed using the GLUE software framework.

Parvoviruses (family Parvoviridae) are a diverse group of small, non-enveloped DNA viruses that infect a broad range of animal species. This family includes several pathogens of humans and domesticated animals, as well as viruses being developed for therapeutic uses.

GLUE is an open, integrated software toolkit designed for storing and interpreting sequence data. It supports the creation of bespoke projects, incorporating essential data items for comparative genomic analysis, such as sequences, multiple sequence alignments, genome feature annotations, and other associated data.

Projects are loaded into the GLUE "engine," forming a relational database that represents the semantic relationships between data items. This foundation supports systematic comparative analyses and the development of sequence-based resources.

Parvovirus-GLUE contains genome feature definitions, multiple sequence alignments, and annotated reference sequences for all parvovirus species included in the most recent ICTV report.

This Parvovirus-GLUE project can be extended with additional layers, openly available via GitHub, including Parvovirus-GLUE-EVE, which extends Parvovirus-GLUE through the incorporation of endogenous parvoviral elements.

Please see the User Guide for more details.

Table of Contents

Key Features

  • GLUE Framework Integration: Built on the GLUE software framework, Parvovirus-GLUE offers an extensible platform for efficient, standardized, and reproducible computational genomic analysis of parvoviruses.

  • Phylogenetic Structure: Sequence data in Parvovirus-GLUE is organized in a phylogenetically-structured manner, allowing users to explore evolutionary relationships easily.

  • Rich Annotations: Annotated reference sequences enable rigorous comparative genomic analysis related to conservation, adaptation, structural context, and genotype-to-phenotype associations.

  • Reproducibility: Ensures fully reproducible analyses through data standards and a relational database.

  • Reusable Data Objects: High-value data items such as multiple sequence alignments are prepared once and reused.

  • Validation: Enforces high data integrity through cross-validation.

  • Standardisation of Genomic Coordinates: All sequences use the coordinate space of a chosen reference sequence.

  • Predefined Reference Sequences: Includes fully-annotated reference sequences for parvovirus species.

  • Alignment Trees: Links alignments constructed at distinct taxonomic levels, maintaining a standardised coordinate system.

Installation

Please see the User Guide for installation instructions.

Usage

GLUE contains an interactive command line environment focused on the development and use of GLUE projects by bioinformaticians. This provides a range of productivity-oriented features such as automatic command completion, command history and interactive paging through tabular data.

For detailed instructions on how to use Parvovirus-GLUE for your comparative genomic analysis, refer to the GLUE's reference documentation.

Data Sources

Parvovirus-GLUE is constructed using data published in the NCBI Nucleotide database.

Contributing

We welcome contributions from the community! If you're interested in contributing to Parvovirus-GLUE, please review our Contribution Guidelines.

Contributor Covenant

License

The project is licensed under the GNU Affero General Public License v. 3.0

Contact

For questions, issues, or feedback, please open an issue on the GitHub repository.