Skip to content

Background

Robert J. Gifford edited this page Oct 13, 2024 · 56 revisions

Contents


Overview


Parvovirus-GLUE is a specialized resource designed to facilitate comparative analysis of parvovirus genomes, built within the GLUE software framework.

Parvoviruses are a diverse group of small, non-enveloped DNA viruses that infect a wide variety of animal species, including many that are of significant medical and veterinary importance. The parvovirus family encompasses numerous pathogens affecting humans and domesticated animals, causing diseases such as canine parvoviral enteritis and parvovirus B19.

Beyond their role as pathogens, parvoviruses are also next-generation therapeutic tools. One group - the adeno-associated viruses (AAVs) - have been successfully developed as gene therapy vectors and are at the forefront of advancements in human gene therapy. In addition, rodent protoparvoviruses demonstrate natural oncotropism and oncolytic properties, making them promising candidates for cancer treatment.

Comparative genomics plays a crucial role in parvovirus research by enabling scientists to identify genetic variations across different parvovirus strains and species. This approach helps in understanding viral evolution, host specificity, and pathogenicity. By comparing the genomes of parvoviruses, researchers can uncover key genetic features that influence virulence, immune evasion, and tissue tropism—insights that are invaluable both for understanding disease mechanisms and for optimizing the design of gene therapy vectors or oncolytic agents.

The GLUE software framework provides an extensible platform for implementing comparative genomic analysis of viruses in an efficient, standardised and reproducible way. GLUE projects not only integrate key data items such as sequences, alignments, and genome feature annotations, but also establish complex semantic links between these data items using a relational database. This structure prepares sequences and their associated data for computational analysis, minimizing the need for labor-intensive pre-processing and enabling streamlined workflows.

Advances in Parvovirus Discovery


In recent years, high-throughput sequencing and new metagenomic methods have led to the discovery of numerous novel parvovirus species. Furthermore, progress in whole genome sequencing (WGS) has revealed that DNA sequences derived from parvoviruses commonly occur in animal genomes as endogenous viral elements.

Parvovirus EVEs Parvovirus EVEs

Some of the species in which novel parvoviruses and endogenous parvoviral elements (EPVs) have been identified.
Top row, left to right: Masai giraffe (Giraffa camelopardalis tippelskirchii), Tasmanian devil (Sarcophilus harrisii), elephants (Elephantidae), chinchilla (Chinchilla lanigera).
Bottom row, left to right: Northern fur seals (Callorhinus ursinus), pit vipers (Crotalinae), Leadbetter's possum (Gymnobelideus leadbeateri), Transcaucasian mole vole (Ellobius lutescens).

The abundance of sequence data from a diverse array of parvovirus species offers unparalleled opportunities to employ comparative approaches for exploring parvovirus biology.

Comparative Genomics and Parvovirus Research


Comparative genomics is a powerful approach for understanding the diversity, evolution, and functional biology of viruses. By comparing parvovirus genomes, researchers can gain insights into evolutionary history, host-virus interactions, and mechanisms of pathogenicity.

  1. Evolution & Phylogeny: Genome comparisons allow reconstruction of virus evolutionary history, and thereby facilitate understanding of parvovirus distribution and diversity.

  2. Host Range & Adaptation: Comparisons can identify viral traits for host specificity and adaptations that allow parvoviruses to infect new species.

  3. Tropism & Pathogenesis: By comparing virus strains, researchers can uncover genetic factors determining tissue specificity and pathogenicity.

  4. Immune Evasion: Identification of mutation hotspots and antigenic variation can inform vaccine development and understanding of viral evolution under immune pressure.

  5. Functional Genomics: Genomic comparisons can highlight functional elements and protein structure-function relationships critical for viral replication.

  6. Genomic Conservation: Conserved regions important for viral function are identified for drug or vaccine targeting, while variable regions help understand viral adaptation.

  7. Vaccine & Therapeutic Development: Identifying conserved elements aids in developing broad-spectrum vaccines, while optimizing gene therapy vectors (e.g., AAVs) through comparative analysis.

  8. Epidemiology: Genomic comparisons help track viral spread, detect emerging strains, and guide outbreak responses and vaccine updates.

  9. Paleovirology: EVEs provide a historical record of past infections. Comparisons of EVEs and virus genomes can reveal ancient viral genomes, revealing long-term viral evolution.

Parvovirus-GLUE and Comparative Genomics


Parvovirus-GLUE is designed to support large-scale comparative genomic analyses by providing standardized datasets and flexible analysis tools. These tools enable researchers to investigate parvovirus diversity, evolution, and biology across various species and strains. Key features include:

  1. Phylogenetic Analysis: Integration with RAXML allows users to construct phylogenetic trees from genomic data. Pre-configured alignments and reference sequences enable precise control over how alignment partitions are composed, while complex phylogenetic workflows can be streamlined using GLUE command files, making advanced analyses more accessible.

  2. Host Range, Tropism & Pathogenesis: Researchers can investigate host specificity, tissue tropism, and pathogenicity using standardized reference genomes. Parvovirus-GLUE allows comparative analysis across multiple isolates or strains, making it easier to identify genetic factors that drive these phenotypes.

  3. Immune Evasion & Vaccine Development: Users can explore mutation hotspots and conserved regions to study antigenic variation and immune evasion strategies. These insights are critical for designing vaccines or therapeutics targeting essential viral regions and for monitoring viral evolution in response to immune pressure.

  4. Genetic Variation Exploration: Parvovirus-GLUE offers a standardized yet flexible system for defining genome feature coordinates, enabling researchers to explore genetic variation at a high resolution. Commands for querying amino acid frequencies can be used in combination with genome feature coordinates and the Parvovirus-GLUE data model, allowing for stratified analyses using 'where' clauses to filter and organize sequence data.

  5. Functional Genomics: Parvovirus-GLUE provides data and tools for exploring parvovirus genetic variation in relation to linked information (evolutionary history, protein structure and function, host immune response). It thus provides a platform for identify and analysing functional elements, and shedding light on how specific mutations influence viral replication and pathogenicity.

  6. Variation Scanning: Parvovirus-GLUE provides a mechanism for defining sequence variations using regular expressions (REGEX) and scanning large datasets for the presence of these variations. This functionality is particularly useful for identifying specific mutations across consensus or reference sequences, as well as for analyzing deep sequencing data.

  7. Epidemiology & Outbreak Tracking: Parvovirus-GLUE supports the real-time tracking of viral spread and the emergence of new strains by comparing outbreak genomes with reference data. These comparisons can inform epidemiological studies, help manage outbreaks, and guide the development of updated vaccines.

  8. Customizable Analysis Pipelines: Researchers can customize datasets and analysis workflows for specific parvovirus genera or species. GLUE allows the scripting of multi-step processes such as alignment, feature annotation, and phylogeny construction, ensuring reproducibility and streamlining future analyses.

By offering these features, Parvovirus-GLUE provides researchers with the tools needed to address a wide range of comparative genomics questions, from evolutionary studies to epidemiological tracking, all within a unified and flexible framework.

Supporting Open Science and Collaboration


Parvovirus-GLUE addresses the challenges of low reproducibility and re-use in genomic research by providing an open, extensible resource for parvovirus and endogenous parvovirus (EPV) data. This resource is built upon a relational database framework, which preserves the semantic relationships between various data items, including:

  • EPV and virus genome sequences
  • Genome annotations
  • Multiple sequence alignments
  • Phylogenetic trees
  • Sequence-associated tabular data

This structured approach ensures that all data elements remain interconnected and easily accessible, enhancing the reproducibility of analyses conducted using Parvovirus-GLUE.

By hosting Parvovirus-GLUE on an online version control system (GitHub), we create a platform for collaborative efforts in mapping the diversity of parvoviruses and EPVs. This setup also facilitates broader utilization of evolution-related knowledge within the research community, promoting knowledge sharing and advancing parvovirus research.


Clone this wiki locally