va-spark

va-spark is a scalable, high-performance toolkit for the analysis, annotation, and prioritization of genomic variants.

Introduction

va-spark was created by the software development team at Vinbigdata's Biomedical Information center. It leverages Spark parallelism to speed up the data processing of genomic annotation tools such as VEP, ANNOVAR, and SnpEff. Its simple architecture makes integrating these tools with Spark easy and effective, and the results of the integration are remarkable. The architecture of the VEP integration is shown in the following figure:

va-spark integration flow
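
The general idea behind this kind of integration can be sketched in a few lines of plain Python. This is not va-spark's code: it only illustrates the split, annotate, and merge pattern, with a thread pool standing in for Spark executors and a hypothetical `annotate_chunk` function standing in for an external annotator such as VEP. The function names and the `ANN=placeholder` value are assumptions made for the example. In Spark, the per-chunk step would typically be expressed over partitions, for instance with `RDD.mapPartitions` or `RDD.pipe`.

```python
from concurrent.futures import ThreadPoolExecutor


def split_vcf(lines, n_chunks):
    """Separate '#'-prefixed header lines from records and chunk the records."""
    header = [line for line in lines if line.startswith("#")]
    records = [line for line in lines if not line.startswith("#")]
    size = max(1, -(-len(records) // n_chunks))  # ceiling division
    chunks = [records[i:i + size] for i in range(0, len(records), size)]
    return header, chunks


def annotate_chunk(chunk):
    """Hypothetical stand-in for an external annotator run on one chunk."""
    annotated = []
    for line in chunk:
        fields = line.split("\t")
        # INFO is the 8th VCF column; append a placeholder annotation to it.
        if fields[7] == ".":
            fields[7] = "ANN=placeholder"
        else:
            fields[7] += ";ANN=placeholder"
        annotated.append("\t".join(fields))
    return annotated


def annotate_parallel(lines, n_chunks=4):
    """Annotate chunks concurrently and merge them back in input order."""
    header, chunks = split_vcf(lines, n_chunks)
    with ThreadPoolExecutor(max_workers=n_chunks) as pool:
        results = list(pool.map(annotate_chunk, chunks))  # map preserves order
    return header + [line for chunk in results for line in chunk]
```

Because each record is annotated independently of the others, the chunks can be processed in any order and on any worker, which is what makes this workload a natural fit for Spark's partition-level parallelism.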

Table of contents