Package: variantspark
Type: Package
Title: A 'Sparklyr' Extension for 'VariantSpark'
Version: 0.1.1
Authors@R: c(
    person("Samuel", "Macêdo", email = "samuelmacedo@recife.ifpe.edu.br",
           role = c("aut", "cre")),
    person("Javier", "Luraschi", email = "javier@rstudio.com", role = "aut")
    )
Maintainer: Samuel Macêdo <samuelmacedo@recife.ifpe.edu.br>
Description: This is a 'sparklyr' extension integrating 'VariantSpark' and R. 'VariantSpark' is
    a framework based on 'Scala' and 'Spark' for analyzing genome datasets,
    see <https://bioinformatics.csiro.au/>. It was tested on datasets with 3000 samples,
    each containing 80 million features, in both unsupervised clustering approaches
    and supervised applications such as classification and regression. Genome datasets
    are usually written in VCF (Variant Call Format), a text file format used
    in bioinformatics for storing gene sequence variations. 'VariantSpark' is therefore
    well suited to genome research, because it can read VCF files, run analyses, and return
    the output as a 'Spark' data frame.
License: Apache License 2.0 | file LICENSE
LazyData: true
Imports:
    sparklyr (>= 1.0.1)
RoxygenNote: 6.1.1
Suggests:
    testthat
Encoding: UTF-8
Roxygen: list(markdown = TRUE)