You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Mehari is a software package for annotating VCF files with variant effect/consequence.
13
13
The program uses [hgvs-rs](https://crates.io/crates/hgvs) for projecting genomic variants to transcripts and proteins and thus has high prediction quality.
@@ -17,201 +17,22 @@ Other popular tools offering variant effect/consequence prediction include:
Mehari offers predictions that aim to mirror VariantValidator, the gold standard for HGVS variant descriptions.
20
+
Mehari offers HGVS predictions that aim to mirror VariantValidator, the gold standard for HGVS variant descriptions, and consequence predictions compatible with VEP.
21
21
Further, it is written in the Rust programming language and can be used as a library for users' Rust software.
22
22
23
-
## Supported Sequence Variant Frequency Databases
24
-
25
-
Mehari can import public sequence variant frequency databases.
26
-
The supported set slightly differs between import for GRCh37 and GRCh38.
To reduce compile times, we recommend using a pre-built version of `rocksdb`, either from the system package manager or e.g. via `conda`:
44
-
45
-
```bash
46
-
# Ubuntu
47
-
sudo apt-get install librocksdb-dev
48
-
49
-
# Conda
50
-
conda install -c conda-forge rocksdb
51
-
```
52
-
53
-
In either case, either add
54
-
```toml
55
-
[env]
56
-
ROCKSDB_LIB_DIR = "/usr/lib/"# in case of the system package manager, adjust the path accordingly for conda
57
-
SNAPPY_LIB_DIR = "/usr/lib/"# same as above
58
-
```
59
-
to `.cargo/config.toml` or set the environment variables `ROCKSDB_LIB_DIR` and `SNAPPY_LIB_DIR` to the appropriate paths:
60
-
61
-
```bash
62
-
export ROCKSDB_LIB_DIR=/usr/lib/
63
-
export SNAPPY_LIB_DIR=/usr/lib/
64
-
```
65
-
66
-
By default, the environment variables are defined in the `.cargo/config.toml` as described above, i.e. may need adjustments if not using the system package manager.
See [Getting Started](docs/getting_started.md) for more information on usage, and [Development Setup](docs/development.md) for more information on how to build mehari and its databases from scratch.
Copy file name to clipboardExpand all lines: docs/anno_seqvars.md
+5-4Lines changed: 5 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -22,16 +22,17 @@ Currently, Mehari will annotate variants using:
22
22
23
23
- The predicted impact on gene transcripts and the corresponding protein sequence (in the case of coding genes).
24
24
- Their frequency in the gnomAD exomes and genomes databases as well as the HelixMtDb database in the case of mitochondrial databases.
25
+
- Variant information from ClinVar, if any
25
26
26
27
## Command Line Invocation
27
28
28
-
You can invoke Mehari like this to annotate a VCF file `IN.vcf`to an output file `OUT.vcf` using the built (or downloaded) database as `path/to/db`.
29
+
You can invoke Mehari to annotate a VCF file `IN.vcf`creating an output file `OUT.vcf` using the built (or downloaded) databases – for example the transcript database – as follows:
29
30
30
31
```text
31
32
$ mehari annotate seqvars \
32
-
--path-db path/to/db \
33
-
--input-vcf IN.vcf \
34
-
--output-vcf OUT.vcf
33
+
--transcripts path/to/transcripts-db \
34
+
--path-input-vcf IN.vcf \
35
+
--path-output-vcf OUT.vcf
35
36
```
36
37
37
38
Note that the input and output files can optionally be gzip/bgzip compressed VCF files with suffixes (`.gz` or `.bgz`) or BCF files with suffix `.bcf`.
To reduce compile times, we recommend using a pre-built version of `rocksdb`, either from the system package manager or e.g. via `conda`:
3
+
4
+
```bash
5
+
# Ubuntu
6
+
sudo apt-get install librocksdb-dev
7
+
8
+
# Conda
9
+
conda install -c conda-forge rocksdb
10
+
```
11
+
12
+
In either case, either add
13
+
```toml
14
+
[env]
15
+
ROCKSDB_LIB_DIR = "/usr/lib/"# in case of the system package manager, adjust the path accordingly for conda
16
+
SNAPPY_LIB_DIR = "/usr/lib/"# same as above
17
+
```
18
+
to `.cargo/config.toml` or set the environment variables `ROCKSDB_LIB_DIR` and `SNAPPY_LIB_DIR` to the appropriate paths:
19
+
20
+
```bash
21
+
export ROCKSDB_LIB_DIR=/usr/lib/
22
+
export SNAPPY_LIB_DIR=/usr/lib/
23
+
```
24
+
25
+
By default, the environment variables are defined in the `.cargo/config.toml` as described above, i.e. may need adjustments if not using the system package manager.
Copy file name to clipboardExpand all lines: docs/index.md
+3-2Lines changed: 3 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -10,7 +10,8 @@ Why another software package?
10
10
library.
11
11
The latter serves as the basis for [VariantValidator.org](https://variantvalidator.org/) which is the gold standard for HGVS variant description generation and validation.
12
12
- Mehari is written in the Rust programming language which allows it to work fast, with low memory consumption (as a C++ program would) and being memory safe at the same time (as a Java/Python/Perl program would).
13
-
- As a Rust program, it can be embedded into the backend of the [VarFish](https://github.com/varfish-org/varfish-server) variant analysis platform.
13
+
- It can be used as a rust library, as is the case for e.g. the backend of the [VarFish](https://github.com/varfish-org/varfish-server) variant analysis platform
14
+
- Provides a REST API for sequence variant annotation (see `mehari server run --help`)
14
15
15
16
## What's Next?
16
17
@@ -27,4 +28,4 @@ We recommend to read the Mehari end-user documentation in the following order:
27
28
Since Mehari is written in the Rust programming language, we host the documentation on `docs.rs` written as Rust online documentation.
28
29
This has the advantage that the documentation is bundle with the program source code (and thus always up to date) and the latest documentation is always available at <https://docs.rs/mehari>.
29
30
30
-
The drawback is that the formatting of this may not be as end-user friendly as it could be but you will manage.
31
+
The drawback is that the formatting of this may not be as end-user friendly as it could be, but you will manage.
0 commit comments