A viral identification tool using machine learning with nucleotide and protein features
Report Bug
·
Request Feature
Table of Contents
This project was created to identify viral contigs in metagenomics. Add details about project and images
To get a local copy up and running follow these simple example steps.
VirBrant requires Python 3 and the following libraries (if installling through pip, libraries are automatically install)
- pandas
- scikit-learn
- biopython
VirBrant can only be forked from this repository. Once forked, enter into the files and compile the kmer counting program.
## git download here
cd VirBrant/scripts
tar xvf kmer-counter-master.zip
cd kmer-counter-master
make
VirBrant works as a python script. Once install via pip, VirBrant the command can be accessed. To get the help screen type:
VirBrant -h
The paramters of VirBrant are:
- -i: Input Fasta [required]
- -o: Output Directory [optional]
cd VirBrant
scripts/VirBrant.py -i data/Test/Viral_contigs.fasta -o data/Test/Output
Current Version: 0.0.1
Improvements to be made:
- Reduce feature space to allow for smaller file processing
- Grid search hyper parameters for models
- Build into python package
See the open issues for a list of proposed features (and known issues).
Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.
Distributed under the MIT License. See LICENSE
for more information.
Cody Glickman - @glickman_Cody - glickman.cody@gmail.com
Project Link: https://github.com/Strong-Lab/VirBrant
- James Costello
- Michael Strong
- Jo Hendrix