Skip to content

Study basecall frequencies in VCF files from bacterial genome mapping

Notifications You must be signed in to change notification settings

davidhwyllie/BUGMIX

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 

Repository files navigation

BUGMIX

Allows storage and fast random access to read depths in VCF files derived from bacterial reference mapping.

BugMix includes four main classes:

  • vcfStore- parses VCF files, computing minor allele frequencies and storing them in on-disc in indexed sqlite databases, one per vcf file.

  • mixtureReader - extracts information from one vcfStore'd item. Various methods are available extracting per-base, per-gene, or per arbitrary position set etc.

  • multiMixtureReader - extracts information from multiple vcfStore'd items.

  • summaryStore - a helper class used by the mixtureReader and multiMixtureReader classes to persist summary information generated in an RDBMS. - this ensures that computations, which may be expensive, only occur once. - the data is accessible from the RDBMS level but the persistence is transparent if using the mixture & multiMixtureReader classes.

Use cases addressed with this component concern

  • the identification of mixed samples of TB.
  • identification of regions where mis-mapping may have occurred

About

Study basecall frequencies in VCF files from bacterial genome mapping

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published