GenoPipe: identifying the genotype of origin within (epi)genomic datasets

Olivia W Lang, Divyanshi Srivastava, B Franklin Pugh, William K M Lai

Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.

Department of Biochemistry & Molecular Biology, Pennsylvania State University, University Park, PA, 16801, USA.

Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA.

Cornell Institute of Biotechnology, Cornell University, Ithaca, NY 14850, USA.

🏠 GenoPipe Website Homepage 🏠

PMID : 37933851

Overview

Confidence in experimental results is critical for discovery. As the scale of data generation in genomics has grown exponentially, experimental error has likely kept pace despite the best efforts of many laboratories. Technical mistakes can and do occur at nearly every stage of a genomics assay (i.e., cell line contamination, reagent swapping, tube mislabelling, etc.) and are often difficult to identify post-execution. However, the DNA sequenced in genomic experiments contains certain markers (e.g., indels) encoded within and can often be ascertained forensically from experimental datasets. We developed the Genotype validation Pipeline (GenoPipe), a suite of heuristic tools that operate together directly on raw and aligned sequencing data from individual high-throughput sequencing experiments to characterize the underlying genome of the source material. We demonstrate how GenoPipe validates and rescues erroneously annotated experiments by identifying unique markers inherent to an organism’s genome (i.e., epitope insertions, gene deletions, and SNPs).

Name		Name	Last commit message	Last commit date
Latest commit History 299 Commits
DeletionID		DeletionID
EpitopeID		EpitopeID
StrainID		StrainID
paper		paper
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GenoPipe: identifying the genotype of origin within (epi)genomic datasets

Olivia W Lang, Divyanshi Srivastava, B Franklin Pugh, William K M Lai

🏠 GenoPipe Website Homepage 🏠

PMID : 37933851

Overview

About

Releases 2

Packages

Contributors 4

Languages

License

CEGRcode/GenoPipe

Folders and files

Latest commit

History

Repository files navigation

GenoPipe: identifying the genotype of origin within (epi)genomic datasets

Olivia W Lang, Divyanshi Srivastava, B Franklin Pugh, William K M Lai

🏠 GenoPipe Website Homepage 🏠

PMID : 37933851

Overview

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 4

Languages

Packages