This repository contains the scripts and data necessary to reproduce our benchmarking experiment results. Please note that you may need to modify some paths to reproduce the results on your machine.
-
Euka Experiment:
- Script:
./run_euka.sh
- Description: Runs the euka experiment.
- Script:
-
HaploCart Experiment:
- Script:
./run_hc.sh
- Description: Runs the HaploCart experiment.
- Script:
-
Index Creation:
- Script:
./index.sh
- Description: Creates the minimizer and RYmer index files on disk.
- Script:
-
Evaluate:
- Script:
evaluate.py
- Description: Computes summary statistics from the
alignment_stats.csv
file.
- Script:
-
Compare Damage Estimate:
- Script:
compare_damage_est.py
- Description: Performs a cross-comparison of damage rate estimates from bam2prof.
- Script:
-
Interleave FASTQ:
- Script:
interleave_fastq.sh
- Description: Interleaves FASTQ files.
- Script:
-
Populate MD Field:
- Script:
make_MD.sh
- Description: Populates the MD field in the BAM files based on the reference, as required by bam2prof.
- Script:
-
Create Damage Profiles:
- Script:
make_profs.sh
- Description: Runs bam2prof to create the damage profiles (i.e., estimated nucleotide misincorporation matrices).
- Script:
-
Generate simulations for linear aligner benchmarking experiment:
- Script:
Snakefile
- Description: Generates simulations
- Script:
-
Compute statistics from BAM file
- Script:
parseBamMito.py
- Description: Compute statistics from BAM file
- Script:
- Estimate parameters for spurious alignment model
- Script:
main.py
- Description: Estimates parameters of the power law distribution for our spurious alignment model using a RYmer and minimizer index.
- Script: