Skip to content

Extremely high proportion of BND? #72

@Asadi7163

Description

@Asadi7163

I am running Severus on ONT long-read tumor–normal WGS data from a mm39 tumor model. The pipeline completes successfully and produces VCFs, but I am seeing that the overwhelming majority of somatic SVs are labeled as BND rather than DEL/DUP/INV/INS. Specifically, in my last two runs (1 without a VNTR-BED and 1 with) I saw that 99% of the SV's are labeled as BND.

Both tumor and normal BAMs are coordinate-sorted and indexed, aligned with minimap2, and I provided the mm39 TRF VNTR-BED to my second run. The data are not haplotype-tagged.

My questions:
1. Is this expected behavior for cancer genomes and/or ONT data?
2. Does this indicate that the SV graph is too complex to resolve simple events?
3. Are there recommended filters or post-processing steps to convert BNDs into more interpretable SVs?
4. Would adding phasing (e.g., WhatsHap) or increasing coverage change BND resolution?

I would appreciate any guidance on how to interpret or reduce the BND calls. Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions