Skip to content

SV end points for "OG" variants #54

@MaximilianStammnitz

Description

@MaximilianStammnitz

Hi @hannespetur,

Thanks again for fixing #43!

Browsing through my first VCFs, I found five variant classes ("VarType"): OG (very long/interchromosomal), UG (duplications), DG (deletions), FG (insertions), XG (complex indels?). For OG-type variants, I consistently obtain REF allele counts only (i.e. ABHet=-1;ABHom=1). This was very unexpected because some of these calls are definitely true heterozygotes.

So I dipped into the VCF's INFO field a bit more, and realise that for OGs the specified SV "END" point is actually the same as the "POS" identifier in column 2. This is not the case for the other two potential long-range VarTypes UG and DG. If this "END" point is indicative of GraphTyper's internal use, then to me it feels like the script might currently "go circular" for OGs, checking starting point <> starting point combinations which therefore nearly always render 100 % REF.

Could you please have a look at this? Happy to test any changes to this again.

Best,
Max

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions