-
Notifications
You must be signed in to change notification settings - Fork 21
Description
Hi @hannespetur,
Thanks again for fixing #43!
Browsing through my first VCFs, I found five variant classes ("VarType"): OG (very long/interchromosomal), UG (duplications), DG (deletions), FG (insertions), XG (complex indels?). For OG-type variants, I consistently obtain REF allele counts only (i.e. ABHet=-1;ABHom=1). This was very unexpected because some of these calls are definitely true heterozygotes.
So I dipped into the VCF's INFO field a bit more, and realise that for OGs the specified SV "END" point is actually the same as the "POS" identifier in column 2. This is not the case for the other two potential long-range VarTypes UG and DG. If this "END" point is indicative of GraphTyper's internal use, then to me it feels like the script might currently "go circular" for OGs, checking starting point <> starting point combinations which therefore nearly always render 100 % REF.
Could you please have a look at this? Happy to test any changes to this again.
Best,
Max