-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
some queries about SViper #22
Comments
HI @prasundutta87, thanks for your interest in SViper!
This means that after polishing the variant is not visible anymore in the data (in the corrected long reads).
This means that for the given variant, no proper region from the long read could be extracted. E.g. although a long read has a desired deletion of 200, the flanking regions of this deletion are mapped very poorly, or the mapping indicates a complex variant. SViper can only polish simple deletions and insertions.
I'll try to change this! But I can't promise that it will be in the next days.
The score is computed here: SViper/include/sviper/evaluate_final_mapping.h Lines 21 to 27 in 3b57a9c
It does not have a biological significance! As far as I remember my own code, it was experimentally derived and proved to work well on manual inspection.
Unfortunately this is very hard to answer and heavily depends on your use case. In general I can say that you should filter out variants with a
Can you filter the SVs before polishing them with SViper? Otherwise I might need to see if I can add an option that does not overwrite the quality scores but adds them in the INFO field. Best, |
Hi @smehringer , Thank you so much to answer my queries. Is there a document anywhere where the algorithm or working on SViper is mentioned anywhere. I am just trying to get my head around the algorithm (not the numerical/quantification bit, but the general concept of polishing). For example, when you say polishing the variant is not visible anymore in the corrected long reads, can this please be elaborated? Thanks for the suggestion to use SViper after my final filtering. Also, do we need to sort the BAMs by name? It seems to be specifically mentioned for utilities_merge_split_alignments tool. Currently I have coordinate sorted my BAMs. This may be trivial, but the master version of SViper is 2.0.0, but the most recent version is 2.1.0. At least it shows in the help that it is 2.0.0. Could you kindly clarify this? Regards, |
I've send you an email with my thesis that hopefully can answer most of your questions.
For sviper, sorting by coordinate is fine (even required).
I'm very sorry for the confusion! This is a documentatio bug. I forgot to change the version in the help page. I'll correct this. |
Hi,
I am working with trios ONT data for which I have Illumina short read data as well. I am using SViper 2.0.0 for polishing my SV breakpoints. I have a few queries and I would be grateful if I could be helped with them.
When I was testing the software with one set of sample, I found that most of the FAILs were FAIL5 ("The variant was polished away."). What is the reason behind this fail?
What is meant by FAIL3 (The long read regions do not fit)? Can this please be elaborated?
I am aware that no tags should be present. SViper skips the variants. With bcftools, I am getting the error that SKIP is not defined. If you are still developing the tool, can that please be added? Although I have changed the SVTYPE to INS, SViper checks the variant type by tags rather than SVTYPE.
I observed that the the SViper score is put on the QUAL field. How is the score calculated and does it have any biological significance? Should I filter my SVs based on SViper score again? Is there a threshold based on which I should remove SVs? I actually filter my SVs using QUAL value of the variant caller (cuteSV). So, replacing this value with the score effects my pipeline.
Regards,
Prasun
The text was updated successfully, but these errors were encountered: