-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Connecting GWAS database #29
Comments
For clarification, there are two main studies we've conducted regarding edge prediction on hetnets. From https://het.io/about/#cite: Hetionet v1.0 was created as part of Project Rephetio, i.e. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. So you can mostly ignore "Heterogeneous network link prediction prioritizes disease-associated genes" and just focus on Hetionet v1.0.
GWAS data does make it into Hetionet. Copying from the methods:
The most important supplemental discussion on how we processed the GWAS catalog is at https://doi.org/10.15363/thinklab.d80.
Cool. One place to start would be putting a symptom into https://het.io/search/ and then subsetting to genes for the target node: This is a more manual exploratory approach. There are also more automated high-throughput approaches you could do with some scripting / programming. |
Thank you very much Daniel. I would like to learn more about automated high-throughput approaches for querying the database. Can you guide me to it? Also, in the search, I added breast cancer in source node, but the target nodes do not show up common predisposition genes like brca1 and brca2 etc. Why is that? Best, Nilesh |
The approach does find many types of paths that occur more than expected by chance between breast cancer and BRCA1 and BRCA2. So I think the question is more why don't BRCA1 and BRCA2 show up as the top result for breast cancer: One reason is that the metric we're ranking by is simplistic. The search result ranking is by number of significant types of paths. It does not take into account how significant those types of paths are. That being said, the top result of MYC has 19 significant types of paths (metapaths), while BRCA1 has 12... so it's not that far from the top. Will make another comment to address the rest of your questions. |
Hi Daniel,
My name is Nilesh Dharajiya, MD, and I am a molecular pathologist by training. I came across het.io recently and am very impressed by it. I have been playing with Neo4j since last 3 years and like the prospect of graph db in medical science. I saw that Hetionet combines data from 29 databases, which does not include GWAS database.
However, your abstract "Heterogeneous network link prediction prioritizes disease-associated genes” shows 698 associations extracted from GWAS catalog.
Have you every tried margin gwas data into the het.io graph db? I am specially looking for detecting relation ships between clinical symptoms to disease to genes and using calculating polygenic risk scores from gwas data so that with a symptoms, I can go all the way to genes involved in the pathogenesis. Is this possible? Do you know anyone who has done this?
Looking forward to hearing from you.
Best regards,
Nilesh
The text was updated successfully, but these errors were encountered: