Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use the entity2src.tsv to query the corresponding concepts #16

Open
freshnemo opened this issue Jul 10, 2020 · 1 comment
Open

Comments

@freshnemo
Copy link

Hi, thank for sharing this great task. But I want to ask how can I use entity2src to get the corresponding concepts. For example : Gene::100129669 [Hetionet] Biomedical knowledge graph https://het.io/about/ [STRING] https://string-db.org/, does this mean the identifier 100129669 are the same with Hetionet and STRING?

@gurdaspuriya
Copy link
Collaborator

The file entity2src.tsv maps the entities/nodes to the list of data sources they appear in (we use seven different data sources to construct the DRKG). Regarding the IDs, we use the following rules to assign IDs to the entities:
(i) Compound entities are mapped to the Drugbank ID and if not possible to the Chembl ID. If a compound can not be found to either of the two we use the native ID space and we include the name of the source as part of the entity’s name.
(ii) Gene entities are mapped to the Entrez ID.
(iii) Disease entities are mapped to the MESH ID space.
(iv) The remaining biological entities appear only in a single data source and hence we use the data source’s ID.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants