Comparative Enzyme-Disease Network Analysis from PubMed Literature

A Graph-Based Approach to Biomedical Insight

This project builds and visualizes enzyme-centric knowledge graphs by mining biomedical literature from PubMed. It focuses on comparing how different enzymes—defined by their Enzyme Commission (EC) numbers—are associated with specific diseases such as Cancer and SARS-CoV-2.

By mapping co-mentions of enzymes across thousands of research articles, this pipeline enables a data-driven exploration of enzyme-disease associations within the scientific literature.

🚀 Project Overview

This pipeline includes the following steps:

PMID Extraction
Collect disease-specific publication IDs from PubMed using automated search or manual input.
Metadata Retrieval
Fetch complete MEDLINE records for each publication using the NCBI Entrez API.
Enzyme Detection
Extract EC-numbered enzyme mentions from the RN (Registry Number) field.
Graph Construction
Build a bipartite network where:
- Nodes represent PMIDs and enzymes
- Edges represent co-mentions in a publication
Interactive Visualization
Render the graph using PyVis for interactive exploration, with enzyme classes visually grouped.
Graph Export
Export the network in both HTML (browser-based view) and GraphML (for Cytoscape/Gephi) formats.

🧪 Use Cases

Identify and compare enzyme involvement across different diseases
Discover potential biomarkers or shared therapeutic targets
Track research trends or gaps in enzyme-related disease studies
Generate hypotheses from high-level literature association maps

📈 Example Output

Visualizations are enriched by enzyme class and publication links.

Enzyme nodes are colored by diseases (e.g., Covid-19 and Cancer).
Interactive output allows zoom, search, and node inspection.

🧠 What's Next?

This foundation can be extended with:

Edge weighting based on enzyme frequency or co-occurrence strength
Semantic enrichment from MeSH terms or abstracts
Temporal tracking of enzyme-disease focus over time
Disease-disease comparison graphs based on enzyme overlap

Feel free to fork the repository or raise issues for improvements. For citation or integration into a pipeline, please contact the author.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
Automate_common_enzymes_among_two_diseases.ipynb		Automate_common_enzymes_among_two_diseases.ipynb
Common-Enzymes-Detail.drawio.png		Common-Enzymes-Detail.drawio.png
LICENSE.md		LICENSE.md
P01_Enzymes_Network_Graph_Analysis.ipynb		P01_Enzymes_Network_Graph_Analysis.ipynb
P02_Enzymes_Network_Graph_Analysis.ipynb		P02_Enzymes_Network_Graph_Analysis.ipynb
README.md		README.md
Screenshot 2022-03-11 165751.png		Screenshot 2022-03-11 165751.png
Screenshot 2022-03-11 165902.png		Screenshot 2022-03-11 165902.png
manual_common_enzymes_among_two_diseases.ipynb		manual_common_enzymes_among_two_diseases.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Comparative Enzyme-Disease Network Analysis from PubMed Literature

A Graph-Based Approach to Biomedical Insight

🚀 Project Overview

🧪 Use Cases

📈 Example Output

🧠 What's Next?

About

Uh oh!

Releases

Packages

Languages

License

akshayonly/Viz-Common-Enzymes

Folders and files

Latest commit

History

Repository files navigation

Comparative Enzyme-Disease Network Analysis from PubMed Literature

A Graph-Based Approach to Biomedical Insight

🚀 Project Overview

🧪 Use Cases

📈 Example Output

🧠 What's Next?

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages