This project enables automated scientific research using Large Language Models (LLMs) and search APIs. I've deliberately kept this readme simple. More discussion on this topic can be found in the linked Medium community science post.
git clone https://github.com/jd-coderepos/deep-research.git
cd deep-research# Windows
python -m venv venv
venv\Scripts\activate
# macOS/Linux
python3 -m venv venv
source venv/bin/activatepip install -r requirements.txtCreate a .env file in the project root with the following content:
OPENAI_API_KEY=your-openai-api-key
FIRECRAWL_API_KEY=your-firecrawl-api-key
⚠️ Do not commit .env to version control.
To start the research process:
python src/main.pydeep-research/
├── src/              # Main source code
├── scripts/          # Research evaluation pipeline
├── data/             # Research data and reports
├── requirements.txt  # Python dependencies
└── README.md         # Setup instructionsAfter generating research reports, you can evaluate their quality using the qualitative analysis pipeline:
cd scripts
python qualitative_analysis_pipeline.pyThe pipeline provides comprehensive quality assessment across different depth-breadth configurations, generating:
- Multi-dimensional quality metrics
- Publication-ready visualizations
- Comparative analysis reports
- Statistical evaluation of research effectiveness
For detailed documentation on the evaluation pipeline, see scripts/README.md.
If you use this repository or its ideas in your research, please cite the following paper:
@misc{dsouza2025deepresearchtextecorecursiveagenticworkflow,
  title={DeepResearch$^{\text{Eco}}$: A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology}, 
  author={Jennifer D'Souza and Endres Keno Sander and Andrei Aioanei},
  year={2025},
  eprint={2507.10522},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  url={https://arxiv.org/abs/2507.10522}
}This project is a Python reimplementation of the original deep-research repository by David Zhang, developed in TypeScript. Credit goes to the original author for the concept and design of the deep-research workflow.
