Autonomous technical research agent powered by LangGraph and Ollama. All LLM inference runs locally — no cloud APIs, no API keys, zero paid services.
Given a research question, the agent follows a Plan → Act → Observe → Reflect loop to iteratively gather evidence from the web and local documents, then produces a polished Markdown report with citations and a Mermaid architecture diagram.
```
research_agent/
├── api/      # FastAPI app (JSON + SSE streaming API) + routers
├── cli/      # Typer CLI
├── graph/    # LangGraph state machine (plan/act/observe/reflect/write)
├── llm/      # Ollama HTTP client + LLM adapter
├── tools/    # Web search, URL fetch, Python sandbox, local docs, RAG stub
├── memory/   # SQLite persistence for runs
├── report/   # Markdown + Mermaid report rendering
└── util/     # Logging helpers
```
```
frontend/
├── src/
│   ├── App.jsx                  # Main app component
│   ├── api.js                   # API client (SSE streaming + JSON)
│   └── components/
│       ├── ResearchForm.jsx     # Question form + PDF upload
│       ├── ReportView.jsx       # Markdown + Mermaid report renderer
│       ├── ProgressTracker.jsx  # Real-time research progress pipeline
│       ├── MermaidDiagram.jsx   # Mermaid rendering with sanitization
│       └── RunHistory.jsx       # Sidebar for past research runs
├── Dockerfile                   # Multi-stage: node build → nginx serve
├── nginx.conf                   # Static files + /api/ proxy (SSE-ready) + SPA fallback
├── index.html                   # Vite entry point
└── vite.config.js               # Vite config with dev proxy
```
```mermaid
graph LR
    Plan --> Act --> Observe --> Reflect
    Reflect -->|needs more evidence| Act
    Reflect -->|confident enough| WriteReport
    WriteReport --> Done
```
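For orientation, the loop above can be wired as a LangGraph state machine roughly like the sketch below. This is a minimal illustration, not the repo's actual `research_agent/graph/` code; the state fields and node bodies are placeholder assumptions:

```python
from typing import TypedDict

from langgraph.graph import END, StateGraph


class ResearchState(TypedDict):
    question: str
    evidence: list[str]
    confident: bool


# Placeholder nodes: each returns a partial state update.
def plan(state: ResearchState) -> dict:
    return {}

def act(state: ResearchState) -> dict:
    return {"evidence": state["evidence"] + ["<tool output>"]}

def observe(state: ResearchState) -> dict:
    return {}

def reflect(state: ResearchState) -> dict:
    # Stand-in confidence check; the real agent asks the LLM.
    return {"confident": len(state["evidence"]) >= 3}

def write_report(state: ResearchState) -> dict:
    return {}


graph = StateGraph(ResearchState)
for name, fn in [("plan", plan), ("act", act), ("observe", observe),
                 ("reflect", reflect), ("write_report", write_report)]:
    graph.add_node(name, fn)

graph.set_entry_point("plan")
graph.add_edge("plan", "act")
graph.add_edge("act", "observe")
graph.add_edge("observe", "reflect")
# Reflect either loops back to Act or proceeds to the report.
graph.add_conditional_edges(
    "reflect",
    lambda s: "write_report" if s["confident"] else "act",
)
graph.add_edge("write_report", END)

app = graph.compile()
```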
- Docker and Docker Compose
- ~5 GB free disk space for the Gemma model
```bash
cp .env.example .env
docker compose up -d
make pull-model
```

`make pull-model` runs `ollama pull gemma` inside the Ollama container. The first pull downloads ~5 GB. The model is persisted in a named Docker volume, so it survives restarts.
CLI:

```bash
docker compose run --rm api python -m research_agent.cli.main \
  "What are the best practices for deploying LLMs in production?"
```

API:

```bash
curl -X POST http://localhost:8000/api/research \
  -H "Content-Type: application/json" \
  -d '{"question": "What are the best practices for deploying LLMs in production?"}'
```

Web UI:
Open http://localhost:3000 in your browser. The UI streams real-time progress updates as the agent works — you'll see each phase (Plan → Act → Observe → Reflect → Write) light up, along with the current iteration, active tool, evidence count, and plan steps.
All configuration is via environment variables (or a `.env` file):
| Variable | Default | Description |
|---|---|---|
| `OLLAMA_HOST` | `http://ollama:11434` | Ollama API endpoint |
| `OLLAMA_MODEL` | `gemma` | Model to use for inference |
| `OLLAMA_TIMEOUT_SECONDS` | `120` | Timeout per LLM call |
| `LOG_LEVEL` | `INFO` | Logging verbosity |
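If you need these settings in your own code, a minimal sketch using only the standard library follows; the `Settings` class itself is hypothetical (the repo presumably has its own config module), but the variable names and defaults mirror the table above:

```python
import os
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Settings:
    # Defaults mirror the configuration table.
    ollama_host: str = field(
        default_factory=lambda: os.getenv("OLLAMA_HOST", "http://ollama:11434"))
    ollama_model: str = field(
        default_factory=lambda: os.getenv("OLLAMA_MODEL", "gemma"))
    ollama_timeout_seconds: int = field(
        default_factory=lambda: int(os.getenv("OLLAMA_TIMEOUT_SECONDS", "120")))
    log_level: str = field(
        default_factory=lambda: os.getenv("LOG_LEVEL", "INFO"))


settings = Settings()
```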
```bash
# In .env
OLLAMA_MODEL=llama3

# Pull it
docker compose exec ollama ollama pull llama3
```

CLI usage:

```
research-agent research [OPTIONS] QUESTION

Options:
  --audience TEXT      Target audience: engineer or executive  [default: engineer]
  --depth TEXT         Desired depth of research  [default: thorough]
  --max-iters INTEGER  Maximum research iterations  [default: 6]
  --timebox INTEGER    Timebox in minutes  [default: 5]
  --raw                Print raw markdown instead of rendered
```
| Method | Path | Description |
|---|---|---|
| POST | `/api/research` | Start a research run (JSON or SSE streaming) |
| GET | `/api/runs` | List previous runs |
| GET | `/api/runs/{run_id}` | Get a specific run result |
| GET | `/health` | Health check |
| GET | `/` | API info (JSON) |
Accepts `multipart/form-data` with fields: `question` (required), `audience`, `desired_depth`, `max_iters`, `timebox_minutes`, and `pdf_file` (optional PDF upload).
JSON response (default):

```bash
curl -X POST http://localhost:8000/api/research \
  -F "question=What are the best practices for deploying LLMs in production?"
```

SSE streaming: set `Accept: text/event-stream` to receive real-time progress events:

```bash
curl -X POST http://localhost:8000/api/research \
  -H "Accept: text/event-stream" \
  -F "question=What are the best practices for deploying LLMs in production?"
```

SSE events: `status` (phase/iteration/tool/evidence updates after each node), `plan` (research plan steps), `error`, and `complete` (final report + metadata).
| Service | Port | Description |
|---|---|---|
| `ollama` | 11434 | Ollama model server (GPU-accelerated) |
| `api` | 8000 | FastAPI backend (pure JSON API) |
| `frontend` | 3000 | React SPA served by nginx |
The frontend nginx container proxies `/api/` requests to the backend (with `proxy_buffering off` for SSE support), so all traffic can go through port 3000.
The agent has access to these tools during research:
| Tool | Description |
|---|---|
| `web_search` | DuckDuckGo search (no API key needed) |
| `fetch_url` | Fetch and extract content from URLs |
| `python_sandbox` | Execute Python in a sandboxed subprocess |
| `local_docs` | Search `./docs` and `./data` directories |
| `elastic_rag` | Elasticsearch RAG stub (implement to integrate) |
- Create a class in `research_agent/tools/` extending `BaseTool`
- Implement the async `run(self, *, query: str, **kwargs) -> ToolResult` method
- Register it in `TOOL_REGISTRY` in `research_agent/tools/__init__.py` (see the sketch below)
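As a rough illustration of those three steps, here is a hypothetical `word_count` tool. The `BaseTool` base class, the `ToolResult` fields, and the import paths are assumptions inferred from the signature above, not the repo's actual definitions:

```python
# research_agent/tools/word_count.py  (hypothetical example)
from research_agent.tools import BaseTool, ToolResult  # assumed import path


class WordCountTool(BaseTool):
    """Toy tool that counts the words in the query string."""

    name = "word_count"
    description = "Count the words in a query string."

    async def run(self, *, query: str, **kwargs) -> ToolResult:
        # ToolResult fields are assumed; adapt to the real dataclass.
        return ToolResult(ok=True, content=f"{len(query.split())} words")


# In research_agent/tools/__init__.py (registry shape is also assumed):
# TOOL_REGISTRY["word_count"] = WordCountTool()
```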
Every report contains:
- Summary — concise overview
- Key Findings — bullet list with evidence references
- Recommendations — actionable items with tradeoffs
- Architecture Diagram — Mermaid diagram
- Sources — numbered citations with URLs
See docs/sample-output.md for a full example.
```bash
make test
make lint
make fmt
```

| Target | Description |
|---|---|
| `make up` | Build and start all services |
| `make down` | Stop all services |
| `make logs` | Tail service logs |
| `make pull-model` | Pull the default Ollama model |
| `make test` | Run pytest |
| `make lint` | Run ruff linter |
| `make fmt` | Run ruff formatter |
| `make run-example` | Run an example research query |
MIT