@OscarArroyoVega (Member)
feat: Add local LLM support with Ollama and rebuild query for production schema

Added Ollama as a local LLM provider option alongside OpenAI, Anthropic, and
Gemini, enabling development without API costs and improving response times
for local deployments.

Rebuilt the query builder to match the current production database schema with
complex joins across events, venues, and artists tables. The new query uses CTEs
to unnest artist relationships and aggregates artist data as both comma-separated
strings and JSON objects for flexible frontend rendering.
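The shape of such a query can be sketched as follows. This is an illustrative sketch only: table and column names (`events`, `venues`, `artists`, `artist_ids`, etc.) are assumptions, not the actual production schema.

```python
# Hypothetical sketch of the rebuilt query: a CTE unnests the
# event->artist relationships, then the outer query aggregates artists
# both as a comma-separated string and as a JSON array per event.
# All identifiers are illustrative, not the real production schema.
EVENTS_QUERY = """
WITH event_artists AS (
    SELECT e.id AS event_id, unnest(e.artist_ids) AS artist_id
    FROM events e
)
SELECT
    e.id,
    e.name,
    v.name AS venue_name,
    string_agg(a.name, ', ') AS artist_names,
    json_agg(json_build_object('id', a.id, 'name', a.name)) AS artists_json
FROM events e
LEFT JOIN venues v  ON v.id = e.venue_id
LEFT JOIN event_artists ea ON ea.event_id = e.id
LEFT JOIN artists a ON a.id = ea.artist_id
GROUP BY e.id, e.name, v.name;
"""
```

Returning both `artist_names` and `artists_json` lets the frontend pick whichever representation is cheaper to render.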

Why these changes:

  • Local LLM support reduces API costs during development and testing
  • The previous query structure didn't match the production schema, which splits data across venues and artists tables
  • Rich event data was needed, including venue details and multiple artist relationships per event
  • Performance monitoring was insufficient to identify bottlenecks

Implementation details:

  • Added Ollama provider with timeout configuration in llm_factory()
  • Built complex SQL with CTE for artist unnesting and LEFT JOINs
  • Implemented comprehensive timing logs across query building, DB execution, and LLM calls
  • Modified API schema to accept date lists instead of date_range objects

Known limitations:

  • Date filtering generates N OR conditions (one per date in range) - inefficient for large ranges
  • Query and schema should be reviewed by data engineer for optimization opportunities
  • Current implementation blocks on date list generation in frontend
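To make the first limitation concrete, here is a sketch of what the per-date filter expansion looks like (the `e.start_date` column name is illustrative, and real code should use bound parameters rather than string interpolation):

```python
# Illustration of the current date-list filter: every date in the list
# becomes its own OR condition, so a 214-day range produces 214 clauses.
# Column name is illustrative; real code should use bound parameters.
from datetime import date, timedelta

def date_filter(dates: list[date]) -> str:
    conditions = [f"e.start_date = '{d.isoformat()}'" for d in dates]
    return "(" + " OR ".join(conditions) + ")"

dates = [date(2025, 1, 1) + timedelta(days=i) for i in range(214)]
clause = date_filter(dates)  # 214 OR'd equality conditions
```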

Performance metrics (214-day range, 2 events):

  • Query building: 18ms
  • DB execution: 137ms
  • LLM call: 1031ms (Gemini)
  • Total: 1.2s backend, plus a ~60s cold start on the first request (needs attention!)
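The timing instrumentation behind metrics like these can be as simple as a context manager around each stage. Names and the log format below are assumptions, not the actual implementation:

```python
# Minimal timing helper of the kind used for the stage metrics above.
# Label names and log format are illustrative assumptions.
import time
from contextlib import contextmanager

@contextmanager
def timed(label: str, sink: list[str]):
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed_ms = (time.perf_counter() - start) * 1000
        sink.append(f"{label}: {elapsed_ms:.0f}ms")

logs: list[str] = []
with timed("query_build", logs):
    sum(range(100_000))  # stand-in for real query-building work
```

Wrapping query building, DB execution, and the LLM call each in `timed(...)` yields exactly the per-stage breakdown reported above.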

Next steps for improvement:

  1. Add LLM pre-processing layer to extract filter fields from natural language
  2. Implement conversational context/memory for multi-turn chat
  3. Replace date list with simple date range in SQL (start_date/end_date)
  4. Add voice-to-text input capability
  5. Implement NLP sentiment analysis on responses
  6. Create feedback collection system for negative interactions (RLHF dataset)
  7. Add streaming responses for better UX during LLM processing
  8. Optimize frontend loading with spinners and timeout handling
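Next-step item 3 can be sketched directly: replacing the date list with a single bounded range keeps the WHERE clause constant-size regardless of how many days the range covers (column name and placeholder style are illustrative):

```python
# Sketch of next-step item 3: a single BETWEEN range instead of N OR
# conditions. Column name and %s placeholder style are illustrative.
from datetime import date

def date_range_filter(start: date, end: date) -> tuple[str, tuple]:
    # Bound parameters avoid SQL injection and keep the clause O(1)
    # in the number of days covered.
    return "e.start_date BETWEEN %s AND %s", (start, end)

clause, params = date_range_filter(date(2025, 1, 1), date(2025, 8, 2))
```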

Refs: #17

Added local LLM support via Ollama and rebuilt query builder to match
production DB schema with events/venues/artists joins. Includes timing
instrumentation for performance monitoring.

Known issue: Date filtering uses N OR conditions - needs optimization.
Schema and query require data engineer review.

Next: Add filter extraction LLM, conversational memory, voice input,
sentiment analysis, and RLHF feedback collection system.
@OscarArroyoVega OscarArroyoVega self-assigned this Nov 7, 2025
@OscarArroyoVega OscarArroyoVega merged commit ec4026b into main Nov 7, 2025
2 checks passed
@OscarArroyoVega OscarArroyoVega deleted the feature/retriever-llamaindex-Sql branch November 7, 2025 20:41