Replies: 2 comments 1 reply
- Can you add an option to manually specify the endpoint? My LM Studio instance is running on another machine on my network.
- Coming soon 😃
- The desktop app could leverage a locally running Ollama instance as an additional tier in the AI fallback chain (Groq → OpenRouter → Ollama → browser-side T5). When Ollama is detected on localhost:11434, the sidecar would route classification and summarization requests to any compatible model the user has pulled (e.g., Llama 3.1 8B, Mistral, Gemma). This eliminates the cloud API dependency entirely for desktop users: no API keys, no rate limits, no data leaving the machine. The sidecar already runs all API handlers locally, so adding an Ollama adapter is a natural extension: probe the /api/tags endpoint at startup to detect available models, prefer quantized 8B variants for speed, and fall back to the next tier if Ollama isn't running or the request times out. For users with Apple Silicon or modern GPUs, inference latency on 8B models is comparable to cloud round-trips, making this a zero-cost, fully private alternative.
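  The detection-and-fallback step above could be sketched roughly as follows. This is a minimal illustration, not the app's actual code: the function names (`probeOllama`, `pickModel`), the model-preference heuristic, and the timeout value are all assumptions; only the `/api/tags` endpoint and its `{ models: [...] }` response shape come from Ollama's documented API.

  ```typescript
  interface OllamaTag {
    name: string; // e.g. "llama3.1:8b-instruct-q4_K_M"
  }

  // Hypothetical heuristic: prefer a quantized 8B variant for speed,
  // then any 8B model, then whatever is listed first.
  function pickModel(tags: OllamaTag[]): string | null {
    if (tags.length === 0) return null;
    const preferred =
      tags.find((t) => /8b/i.test(t.name) && /q\d/i.test(t.name)) ??
      tags.find((t) => /8b/i.test(t.name));
    return (preferred ?? tags[0]).name;
  }

  // Probe /api/tags at startup. Any network error, non-2xx status, or
  // timeout is treated as "Ollama not available", so the caller can
  // fall through to the next tier in the chain.
  async function probeOllama(
    base = "http://localhost:11434",
    timeoutMs = 1000,
  ): Promise<string | null> {
    try {
      const res = await fetch(`${base}/api/tags`, {
        signal: AbortSignal.timeout(timeoutMs),
      });
      if (!res.ok) return null;
      const body = (await res.json()) as { models: OllamaTag[] };
      return pickModel(body.models);
    } catch {
      return null; // unreachable or timed out → skip this tier
    }
  }
  ```

  The key design point is that the probe happens once at startup and failure is silent: a `null` result simply means the request router never offers the Ollama tier, exactly as when a cloud key is missing.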