feature/fallback providers #39
base: master
Conversation
**Note:** Ollama does not provide token usage information, so `input_tokens` and `output_tokens` will always be empty in debug logs and response metadata. Function calls are also not supported with Ollama.

### Streaming Responses
moved
I think there are a lot of abstraction leaks exposed to the final user (cache, router, runner, etc.) that are hidden behind default values. Let's try it and see if it works for us.
Really nice features! Next time I would split some of the smaller changes into different PRs.
Closes #34
Implement multi-provider support with provider routing and failover:

- New `:providers` list in `LlmComposer.Settings` to replace the deprecated `:provider` and `:provider_opts` keys.
- `LlmComposer` updated to enforce/suggest exclusive use of `:providers` and warn about the deprecated keys.
- New `LlmComposer.ProviderRunner` to handle provider execution, supporting multiple providers with fallback logic.
- New `LlmComposer.ProviderRouter` behaviour to define routing strategies for provider selection, failure handling, and blocking.
- New `LlmComposer.ProviderRouter.Simple` with exponential backoff blocking on provider failures.
- `LlmComposer.run_completion/3` updated to delegate to the `ProviderRunner` for provider selection and execution.
- `LlmComposer.Cache.Ets` improved by switching `put` and `delete` calls to asynchronous casts for better performance.
- `response_format` key swapped for `response_schema`, for a better definition of structured outputs that works across more than one provider with the same schema definition.

You may want to test the router `simple.ex`, which implements the backoff blocking of providers. Here is a very basic example; make sure the Ollama service is stopped or not installed.
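A minimal sketch of what such a script could look like, assuming the `:providers` list accepts `{provider_module, provider_opts}` tuples, that the router is selectable via a settings key, and that `LlmComposer.simple_chat/2` returns `{:ok, response}` with a `main_response` field; the provider module names and option keys below are assumptions and may differ from the actual API in this PR:

```elixir
# Hypothetical fallback_example.ex -- module and option names are assumptions
# based on this PR's description, not the exact library API.
settings = %LlmComposer.Settings{
  # New :providers list from this PR; the first entry is tried first and the
  # later entries act as fallbacks when a provider fails or is blocked.
  providers: [
    {LlmComposer.Providers.Ollama, model: "llama3.1"},
    {LlmComposer.Providers.OpenAI, model: "gpt-4o-mini"}
  ],
  # Assumed key for picking the routing strategy with exponential backoff.
  provider_router: LlmComposer.ProviderRouter.Simple,
  system_prompt: "You are a helpful assistant."
}

# With Ollama stopped, the first call should fail over to OpenAI, and the
# Simple router should block Ollama with exponential backoff, so later calls
# skip it while the block is active.
for n <- 1..3 do
  case LlmComposer.simple_chat(settings, "Hello ##{n}, which provider answered?") do
    {:ok, res} -> IO.inspect(res.main_response, label: "response #{n}")
    {:error, reason} -> IO.inspect(reason, label: "error #{n}")
  end
end
```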
Place the example in the `llm_composer` folder and run `mix run <the example>.ex`.