Implement models for BrainForge service #8

NeonDaniel · 2024-12-18T23:54:08Z

Description

Implement models for LLM MQ requests/responses
Implement models for LLM requests via HANA endpoints

Issues

Adds incoming user query to history in LLMRequest.to_completion_kwargs. Missed in #4

Other Notes

…a" BrainForge personas

Add `LLMGetInference.as_llm_request` convenience method

neon_data_models/models/api/llm.py

neon_data_models/models/api/mq/brainforge.py

neon_data_models/models/api/llm.py

neon_data_models/models/api/http/brainforge.py

NeonBohdan · 2024-12-23T17:30:33Z

neon_data_models/models/api/http/brainforge.py

+
+
+class LLMGetModelsHttpResponse(BaseModel):
+    models: List[BrainForgeLLM]


This returns both all models and all personas, making Persona related requests useless

Maybe the brainforge_get_personas endpoint isn't necessary at all? The only use case I see for it now is if some client wants to get a specific model@revision without parsing all of the available models

I think that it will never be the case
Because every service wants to request all available info at once and validate requests before been sent

But we can keep it, and deside later

… handling Refactor tokenizer model names to be more descriptive

neon_data_models/models/api/mq/brainforge.py

NeonDaniel added 12 commits December 18, 2024 15:52

Define models for BrainForge services

637e418

Allow empty string system_prompt in LLMPersona to support "vanill…

e02a0b6

…a" BrainForge personas

Fix fields defined with descriptions as default values

ef2fbac

Add BrainForgeLLM.vllm_spec convenience property

a660766

Add `LLMGetInference.as_llm_request` convenience method

Start defining HTTP models for brainforge endpoints

d44fbcb

Add HTTP models for BrainForge service endpoints

ad2d42c

Add missing models.api.http init file

48b7146

Append user query to history in LLMRequest.to_completion_kwargs

966534b

Add missing license notice

11f194c

Add docstrings to clarify submodule intended usage

ca32adc

Update documentation and tests to account for history handling change

9dd2ac2

Fix test error

449dcca

NeonDaniel requested a review from NeonBohdan December 19, 2024 00:09

NeonDaniel added 5 commits December 19, 2024 14:27

Allow persona definition with None system prompt

55bf516

Prevent inserting None system prompt in completion request history

bf36ffa

Define models for generic completion requests/responses

24a170c

Define models for tokenizer request/response

8b83a41

Update raw endpoint models based on actual usage

82d8e1b