@mfittko mfittko commented May 19, 2025

Overview

  • Proxy: Centralized error handling with enhanced OpenAI exception mapping and parsing for clearer logs and standardized responses.
  • Proxy: Rename function from image_generation to moderation in the moderations endpoint; change call type from audio_speech to pass_through_endpoint for accurate logging/metrics.
  • OpenAI (audio speech): Add streaming support via context managers and implement deferred streaming to avoid prematurely closing upstream streams; maintains sync compatibility while improving async performance.
  • CI: Add .github/workflows/sofatutor_image.yml and remove other workflows on this branch to keep only the Sofatutor image workflow.

Based on PR #4.
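
The centralized error handling described in the overview can be sketched roughly as follows. Note this is an illustrative sketch, not LiteLLM's actual API: `ProxyError` and `map_openai_error` are hypothetical names, and the substring matching stands in for the real exception parsing/mapping logic.

```python
# Hypothetical sketch of centralized proxy error handling: parse an
# upstream OpenAI-style error and map it to a standardized error with
# an HTTP status code, so logs and responses stay consistent.
# ProxyError and map_openai_error are illustrative names only.

class ProxyError(Exception):
    def __init__(self, status_code: int, message: str):
        self.status_code = status_code
        self.message = message
        super().__init__(message)

def map_openai_error(exc: Exception) -> ProxyError:
    """Translate an upstream exception into a standardized ProxyError."""
    text = str(exc)
    lowered = text.lower()
    if "rate limit" in lowered:
        return ProxyError(429, text)
    if "invalid api key" in lowered:
        return ProxyError(401, text)
    # Fall back to a generic server error for unrecognized failures.
    return ProxyError(500, text)
```

A single mapping function like this gives every endpoint the same status codes and log shape instead of ad-hoc per-route handling.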

Changes

  • litellm/proxy/proxy_server.py
    • Centralized error handling for proxy exceptions
    • Map/parse OpenAI errors to improve logging and response formatting
    • Rename moderations function: image_generation → moderation
    • Update call type: audio_speech → pass_through_endpoint
  • litellm/llms/openai/openai.py
    • Support streaming responses using context managers for efficient byte iteration
    • Implement deferred streaming to prevent premature upstream close; keeps sync behavior intact while enhancing async
  • .github/workflows/sofatutor_image.yml
    • New workflow dedicated to Sofatutor image build/publish

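The deferred-streaming change above can be illustrated with a minimal sketch: the upstream context manager stays open until the caller has finished iterating the bytes, so the stream is never closed before its chunks are consumed. Here `open_upstream` is a hypothetical stand-in for an httpx streaming-response context manager; none of these names are from the LiteLLM codebase.

```python
from contextlib import contextmanager
from typing import Iterator

@contextmanager
def open_upstream():
    # Stand-in for an httpx streaming response context manager.
    # In the real code this would open the upstream HTTP stream.
    yield iter([b"chunk-1", b"chunk-2", b"chunk-3"])

def deferred_stream() -> Iterator[bytes]:
    """Yield bytes lazily while keeping the upstream context open.

    Because the `with` block wraps the `yield from`, the upstream is
    only closed after the consumer exhausts (or abandons) the iterator,
    avoiding the premature-close bug described above.
    """
    with open_upstream() as byte_iter:
        yield from byte_iter
```

The same pattern applies to the async path with `async with` and `async for`, which is where the performance benefit shows up.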
Rationale

  • Improve reliability and clarity of proxy error handling and observability
  • Align function and call type naming with actual behavior for better analytics
  • Enable robust audio speech streaming paths with low memory usage and fewer edge-case failures
  • Keep CI minimal/specific for Sofatutor image builds on this branch

Changelog

  • Implement deferred streaming for OpenAI audio speech methods
  • Enhance OpenAI audio speech methods to support streaming via context managers
  • Change call type: audio_speech → pass_through_endpoint
  • Rename moderations function: image_generation → moderation
  • Centralize proxy error handling; improve OpenAI error parsing/mapping
  • CI: keep only sofatutor_image.yml; remove the other workflows on this branch

Files Changed

  • A .github/workflows/sofatutor_image.yml
  • M litellm/llms/openai/openai.py
  • M litellm/proxy/proxy_server.py

Notes

  • No API surface changes intended; naming updates affect logging/observability only
  • Audio streaming changes are backward compatible for sync callers

@mfittko mfittko self-assigned this May 19, 2025
…enAI exception mapping and parsing. Added functions to parse OpenAI error messages and handle proxy exceptions, improving error logging and response formatting.
…o "moderation" in the moderations endpoint, ensuring accurate logging and call type handling.
…through_endpoint" for improved logging and handling in the audio processing workflow.
…ing context managers, allowing for efficient byte iteration without buffering.
…g for efficient byte iteration without prematurely closing the upstream stream. This change enhances the async audio speech functionality while maintaining compatibility with existing synchronous behavior.
…leaner test setup in TTS deferred streaming tests
…ehavior and ensure proper streaming iteration
…ction for speech calls and add unit tests for verification
…ng alerts and implement unit test for missing webhook scenario
- Remove unused 'import openai' from cloud_watch.py
- Remove test_assistants_logging test
- Update docs to remove Assistants API mentions
@mfittko mfittko changed the base branch from feature/cloudwatch-assistants-logging to main November 25, 2025 16:32
@mfittko mfittko changed the base branch from main to v1.80.0-stable November 25, 2025 16:36
mfittko and others added 5 commits November 25, 2025 17:41
…jects

When using the Responses API with a prompt object, OpenAI returns
the instructions field as a list of message objects (expanded from
the prompt template) rather than a string.

The OpenAI SDK correctly defines this as:
  instructions: Union[str, List[ResponseInputItem], None]

But LiteLLM's ResponsesAPIResponse had:
  instructions: Optional[str]

This caused a Pydantic ValidationError when streaming responses
tried to parse ResponseCreatedEvent because it expected a string
but received a list.

This fix updates the type to accept both formats:
  instructions: Optional[Union[str, List[Dict[str, Any]]]]

Added tests for:
- Non-streaming responses with instructions as list
- Non-streaming responses with instructions as string
- Streaming events (ResponseCreatedEvent, ResponseInProgressEvent,
  ResponseCompletedEvent) with instructions as list
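The type fix above can be demonstrated with a minimal Pydantic sketch. The model name here is illustrative (the real model is `ResponsesAPIResponse`), but the field annotation mirrors the fix: `instructions` validates as either a string or a list of message-like dicts instead of raising a ValidationError on the list form.

```python
from typing import Any, Dict, List, Optional, Union
from pydantic import BaseModel

class ResponseSketch(BaseModel):
    # Mirrors the fix: accept both the plain-string form and the
    # expanded list-of-message-objects form returned for prompt objects.
    instructions: Optional[Union[str, List[Dict[str, Any]]]] = None

# Both shapes now validate without error.
as_string = ResponseSketch(instructions="Be concise.")
as_list = ResponseSketch(
    instructions=[{"role": "system", "content": "Be concise."}]
)
```

With the old `Optional[str]` annotation, the second construction would raise a Pydantic ValidationError, which is exactly what streaming event parsing hit.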