Release v1.49.4 · BerriAI/litellm

What's Changed

(refactor router.py ) - PR 3 - Ensure all functions under 100 lines by @ishaan-jaff in #6181
[Bug Fix]: fix litellm.caching imports on python SDK by @ishaan-jaff in #6219
LiteLLM Minor Fixes & Improvements (10/14/2024) by @krrishdholakia in #6221
test(router_code_coverage.py): check if all router functions are dire… by @krrishdholakia in #6186
(refactor) use helper function _assemble_complete_response_from_streaming_chunks to assemble complete responses in caching and logging callbacks by @ishaan-jaff in #6220
(refactor) OTEL - use safe_set_attribute for setting attributes by @ishaan-jaff in #6226

Full Changelog: v1.49.3...v1.49.4

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.49.4

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	200.0	212.5333893868387	6.244178319118513	0.0	1869	0	178.2565319999776	1357.8999799999565
Aggregated	Passed ✅	200.0	212.5333893868387	6.244178319118513	0.0	1869	0	178.2565319999776	1357.8999799999565