LiteLLM call to Sagemaker failing #9157

swagulkarni · 2025-03-12T01:55:48Z

I am trying to invoke model endpoint hosted on Sagemaker. Here is my entry in config.yaml

model_list:
  - model_name: jumpstart-model
    litellm_params:
      model: sagemaker/jumpstart-dft-meta-textgeneration-l-20250311-203442
      aws_access_key_id: <Access_key_id>
      aws_secret_access_key: <Secret_access_key>
      aws_region_name: us-east-1

Here is the curl command

curl --location 'http://0.0.0.0:4000/chat/completions' \
--header 'Content-Type: application/json' \
--data ' {
      "model": "jumpstart-model",
      "hf_model_name": "meta-llama/Llama-2-7b-chat-hf",
      "messages": [
        {
          "role": "user",
          "content": "what llm are you"
        }
      ]
    }
'

But getting this error

{"error":{"message":"litellm.ServiceUnavailableError: SagemakerException - Too little data for declared Content-Length. Received Model Group=jumpstart-model\nAvailable Model Group Fallbacks=None","type":null,"param":null,"code":"503"}

What is the root cause here?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LiteLLM call to Sagemaker failing #9157

LiteLLM call to Sagemaker failing #9157

swagulkarni commented Mar 12, 2025

LiteLLM call to Sagemaker failing #9157

LiteLLM call to Sagemaker failing #9157

Comments

swagulkarni commented Mar 12, 2025