Amazon SageMaker Llama 2 Inference via Response Streaming
sagemaker
sagemaker-endpoint
response-streaming
large-language-models
text-generation-inference
llama2
large-model-inference
-
Updated
Jun 28, 2024 - Jupyter Notebook