ServerlessLLM / ServerlessLLM (Star 390; Python; updated Dec 24, 2024): Serverless LLM Serving for Everyone. Topics: cuda, pytorch, model-serving, model-as-a-service, huggingface-transformers, large-language-models, serverless-inference.
Picovoice / serverless-picollm (Star 10; Python; updated Jun 3, 2024): LLM Inference on AWS Lambda. Topics: aws-lambda, serverless, llm, serverless-inference, llm-inference, llm-compression.
tensorchord / modelz-py (Star 7; Python; updated Oct 12, 2023): Python SDK and CLI for modelz.ai, which is a developer-first platform for prototyping and deploying machine learning models. Topics: machine-learning, serverless, inference, llm, serverless-inference.