A unified management platform for large language models, providing access to multiple vendors and models through a single OpenAI-compatible API.
- 🚀 Multi-vendor Support: Unified access to OpenAI, Anthropic, Google, Alibaba, Baidu, Tencent, ByteDance, Zhipu, and more
- 🔄 Model Switching: Dynamically switch between different AI models
- 🎨 Visual Models: Support for GPT-4 Vision, Claude, and other visual understanding models
- 📊 Management Dashboard: Gradio-based web interface for easy model management
- 🔧 Unified API: Single API endpoint compatible with OpenAI format
- ⚡ High Performance: Built with FastAPI for optimal performance
- 🔐 Secure: API key management and secure configuration
```
┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   Client Apps   │    │  SK-LLM Gateway │    │  AI Providers   │
│                 │───▶│                 │───▶│  OpenAI         │
│ - Mobile Apps   │    │ - Unified API   │    │  Anthropic      │
│ - Web Apps      │    │ - Model Mgmt    │    │  Google         │
│ - Desktop Apps  │    │ - Load Balance  │    │  Alibaba        │
└─────────────────┘    └─────────────────┘    │  Baidu          │
                                              │  ...            │
                                              └─────────────────┘
```
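Because the gateway exposes an OpenAI-compatible endpoint, any OpenAI-style client can talk to it. Here is a minimal stdlib sketch of building such a request; the endpoint path comes from the curl examples in this README, while the idea that the gateway falls back to its active model when `model` is omitted is an assumption based on those examples:

```python
import json
from urllib import request

# Unified chat endpoint exposed by the gateway (see the curl examples in this README)
GATEWAY_URL = "http://localhost:8000/api/v1/chat/completions"

def build_chat_request(messages, model=None, stream=False):
    """Build an OpenAI-format chat request for the gateway's unified endpoint.

    `model` is optional here on the assumption that the gateway uses its
    currently active model when none is given, as the curl example does.
    """
    payload = {"messages": messages, "stream": stream}
    if model is not None:
        payload["model"] = model
    return request.Request(
        GATEWAY_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request([{"role": "user", "content": "Hello"}])
# urllib.request.urlopen(req) would send it once the gateway is running
print(req.full_url)
```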
macOS/Linux:

```bash
curl -LsSf https://astral.sh/uv/install.sh | sh
```

Windows:

```bash
powershell -c "irm https://astral.sh/uv/install.ps1 | iex"
```

```bash
# Enter project directory
cd backend-llm

# Run with uv (auto handles virtual environment)
uv run uvicorn main:app --reload
```

```bash
# Copy environment config file
cp .env.example .env

# Edit .env file, add your API keys
nano .env
```

```bash
# Run with uv (auto handles virtual environment)
uv run uvicorn main:app --host 0.0.0.0 --port 8000 --reload
```

- Management Dashboard: http://localhost:8000/admin
- API Documentation: http://localhost:8000/docs
- Alternative API Docs: http://localhost:8000/redoc
- Visit the management dashboard and click the "Model Testing" tab
- Test language models: enter messages and view responses
- Test visual models: upload images and enter descriptions
- OpenAI: GPT-4, GPT-3.5, etc.
- Anthropic: Claude 3, Claude 2
- Google: Gemini Pro, Gemini Vision
- Alibaba: Tongyi Qianwen
- Baidu: ERNIE Bot
- Tencent: Hunyuan
- ByteDance: Skylark
- Zhipu: GLM-4
- DeepSeek: DeepSeek Chat
- OpenAI: GPT-4 Vision
- Anthropic: Claude 3 Vision
- Google: Gemini Pro Vision
- Zhipu: GLM-4 Vision
```bash
curl -N -X POST "http://localhost:8000/api/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [{"role": "user", "content": "Write a quick sort in Python"}],
    "stream": true
  }'
```

```bash
curl -X POST "http://localhost:8000/api/v1/vision/completions" \
  -F "image=@path/to/your/image.jpg" \
  -F "prompt=What's in this image?" \
  -F "model=gpt-4-vision-preview"
```

After starting the service, visit http://localhost:8000/admin to access:
- Switch between active language and vision models
- Dynamically add new model configurations
- View model status and configuration info
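The `stream: true` chat request shown earlier returns incremental chunks. Assuming the gateway follows the OpenAI server-sent-events convention (`data: {...}` lines terminated by a `data: [DONE]` sentinel, which this README does not spell out), a client can reassemble the text like this:

```python
import json

def collect_stream(lines):
    """Reassemble assistant text from OpenAI-style SSE lines.

    Assumes chunks of the form `data: {"choices": [{"delta": {"content": ...}}]}`
    and a terminating `data: [DONE]` sentinel.
    """
    parts = []
    for line in lines:
        line = line.strip()
        if not line.startswith("data: "):
            continue  # skip blank keep-alive lines
        data = line[len("data: "):]
        if data == "[DONE]":
            break
        chunk = json.loads(data)
        delta = chunk["choices"][0]["delta"]
        parts.append(delta.get("content", ""))
    return "".join(parts)

# Simulated stream, standing in for the gateway's live response
sample = [
    'data: {"choices": [{"delta": {"content": "Hello"}}]}',
    'data: {"choices": [{"delta": {"content": ", world"}}]}',
    "data: [DONE]",
]
print(collect_stream(sample))  # → Hello, world
```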
- Language Model Testing: enter custom test messages on the left side of the "Model Testing" tab
- Visual Model Testing: upload images and run mixed text-and-image tests on the right side
- Upload an image and enter a description to run a complete visual understanding test
- Images are automatically converted to base64, and common image formats are supported
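The same base64 conversion the dashboard performs is easy to reproduce client-side; a minimal sketch using the standard library (the bytes below are a stand-in, not a real image):

```python
import base64

def image_to_base64(data: bytes) -> str:
    """Encode raw image bytes as base64 text, as the dashboard does before sending."""
    return base64.b64encode(data).decode("ascii")

# A tiny stand-in for real image bytes (a GIF magic header), just to show the round trip
fake_image = b"GIF89a"
encoded = image_to_base64(fake_image)
print(encoded)
assert base64.b64decode(encoded) == fake_image  # lossless round trip
```

In practice you would read the bytes with `open("image.jpg", "rb").read()` and place the encoded string in the request body.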
For development, we recommend using uv and uvicorn directly, as it supports hot reload:

```bash
# Run with uv, no need to manually activate virtual environment
uv run uvicorn main:app --host 0.0.0.0 --port 8000 --reload
```

In production, we don't recommend using uvicorn directly. Instead, use a more robust process manager.
This is the classic combination for deploying Python ASGI applications (like FastAPI): Gunicorn acts as the process manager, starting and supervising multiple Uvicorn worker processes, which makes better use of multi-core servers and keeps the service stable.
1. Install gunicorn:

   ```bash
   uv pip install gunicorn
   ```

2. Start the service with gunicorn:

   ```bash
   # Start 4 Uvicorn worker processes
   gunicorn -w 4 -k uvicorn.workers.UvicornWorker main:app -b 0.0.0.0:8000
   ```

   - `-w 4`: Number of worker processes; `(2 * CPU cores) + 1` is the usual recommendation
   - `-k uvicorn.workers.UvicornWorker`: Tells Gunicorn to use the Uvicorn worker class
   - `-b 0.0.0.0:8000`: Binds the service address and port
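The `(2 * CPU cores) + 1` rule of thumb can be computed at deploy time; a small sketch (this is the general Gunicorn guideline, not something this project configures automatically):

```python
import os

def recommended_workers() -> int:
    """Gunicorn's rule-of-thumb worker count: (2 * CPU cores) + 1."""
    cores = os.cpu_count() or 1  # os.cpu_count() may return None
    return 2 * cores + 1

print(recommended_workers())
```

The printed value can then be passed to gunicorn's `-w` flag instead of a hard-coded `4`.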
Packaging the entire application into a Docker image is the standard way to deploy in modern cloud environments. Here's a sample Dockerfile:
```dockerfile
# Use official Python base image
FROM python:3.11-slim

# Set working directory
WORKDIR /app

# Install uv
RUN pip install uv

# Copy application code and dependency definitions
# (the editable install below needs the source present)
COPY . .

# Install dependencies with uv
# --system installs into the system Python environment, as the container is already isolated
RUN uv pip install --system -e .

# Expose port
EXPOSE 8000

# Container start command (using Gunicorn)
CMD ["gunicorn", "-w", "4", "-k", "uvicorn.workers.UvicornWorker", "main:app", "-b", "0.0.0.0:8000"]
```

```
backend-llm/
├── app/
│   ├── admin/           # Gradio management interface
│   ├── api/             # API endpoints
│   ├── core/            # Core configuration
│   ├── schemas/         # Data models
│   └── services/        # Business logic
├── config/              # Model configurations
│   ├── models.yaml      # Model definitions
│   └── providers/       # Provider configs
├── docs/                # Provider-specific docs
├── main.py              # Application entry point
├── pyproject.toml       # Project dependencies
└── requirements.txt     # Python requirements
```
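For local orchestration, a minimal docker-compose sketch could wrap the Dockerfile above (the service name, port mapping, and `.env` reference are illustrative assumptions, not files shipped by this repo):

```yaml
services:
  sk-llm-gateway:
    build: .              # builds the sample Dockerfile above
    ports:
      - "8000:8000"       # expose the unified API and dashboard
    env_file:
      - .env              # API keys, see the configuration section
    restart: unless-stopped
```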
Copy .env.example to .env and configure:
```bash
# Platform Configuration
PLATFORM_NAME=SK-LLM Gateway
DEBUG=false

# API Keys (Add your keys)
OPENAI_API_KEY=your_openai_key
ANTHROPIC_API_KEY=your_anthropic_key
GOOGLE_API_KEY=your_google_key
ALIBABA_API_KEY=your_alibaba_key
BAIDU_API_KEY=your_baidu_key
TENCENT_API_KEY=your_tencent_key
BYTEDANCE_API_KEY=your_bytedance_key
ZHIPU_API_KEY=your_zhipu_key
DEEPSEEK_API_KEY=your_deepseek_key
```

Edit config/models.yaml to customize available models:
```yaml
language_models:
  - provider: openai
    model: gpt-4
    display_name: GPT-4
    max_tokens: 8192
  - provider: anthropic
    model: claude-3-opus
    display_name: Claude 3 Opus
    max_tokens: 200000
```

- Fork the repository
- Create your feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add some amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Repository: https://github.com/sihuangtech/sk-llm-gateway
- Documentation: API_DOCS.md
- Issue Tracker: https://github.com/sihuangtech/sk-llm-gateway/issues
If you have any questions or need help, please:
- Check the documentation
- Search existing issues
- Create a new issue with detailed description
Star this repo if you find it helpful!