GitHub

Deploy LLMs to your own cluster & cloud via web browser

LMIgnite ignites your cluster with LLM deployments. It is your one-click solution to deploy high-performance, enterprise-grade LLM serving infrastructure into your own cluster and cloud environments.

Why LMIgnite?

LMIgnite is:

Self-hosted: You run your LLM with your own machines. It's cheap and private.
High-performance: We give you the best performance by deeply integrating open-source LLM projects, including inference engine (vLLM ), inter-inference-engine communication (LMCache ) and production-level orchestration (vLLM production stack ).

Feature highlights:

🌐 Easy-to-use: Deploy LLMs just by your browser.
🚀 One-click runnable: Run the bash script, and the webpage for LLM deployment will pop up for you.
⚡ 3-10x faster response times through our own communication library (LMCache ) and orchestration library (vLLM production stack ).
🏢 Enterprise-ready with multi-tenancy, autoscaling, and high availability
🔧 Wide support across clouds such as AWS, GCP, Azure, and Lambda, plus on-premises
📊 Built-in monitoring and performance analytics

Prerequisites

Before you begin, make sure you have the following:

Lambda Labs API key to connect to Lambda Cloud (the support for other clouds and self-host Kubernetes cluster is coming soon!)
Hugging Face access token for model access

Visit the documentation on how to get them.

Currently, we support MacOS with automatic dependency installation. For users with other OS (Windows/Linux), you may need to install docker compose manually before using the following one-click script.

Quick Start

1. Run the one-click installation script

Video tutorial:

(Make sure you have lambda API key and huggingface token in hand)

Hit Command(⌘) + Space on your keyboard, and type terminal (or other terminal that you like), and then hit enter to launch the terminal.

Then, simply copy and paste the following command to your terminal

bash <(curl -fsSL https://raw.githubusercontent.com/LMCache/LMIgnite/refs/heads/main/install.command)

This script will guide you through the installation process, and will pop up a browser for you to deploy LLMs in your own cloud!

2. Create a Cluster

Video tutorial:

In the left sidebar, click Cluster, then hit + Create Cluster
Fill in Cluster Configuration:
- Name (e.g., test)
- Cloud Provider (e.g., Lambda Labs)
- Region (e.g., us-south-1)
- GPU Type & Count (e.g., 8 × H100)
- Hugging Face Token (paste your HF access token)
Click Create Cluster at the bottom right
Wait until the status shows Active (Pending → init → wait_k8s → Active)

3. Create Deployments

Video tutorial:

In the left sidebar, click Deployments, then hit + Create Deployment
Search or select from existing model cards (e.g., meta-llama/Llama-3.1-8B-Instruct)
Configure basics:
- Deployment Name: give it a descriptive name (e.g., llama-8b-test)
- Target Cluster: select one of your Active clusters
Click Create Deployment to quick-start, or Next: Advanced for fine control
Monitor the deployment status progression

4. Play with your deployment

TBD: include the chatting and the curl request section in README and examples.

Documentation

Check our online documentation for detailed explanations!

Tips

If port 3001 is in use, you can change it in docker-compose.yml
If cluster creation fails, try switching to a different region
Some models on Hugging Face are "gated" and require access approval
Check deployment logs if creation fails

Contributing

We welcome contributions! Please check our documentation for development guidelines.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.github/workflows		.github/workflows
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
install.command		install.command

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Deploy LLMs to your own cluster & cloud via web browser

Why LMIgnite?

Prerequisites

Quick Start

1. Run the one-click installation script

Video tutorial:

2. Create a Cluster

Video tutorial:

3. Create Deployments

Video tutorial:

4. Play with your deployment

Documentation

Tips

Contributing

About

Uh oh!

Releases

Packages

Languages

License

nijaba/LMIgnite

Folders and files

Latest commit

History

Repository files navigation

Deploy LLMs to your own cluster & cloud via web browser

Why LMIgnite?

Prerequisites

Quick Start

1. Run the one-click installation script

Video tutorial:

2. Create a Cluster

Video tutorial:

3. Create Deployments

Video tutorial:

4. Play with your deployment

Documentation

Tips

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages