An easy-to-use, high-performance(?) backend for serving LLMs and other AI models, built on FastAPI.
```shell
pip install fastmindapi
```
```shell
# in Shell
fastmindapi-server --port 8000
```

```python
# in Python
import fastmindapi as FM

server = FM.Server()
server.run()
```
```shell
curl http://IP:PORT/docs#/
```
```python
import fastmindapi as FM

client = FM.Client(IP="x.x.x.x", PORT=xxx)  # 127.0.0.1:8000 by default

client.add_model_info_list(model_info_list)
client.load_model(model_name)
client.generate(model_name, generation_request)
```
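The payload shapes below are illustrative guesses, not the library's documented schema; the authoritative request models are listed on the running server's `/docs` page.

```python
# Hypothetical example values for the client calls above.
# Every field name here is an assumption -- check the server's /docs
# page (the FastAPI interactive schema) for the real request models.
model_info_list = [
    {
        "model_name": "gemma-2b",              # name referenced by later calls
        "model_type": "TransformersCausalLM",  # one of the supported backends
        "model_path": "/path/to/gemma-2b",     # local checkpoint directory
    }
]
model_name = model_info_list[0]["model_name"]

generation_request = {
    "input_text": "Hello! Briefly introduce yourself.",
    "max_new_tokens": 64,
}
```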
🪧 We primarily maintain the backend server; the client is provided for reference only, since the main usage is sending HTTP requests directly. (We may release FM-GUI in the future.)
- ✅ Transformers
  - `TransformersCausalLM` (`AutoModelForCausalLM`)
  - `PeftCausalLM` (`PeftModelForCausalLM`)
- ✅ llama.cpp
  - `LlamacppLM` (`Llama`)
- ...
- Function Calling (extra tools in Python)
- Retrieval
- Agent
- ...
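Since the "extra tools" for Function Calling are plain Python functions, a tool could look like the sketch below; the function itself is hypothetical, and how it gets registered with the server is not shown here — consult the project documentation for the actual registration API.

```python
# A plain Python function of the kind the Function Calling feature can
# expose as a tool. Name, signature, and behavior are illustrative only;
# registering it with the FastMindAPI server is a separate step.
def get_weather(city: str) -> str:
    """Return a canned weather report for `city` (illustrative only)."""
    return f"The weather in {city} is sunny."

print(get_weather("Paris"))  # -> The weather in Paris is sunny.
```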
- Load models at coding time or at runtime
- Add any APIs you want