A Workflow-based Multi-platform AI Inference and Deployment Framework | 一款基于工作流的多端AI推理部署框架
-
Updated
Oct 3, 2025 - C++
A Workflow-based Multi-platform AI Inference and Deployment Framework | 一款基于工作流的多端AI推理部署框架
A mock Azure OpenAI API for seamless testing and development, supporting both streaming and non-streaming responses. Easily emulate OpenAI completions with token-based streaming in a local or Dockerized environment.
Multi-agentic researcher (RAG)
A Streamlit-based spam classifier that predicts whether a message is spam or not spam using machine learning.
Comprehensive guide to FastAPI, Pydantic, and SQLAlchemy for AI engineers. Learn API design, validation, and ORM workflows with practical examples and setup 🐙
Compare PyTorch vs Triton inference latency with CLI tools, benchmarks, and performance plots.
Add a description, image, and links to the ai-deployment topic page so that developers can more easily learn about it.
To associate your repository with the ai-deployment topic, visit your repo's landing page and select "manage topics."