The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Extension for Scikit-learn: a seamless way to speed up your Scikit-learn applications
A workflow-based, multi-platform AI inference and deployment framework
oneAPI Data Analytics Library (oneDAL)
The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
Cross-platform C++ SDK & model hub for easy AI inference
Client library to interact with various APIs used within Philips in a simple and uniform way
GPU-aware inference mesh for large-scale AI serving
Unity TTS plugin: Piper neural synthesis + OpenJTalk Japanese + Unity AI Inference Engine. Windows/Mac/Linux/Android ready. High-quality voices for games & apps.
Local LLM Inference Library
Customized version of Google's tflite-micro
No more Hugging Face cost leaks.
A powerful, fast, scalable full-stack boilerplate for AI inference using Node.js, Python, Redis, and Docker
Arbitrary Numbers
🌱 Intelligent IoT greenhouse fan controller using AI/ML for automated climate control. Features ESP32 + DHT22 sensors, real-time Firebase integration, Flutter mobile app with TensorFlow Lite on-device inference, and Wokwi simulation. Complete full-stack solution demonstrating IoT + AI integration.
UniUi uses AI to allow you to talk directly to your system.
AI Inference in GitHub Actions demo
Citadel AI OS – Enterprise AI Runtime Environment for Inference, Agents, and Business Operations