codezjx

Stars

llm

19 repositories (each entry: description, then language, stars, forks, last update)

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 115,870 9,227 Updated Jan 30, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 63,857 7,626 Updated Jan 31, 2025

DeepSeek LLM: Let there be answers

Makefile 4,898 665 Updated Feb 4, 2024

LLM inference in C/C++

C++ 72,397 10,431 Updated Jan 31, 2025

Universal LLM Deployment Engine with ML Compilation

Python 19,793 1,639 Updated Jan 24, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 17,960 1,286 Updated Jan 27, 2025

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,147 468 Updated Nov 6, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,081 1,154 Updated May 23, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 13,194 856 Updated Jan 23, 2025

Ollama Python library

Python 6,079 515 Updated Jan 28, 2025

Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework

Python 1,295 390 Updated Jan 6, 2025

Examples and guides for using the Gemini API

Jupyter Notebook 10,536 1,238 Updated Jan 31, 2025

446 15 Updated Aug 9, 2023

A primitive library for neural network

C++ 1,308 217 Updated Nov 24, 2024

Mamba SSM architecture

Python 13,864 1,193 Updated Jan 18, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,656 5,404 Updated Jan 31, 2025

MiniCPM on Android platform.

Python 624 50 Updated Apr 11, 2024

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,457 100 Updated Jan 23, 2025

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Dart 1,646 185 Updated Nov 4, 2024