Change the repository type filter
All
Repositories list
19 repositories
- 📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
hgemm-mma
Publicffpa-attn-mma
Public📚FFPA(Split-D): Yet another Faster Flash Prefill Attention with O(1) GPU SRAM complexity for headdim > 256, ~2x↑🎉vs SDPA EA.Awesome-LLM-Inference
Public📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism, etc. 🎉🎉lite.ai.toolkit
Publicflashinfer
Public- 📒200-page PDF Notes for "Statistical Learning Methods-Li Hang", detailed explanations of various math formulas, implemented in R.🎉
torchlm
Public💎A high level python lib for face landmarks detection: training, eval, export, inference(Python/C++) and 100+ data augmentations.RVM-Inference
Publicnetron-vscode-extension
Publicyolov5face-toolkit
Public🍅 YOLO5Face 2021 with MNN/NCNN/TNN/ONNXRuntimessrnet-toolkit
Publicfsanet-toolkit
Publicmgmatting-toolkit
Publicscrfd-toolkit
Public🍅🍅 Super fast accurate face detector ! SCRFD(CVPR 2021) with MNN/TNN/NCNN/ONNXRuntime C++. (https://github.com/DefTruth/lite.ai.toolkit)nanodet-toolkit
Public🍅🍅NanoDet、NanoDet-Plus with ONNXRuntime/MNN/TNN/NCNN C++. (https://github.com/DefTruth/lite.ai.toolkit)yolox-toolkit
Public