📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Pre-built wheels that erase Flash Attention 3 installation headaches.
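A minimal sketch of how such a wheel might be consumed, with a graceful fallback: it probes for a FlashAttention-3 build and otherwise uses PyTorch's built-in fused attention. The `flash_attn_interface` import path and the tuple-shaped return value are assumptions about the FA3 beta packaging, not guarantees about these wheels.

```python
import torch
import torch.nn.functional as F

try:
    # Import path used by recent FlashAttention-3 builds (an assumption:
    # wheel layouts and module names can vary between distributions).
    from flash_attn_interface import flash_attn_func
except ImportError:
    flash_attn_func = None

def attention(q, k, v):
    """q, k, v: (batch, seq_len, num_heads, head_dim)."""
    if flash_attn_func is not None:
        out = flash_attn_func(q, k, v)
        # Some builds return (out, softmax_lse); keep only the output.
        return out[0] if isinstance(out, tuple) else out
    # Fallback: PyTorch's fused SDPA expects (batch, heads, seq, dim).
    q, k, v = (t.transpose(1, 2) for t in (q, k, v))
    return F.scaled_dot_product_attention(q, k, v).transpose(1, 2)
```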
Qwen-Image-Edit-2509-LoRAs-Fast is a Gradio web application that uses the Qwen/Qwen-Image-Edit-2509 model from Hugging Face for fast, user-friendly image editing.
Toy Flash Attention implementation in torch
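For reference, here is a minimal sketch of the tiled online-softmax idea behind Flash Attention in plain PyTorch. The block size and shapes are illustrative assumptions; a real kernel fuses these loops on-chip instead of materializing tiles in Python.

```python
import torch

def toy_flash_attention(q, k, v, block_size=64):
    """Tiled attention with an online softmax, the core trick of
    Flash Attention. Shapes: q, k, v are (seq_len, head_dim)."""
    seq_len, head_dim = q.shape
    scale = head_dim ** -0.5
    out = torch.zeros_like(q)
    # Running max and normalizer for the online softmax.
    row_max = torch.full((seq_len, 1), float("-inf"))
    row_sum = torch.zeros(seq_len, 1)

    for start in range(0, seq_len, block_size):
        kb = k[start:start + block_size]           # key tile (B, d)
        vb = v[start:start + block_size]           # value tile (B, d)
        scores = (q @ kb.T) * scale                # (seq_len, B)

        block_max = scores.max(dim=-1, keepdim=True).values
        new_max = torch.maximum(row_max, block_max)
        # Rescale previously accumulated results to the new running max.
        correction = torch.exp(row_max - new_max)
        p = torch.exp(scores - new_max)

        row_sum = row_sum * correction + p.sum(dim=-1, keepdim=True)
        out = out * correction + p @ vb
        row_max = new_max

    return out / row_sum

# Sanity check against the naive quadratic implementation.
q, k, v = (torch.randn(128, 32) for _ in range(3))
ref = torch.softmax((q @ k.T) * 32 ** -0.5, dim=-1) @ v
assert torch.allclose(toy_flash_attention(q, k, v), ref, atol=1e-5)
```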
Demonstration of the Qwen/Qwen-Image-Edit-2511 model with lazy-loaded LoRA adapters for advanced single- and multi-image editing. Supports 7 specialized LoRAs, including photo-to-anime, multi-angle camera control, pose transfer (Any-Pose), upscaling, style transfer, light migration, and manga tone. Features fast inference (4 steps by default).
Demonstration of the Qwen/Qwen-Image-Edit-2511 model, specialized in object manipulation via lazy-loaded LoRA adapters. Supports adding or removing specific elements (e.g., logos, accessories, clothing) in single- or multi-image inputs while preserving lighting, realism, and background details. Features precise prompt control and fast inference.
Demonstration of the Qwen/Qwen-Image-Edit-2509 model, enhanced with lazy-loaded LoRA adapters for specialized image editing tasks such as texture application, object fusion, material transfer, and light migration. Uses a fused Lightning LoRA for rapid inference (4 steps by default); a sketch of the lazy-loading pattern follows below.
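These demos share one pattern: load the base edit model once, then attach task-specific LoRAs only when first requested. Below is a minimal sketch of that lazy-loading pattern with diffusers. The adapter registry, repo IDs, and weight filenames are placeholders, and the actual Spaces may wire this up differently.

```python
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image

# Load the base editing pipeline once; DiffusionPipeline auto-resolves the
# concrete pipeline class for the checkpoint in recent diffusers releases.
pipe = DiffusionPipeline.from_pretrained(
    "Qwen/Qwen-Image-Edit-2509", torch_dtype=torch.bfloat16
).to("cuda")

# Hypothetical adapter registry: repo IDs and filenames are placeholders.
LORAS = {
    "photo-to-anime": ("some-org/qwen-edit-anime-lora", "anime.safetensors"),
    "upscale": ("some-org/qwen-edit-upscale-lora", "upscale.safetensors"),
}
_loaded = set()

def apply_lora(name: str) -> None:
    """Lazy-load a LoRA on first use, then make it the active adapter."""
    if name not in _loaded:
        repo_id, weight_name = LORAS[name]
        pipe.load_lora_weights(repo_id, weight_name=weight_name,
                               adapter_name=name)
        _loaded.add(name)
    pipe.set_adapters([name])

apply_lora("photo-to-anime")
result = pipe(image=load_image("input.png"),
              prompt="turn the photo into anime style",
              num_inference_steps=4).images[0]  # few-step Lightning-style preset
```

Fusing a Lightning-style LoRA into the base weights (`pipe.fuse_lora()`) trades this flexibility for lower per-call overhead, which is presumably why the fixed-task demo above fuses while the multi-LoRA demos load lazily.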