Long-RL: Scaling RL to Long Sequences (Python, updated Sep 8, 2025)
Survey: https://arxiv.org/pdf/2507.20198
A High-Efficiency System of Large Language Model Based Search Agents
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
TinyML and Efficient Deep Learning Computing | MIT 6.S965/6.5940
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead, without fine-tuning.
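To illustrate the general idea of per-head sparse attention masking (not DAM's actual mask-generation algorithm — the function names and the local-window-plus-sink pattern below are hypothetical illustrations), a mask can restrict each query position to a small set of keys before the softmax:

```python
import math

def sparse_attention_mask(seq_len, window, n_sink=1):
    """Build a boolean mask: causal local window plus a few 'sink' tokens.

    This is a common sparse pattern used here only as an example; real
    adaptive methods derive the pattern per layer/head from the data.
    """
    mask = [[False] * seq_len for _ in range(seq_len)]
    for q in range(seq_len):
        # Causal local window: each query attends to its recent neighbors.
        for k in range(max(0, q - window + 1), q + 1):
            mask[q][k] = True
        # Always attend to the first n_sink tokens ("attention sinks").
        for k in range(min(n_sink, seq_len)):
            mask[q][k] = True
    return mask

def masked_softmax(scores, mask):
    """Softmax over each row, with disallowed positions forced to zero weight."""
    out = []
    for row, allow in zip(scores, mask):
        exps = [math.exp(s) if a else 0.0 for s, a in zip(row, allow)]
        z = sum(exps)
        out.append([e / z for e in exps])
    return out

# Usage: with uniform scores, weight spreads evenly over allowed positions only.
mask = sparse_attention_mask(seq_len=6, window=3)
weights = masked_softmax([[0.0] * 6 for _ in range(6)], mask)
```

Because each row attends to at most `window + n_sink` keys, the attention cost per query drops from O(n) to O(window), which is where the compute and memory savings for long contexts come from.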
Official PyTorch implementation of the paper "Dataset Distillation via the Wasserstein Metric" (ICCV 2025).
Task-Aware Dynamic Model Optimization for Multi-Task Learning (IEEE Access 2023)
Code for paper "Automated Design for Hardware-aware Graph Neural Networks on Edge Devices"