Change the repository type filter
All
Repositories list
51 repositories
- [NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
- EfficientViT is a new family of vision models for efficient high-resolution vision.
torchsparse
Public- [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
- A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
smoothquant
Publicspvnas
Public archive[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolutiontemporal-shift-module
Public[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understandingstreaming-llm
Publictinychat-tutorial
Public- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
TinyChatEngine
PublicTinyChatEngine: On-Device LLM Inference LibraryQuest
Public- [CVPR'22] Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
- [CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs
mcunet
Public[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning- On-Device Training Under 256KB Memory [NeurIPS'22]
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment