MIT HAN Lab

All

51 repositories

data-efficient-gans
Public
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
tensorflow generative-adversarial-network image-generation gans data-efficient neurips-2020 pytorch
Python
•
BSD 2-Clause "Simplified" License
•175•1.3k•27•0•Updated Sep 24, 2024Sep 24, 2024
qserve
Public
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Python
•
Apache License 2.0
•19•408•23•2•Updated Sep 5, 2024Sep 5, 2024
proxylessnas
Public
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
acceleration automl specialization efficient-model on-device-ai hardware-aware
C++
•
MIT License
•284•1.4k•0•2•Updated Aug 30, 2024Aug 30, 2024
spatten
Public
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
rtl attention hardware-acceleration spinalhdl llm-inference
Scala
•
MIT License
•7•70•1•0•Updated Aug 27, 2024Aug 27, 2024
fastcomposer
Public
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Python
•
MIT License
•36•652•16•0•Updated Aug 21, 2024Aug 21, 2024
distrifuser
Public
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
acceleration parallelism generative-model diffusion-models generative-ai
Python
•
MIT License
•21•562•7•0•Updated Aug 17, 2024Aug 17, 2024
efficientvit
Public
EfficientViT is a new family of vision models for efficient high-resolution vision.
imagenet segmentation high-resolution vision-transformer efficientvit segment-anything
Python
•
Apache License 2.0
•164•1.8k•88•3•Updated Aug 9, 2024Aug 9, 2024
torchsparse
Public
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
acceleration pytorch
Cuda
•
MIT License
•139•1.2k•22•1•Updated Jul 31, 2024Jul 31, 2024
bevfusion
Public archive
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
camera pytorch lidar object-detection sensor-fusion semantic-segmentation 3d-perception
Python
•
Apache License 2.0
•409•2.3k•0•0•Updated Jul 31, 2024Jul 31, 2024
torchquantum
Public
A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
machine-learning system deep-learning neural-network quantum pytorch quantum-computing quantum-machine-learning quantum-simulation ml-for-systems
Jupyter Notebook
•
MIT License
•198•1.3k•58•7•Updated Jul 21, 2024Jul 21, 2024
llm-awq
Public
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python
•
MIT License
•184•2.4k•123•8•Updated Jul 16, 2024Jul 16, 2024
hardware-aware-transformers
Public
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
natural-language-processing machine-translation transformer specialization efficient-model hardware-aware
Python
•
Other
•50•326•3•0•Updated Jul 14, 2024Jul 14, 2024
smoothquant
Public
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Python
•
MIT License
•139•1.2k•62•1•Updated Jul 12, 2024Jul 12, 2024
spvnas
Public archive
[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
computer-vision deep-learning efficiency pytorch lidar architecture-search point-cloud 3d-deep-learning semantickitti
Python
•
MIT License
•109•583•0•0•Updated Jul 11, 2024Jul 11, 2024
lite-transformer
Public archive
[ICLR 2020] Lite Transformer with Long-Short Range Attention
nlp pytorch transformer
Python
•
Other
•81•596•0•0•Updated Jul 11, 2024Jul 11, 2024
temporal-shift-module
Public
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
acceleration low-latency video-understanding efficient-model temporal-modeling tsm nvidia-jetson-nano
Python
•
MIT License
•418•2.1k•93•6•Updated Jul 11, 2024Jul 11, 2024
streaming-llm
Public
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Python
•
MIT License
•361•6.6k•39•2•Updated Jul 11, 2024Jul 11, 2024
tinychat-tutorial
Public
C++
•13•37•3•1•Updated Jul 10, 2024Jul 10, 2024
tinyengine
Public
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
c microcontroller cpp pytorch codegenerator tinyml deep-learning quantization edge-computing neural-architecture-search
C
•
MIT License
•131•795•33•1•Updated Jul 8, 2024Jul 8, 2024
TinyChatEngine
Public
TinyChatEngine: On-Device LLM Inference Library
c arm deep-learning cpp x86-64 quantization edge-computing cuda-programming on-device-ai large-language-models
C++
•
MIT License
•68•720•31•3•Updated Jul 4, 2024Jul 4, 2024
Quest
Public
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
Cuda
•11•174•3•0•Updated Jul 3, 2024Jul 3, 2024
lmquant
Public
Python
•
Apache License 2.0
•7•107•12•0•Updated Jun 12, 2024Jun 12, 2024
litepose
Public
[CVPR'22] Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
pose-estimation efficient-models litepose
Python
•
MIT License
•37•304•19•1•Updated Jun 5, 2024Jun 5, 2024
gan-compression
Public
[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs
compression pytorch gans pix2pix cyclegan image-to-image-translation conditional-gans gaugan
Python
•
Other
•148•1.1k•3•6•Updated Jun 5, 2024Jun 5, 2024
patch_conv
Public
Patch convolution to avoid large GPU memory usage of Conv2D
Python
•
MIT License
•5•74•1•1•Updated May 26, 2024May 26, 2024
sparsevit
Public
[CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Python
•
Apache License 2.0
•5•59•2•0•Updated Apr 24, 2024Apr 24, 2024
mcunet
Public
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
deep-learning pytorch neural-architecture-search tinyml microncontroller
Python
•
MIT License
•82•462•22•2•Updated Mar 29, 2024Mar 29, 2024
tiny-training
Public
On-Device Training Under 256KB Memory [NeurIPS'22]
edge-ai on-device-training learning-on-the-edge
Python
•
MIT License
•60•432•7•0•Updated Mar 29, 2024Mar 29, 2024
once-for-all
Public
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
acceleration nas automl edge-ai efficient-model tinyml
Python
•
MIT License
•334•1.9k•53•6•Updated Dec 14, 2023Dec 14, 2023
tinyml
Public
Python
•
MIT License
•138•750•6•1•Updated Nov 29, 2023Nov 29, 2023