- Minsk
Highlights
- Pro
Stars
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
High-resolution models for human tasks.
Official inference repo for FLUX.1 models
[CVPR 2023] DepGraph: Towards Any Structural Pruning
[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs
Learning Dual Memory Dictionaries for Blind Face Restoration
A playbook for systematically maximizing the performance of deep learning models.
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Character Animation (AnimateAnyone, Face Reenactment)
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Official implementations for paper: Anydoor: zero-shot object-level image customization
Concept Sliders for Precise Control of Diffusion Models
An extension for Automatic1111 to work around Stable Diffusion's "clone problem". It automatically modifies your prompts with random names, nationalities, hair style and hair color to create more v…
An A1111 extension for interpolating tokens into embeddings
A library helping to gather stats and run checks during training deep learning models with Pytorch
Demo code for "LOHO: Latent Optimization of Hairstyles via Orthogonalization".
Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" (AIM ECCVW 2022) https://arxiv.org/abs/2201.13433
[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation
A PyTorch reimplementation of the paper Free-Form Image Inpainting with Gated Convolution (DeepFill v2) (https://arxiv.org/abs/1806.03589)
BBDM: Image-to-image Translation with Brownian Bridge Diffusion Models