This repository contains an implementation of latent diffusion models for image generation, as well as the weights of three models trained on three different datasets (the Anime Faces dataset, Google's Cartoon Faces, and the Bitmoji dataset).


Image Generation with Latent Diffusion Models

This repository contains the code for generating images with latent diffusion models, as described in the paper Latent Diffusions for Generative Modeling, tested on the following datasets: Anime Faces, Google Cartoon Faces, and Bitmoji Faces.

Demo: generation process every 50 iterations (Anime Faces dataset)

For all the datasets, the models were trained to generate images unconditionally. Additionally, for the Anime Faces dataset, a model was trained to generate an image from a binary mask (Results); for the Google Cartoon Faces dataset, a model was trained to generate images from categorical attributes (Results); and for the Bitmoji Faces dataset, I experimented with generating images from a color palette (Results).
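As a rough sketch of how such conditioning can be wired in (the names and shapes below are hypothetical, not taken from this repository's code): a spatial conditioning signal such as a binary mask is commonly concatenated with the noisy latent along the channel axis before it enters the denoising network.

```python
import numpy as np

# Hypothetical shapes: a noisy latent z_t of shape (C, H, W) and a
# binary mask of shape (1, H, W) at the same spatial resolution.
latent = np.random.randn(4, 16, 16).astype(np.float32)       # noisy latent z_t
mask = (np.random.rand(1, 16, 16) > 0.5).astype(np.float32)  # binary mask

# Channel-wise concatenation: the denoising U-Net then simply takes
# C + 1 input channels instead of C.
unet_input = np.concatenate([latent, mask], axis=0)
print(unet_input.shape)  # (5, 16, 16)
```

Non-spatial conditions (categorical attributes, a color palette) are more often embedded into a vector and injected via the network's timestep-embedding pathway instead.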

Checkpoints are available on Kaggle here.

VQVAE Training results (Google's cartoon dataset)

  • VQVAE Output during training


  • Original vs. reconstructed images
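The core of a VQ-VAE is the quantization step: each encoder output vector is snapped to its nearest codebook entry. The snippet below is a minimal numpy sketch of that lookup (illustrative only; the trained model in this repository lives in src/, and the sizes here are made up).

```python
import numpy as np

rng = np.random.default_rng(0)
codebook = rng.standard_normal((512, 64))  # K = 512 codes of dimension D = 64
z_e = rng.standard_normal((16 * 16, 64))   # flattened encoder outputs

# Squared Euclidean distance from every latent vector to every code.
d = ((z_e[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
indices = d.argmin(axis=1)                 # index of the nearest code
z_q = codebook[indices]                    # quantized latents fed to the decoder

print(z_q.shape)  # (256, 64)
```

During training, a straight-through estimator copies gradients from z_q back to z_e, since argmin is not differentiable.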

Latent diffusion model results (Google's cartoon dataset)

  • Latent diffusion output during training (every 2000 iterations)


  • Latent diffusion generation process (decoded every 20 timesteps)

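The generation strips above come from running the reverse diffusion in latent space and decoding a snapshot periodically. Below is a self-contained sketch of such a loop with a standard DDPM update; the denoiser and decoder are stand-in functions, and the schedule values are illustrative, not this repository's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)
T = 100
betas = np.linspace(1e-4, 0.02, T)  # linear noise schedule (illustrative)
alphas = 1.0 - betas
alpha_bar = np.cumprod(alphas)

def eps_model(z, t):
    # Stand-in for the trained denoising network.
    return 0.1 * z

def decode(z):
    # Stand-in for the VQ-VAE decoder mapping latents to images.
    return z

z = rng.standard_normal((4, 16, 16))  # start from pure latent noise
snapshots = []
for t in reversed(range(T)):
    eps = eps_model(z, t)
    # DDPM posterior mean: (z_t - beta_t / sqrt(1 - alpha_bar_t) * eps) / sqrt(alpha_t)
    z = (z - betas[t] / np.sqrt(1.0 - alpha_bar[t]) * eps) / np.sqrt(alphas[t])
    if t > 0:
        z += np.sqrt(betas[t]) * rng.standard_normal(z.shape)  # add noise except at t = 0
    if t % 20 == 0:
        snapshots.append(decode(z))  # decode an intermediate image every 20 steps

print(len(snapshots))  # 5 snapshots: t = 80, 60, 40, 20, 0
```

Decoding only every 20 timesteps keeps the visualization cheap, since the decoder forward pass is far more expensive than a single latent denoising step.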

Setup

git clone git@github.com:abdelnour13/images_generation_latent_diffusion_models.git
conda create -n latent-diffusion
conda activate latent-diffusion
pip install -r requirements.txt

Download data

python3 download.py [-h] --datasets {celeb_a,anime_faces,cartoon_faces,anime_faces_2,art_pictograms,bitmojie}

Create an experiment

python3 create.py [-h] [--on-exists {error,overwrite}] --name NAME --type {vqvae,diffusion}

Models training

  • To train the VQVAE
cd src/training
python3 vqvae.py --experiment EXPERIMENT
  • To train the Latent diffusion model
cd src/training
python3 diffusion.py --experiment EXPERIMENT

Note:

You can also launch TensorBoard in a separate terminal window with the command below to visualize the metrics and the generated/reconstructed images while the model is training:

tensorboard --logdir=runs

Acknowledgement
