This project is an open-source implementation of the Stable Diffusion model, aimed at generating high-quality images from textual descriptions. Our implementation combines several key components: a U-Net denoising architecture, a Variational Autoencoder (VAE), CLIP for prompt embedding, a custom scheduler, and time embeddings that condition the U-Net on the current diffusion timestep.
Special thanks to Umar Jamil for his invaluable contributions to this project. We also extend our gratitude to the authors of the original Stable Diffusion paper for their groundbreaking work in text-to-image and image-to-image generation, which has significantly inspired and guided our implementation.
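At a high level, inference ties these components together: the prompt is encoded with CLIP, the U-Net iteratively denoises a latent conditioned on that embedding and a time embedding, and the VAE decoder maps the final latent back to pixel space. The sketch below illustrates this flow only; the module and method names (`clip`, `unet`, `scheduler`, `vae_decoder`, `tokenizer`) are placeholders, not the exact API of this repository.

```python
import torch


def get_time_embedding(timestep, dim=320):
    # Standard sinusoidal embedding of the diffusion timestep (illustrative dimensions).
    freqs = torch.pow(10000, -torch.arange(0, dim // 2, dtype=torch.float32) / (dim // 2))
    x = torch.tensor([timestep], dtype=torch.float32)[:, None] * freqs[None]
    return torch.cat([torch.cos(x), torch.sin(x)], dim=-1)


@torch.no_grad()
def generate(prompt, clip, unet, vae_decoder, scheduler, tokenizer,
             num_steps=50, latent_shape=(1, 4, 64, 64), device="cpu"):
    """Minimal text-to-image loop. Names and signatures are placeholders."""
    # 1. Encode the prompt into CLIP embeddings used to condition the U-Net.
    tokens = torch.tensor([tokenizer.encode(prompt)], device=device)
    context = clip(tokens)

    # 2. Start from pure Gaussian noise in latent space.
    latents = torch.randn(latent_shape, device=device)

    # 3. Iteratively denoise: the U-Net predicts the noise at each timestep,
    #    and the scheduler uses that prediction to step the latent backwards.
    scheduler.set_inference_timesteps(num_steps)
    for t in scheduler.timesteps:
        time_embedding = get_time_embedding(t).to(device)
        noise_pred = unet(latents, context, time_embedding)
        latents = scheduler.step(t, latents, noise_pred)

    # 4. Decode the denoised latent into an image with the VAE decoder.
    return vae_decoder(latents)
```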
The project is organized as follows:
```
├── assets               # Contains images for README
├── LICENSE
├── models
│   ├── clip             # CLIP model for prompt embedding
│   ├── stable_diffusion # Scheduler and pipeline
│   ├── unet             # U-Net architecture for image generation
│   └── vae              # VAE encoder and decoder
├── pretrained           # Put pretrained model weights in this folder
├── README.md
├── requirements.txt
├── sd.ipynb             # Example usage
├── tokenizer
└── utils                # Transformer blocks and other utility functions
```
We recommend creating a virtual environment and installing the required dependencies:
```bash
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```
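Assuming the dependencies in requirements.txt include PyTorch, you can quickly verify the install and check whether a GPU is visible before opening `sd.ipynb`:

```python
import torch

print(torch.__version__)          # installed PyTorch version
print(torch.cuda.is_available())  # True if a CUDA GPU can be used for inference
```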