This repository implements the Flux2 model from scratch, specifically focusing on training the Flux2 Transformer. To simplify the process, I'm leveraging the existing AutoEncoder and Text Encoder.
The base implementation is taken from the official black-forest-labs/flux2 repository.
Note: I'll explain the entire implementation in detail on my blog once the project is complete.
The following datasets will be used for this project:
- `Fredtt3/Flux2-Image`: Base dataset with the images and prompts
- `Fredtt3/Flux2-Image-Processed`: Dataset already processed for transformer training (not yet available)
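The datasets above live on the Hugging Face Hub, so one way to pull them is the `datasets` library. This is a sketch of my assumed workflow, not something the repo prescribes; the `load_images` helper name is mine, and streaming is just one option to avoid a full download.

```python
# Dataset IDs from this README.
DATASETS = {
    "base": "Fredtt3/Flux2-Image",                 # images + prompts
    "processed": "Fredtt3/Flux2-Image-Processed",  # not yet available
}

def load_images(streaming: bool = True):
    """Load the base image/prompt dataset; streaming avoids a full download.

    Hypothetical helper: assumes `pip install datasets` and a standard
    `train` split on the Hub.
    """
    from datasets import load_dataset  # third-party, imported lazily
    return load_dataset(DATASETS["base"], split="train", streaming=streaming)
```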
```shell
uv pip install torch==2.9 transformers==4.57.6 https://github.com/mjun0812/flash-attention-prebuild-wheels/releases/download/v0.7.0/flash_attn-2.8.3+cu128torch2.9-cp312-cp312-linux_x86_64.whl flashinfer-python https://github.com/FredyRivera-dev/Flux2-from-scratch.git
```

Note: I'm providing a pre-compiled Flash Attention wheel for PyTorch 2.9, so you don't have to wait for it to compile from scratch.
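After installing, a quick sanity check can confirm the key packages resolved in your environment. This is a generic sketch using only the standard library; the package list mirrors the install command above (module names are my assumption of what those distributions expose).

```python
import importlib.util

def available(pkg: str) -> bool:
    """Return True if `pkg` is importable in the current environment."""
    return importlib.util.find_spec(pkg) is not None

# Module names assumed from the install command above.
for pkg in ("torch", "transformers", "flash_attn", "flashinfer"):
    print(f"{pkg}: {'ok' if available(pkg) else 'missing'}")
```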
