KnobGen is a dual-pathway framework that empowers sketch-based image-generation diffusion models by seamlessly adapting to varying levels of sketch complexity and user skill. KnobGen employs a Coarse-Grained Controller (CGC) module, which leverages high-level semantics from both the textual and the sketch input during the early stages of generation, and a Fine-Grained Controller (FGC) module, which performs detailed refinement later in the process.
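To illustrate the idea, here is a minimal conceptual sketch of how the two pathways could be blended over the denoising schedule. All names (`cgc`, `fgc`, `knob`, `blended_condition`) are hypothetical placeholders used only for illustration; they are not the repository's actual API.

```python
# Illustrative sketch only: `cgc` and `fgc` stand in for the coarse- and
# fine-grained controllers, and `knob` is the user-facing control value.
# This is NOT the KnobGen implementation, just the high-level intuition.

def blended_condition(cgc, fgc, text_emb, sketch, t, total_steps, knob=0.5):
    """Return (coarse, fine) conditioning signals for denoising step t.

    t counts down from total_steps to 0; the fine-grained signal is ramped
    in as generation progresses and is scaled by `knob` in [0, 1].
    """
    coarse = cgc(text_emb, sketch)               # high-level semantics, dominant early
    progress = 1.0 - t / total_steps             # 0 at the first step, 1 at the last
    fine = [knob * progress * f for f in fgc(sketch)]  # detailed refinement, emphasized late
    return coarse, fine
```

A lower `knob` value in this sketch lets a rough novice drawing act mostly as a semantic hint, while a higher value keeps the output closely tied to the exact strokes of a detailed sketch.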
More details are available in our paper.
- [2024-09-27] 🔥 Initial release of KnobGen code!
- [2024-10-02] 🔥 The paper is released on arXiv.
Follow the three steps below (environment setup, training, and inference) to run our pipeline.
To set up the environment, please follow these steps in the terminal:
```bash
git clone https://github.com/aminK8/KnobGen.git
cd KnobGen
conda env create -f environment.yml
conda activate knobgen
```
We utilized the MultiGen-20M dataset, originally introduced by UniControl.
To run training, use the command appropriate for your model:
```bash
# For T2I-Adapter:
bash job_adapter_training.sh

# For ControlNet:
bash job_controlnet_training.sh
```
To run inference, use the command appropriate for your model:
```bash
# For T2I-Adapter:
bash job_adapter_inference.sh

# For ControlNet:
bash job_controlnet_inference.sh
```
Our method democratizes sketch-based image generation by effectively handling a broad spectrum of sketch complexity and user drawing ability—from novice sketches to those made by seasoned artists—while maintaining the natural appearance of the image.
- More demos
- Comparison with baseline
- The effect of our Knob mechanism
If you find our paper useful, please consider citing it:
```bibtex
@misc{navardknobgen,
  title={KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models},
  author={Pouyan Navard and Amin Karimi Monsefi and Mengxi Zhou and Wei-Lun Chao and Alper Yilmaz and Rajiv Ramnath},
  year={2024},
  eprint={2410.01595},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2410.01595},
}
```