Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the different version of models and datasets. #15

Open
syguan96 opened this issue Aug 8, 2024 · 1 comment
Open

About the different version of models and datasets. #15

syguan96 opened this issue Aug 8, 2024 · 1 comment

Comments

@syguan96
Copy link

syguan96 commented Aug 8, 2024

Hi @HaozheZhao, this is a great work. I tried to filter out some categories to train Instructpix2pix.

I noticed that you have released "UltraEdit_500k", "UltraEdit_Segion-Based_100k", and the complete dataset. Can you tell me how to divide these subsets? If possible, could you tell me the difference between "BleachNick/SD3-UltraEdit_freeform", "BleachNick · SD3-UltraEdit w_mask", and "BleachNick/SD3-Ult Edit_mask"?

Thanks for your help!

@HaozheZhao
Copy link
Owner

Hi

Thank you for your kind words about the project!

Here's a breakdown of the datasets and differences between them:

  1. Complete Dataset: This includes 4 million freeform image editing entries generated by our pipeline. It is part of the broader UltraEdit initiative.

  2. UltraEdit_Region-Based_100k: This subset supports region-based image editing and includes a mask image for each editing pair. It's designed for tasks where specific regions of an image are targeted for editing.

  3. UltraEdit_500k: This is a sampled subset of the complete dataset, containing 500k entries of freeform image editing data. We created this subset to maintain a comparable size with other similar datasets and to facilitate evaluation and ease of use.

Regarding your questions about the specific models:

  • SD3-UltraEdit_freeform: This model is trained exclusively with the Freeform image editing dataset, which contains 4 million entries.

  • SD3-UltraEdit w_mask: This model is trained using both the freeform (4M) and region-based (100K) image editing data. It supports both freeform and region-based image editing.

  • SD3-Ult Edit_mask: This appears to be an accidentally uploaded empty folder. We will remove it shortly.

Please feel free to reach out if you have any further questions!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants