
Stuff is out of date, but it should still work for basic tag list making. Everything on my machine is scattered everywhere right now, so an update might happen anywhere from a week to a month from now. The GitHub Actions workflow also doesn't work at the moment; I haven't figured out why yet.

Update: the main branch is still out of date, but the wildcards branch is current and has all the code I used to make the per-gender wildcards and the new category-merged CSV files. The main script (dbr_e6_tag_processor.py) doesn't have this; the operations I did are split across many files, and I'll try to clean that up at some point.

Create and Format Danbooru and e621 Tags for AI Autocomplete Extensions

This script scrapes tags from Danbooru and downloads e621 tags from the DB export index, formatting them for AI autocomplete extensions. It can generate separate or merged tag lists for Danbooru and e621, with options like alias inclusion, minimum post thresholds, and filtering by alias status.
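For a feel of the e621 half of that pipeline, here is a minimal sketch (not the repo's actual code; the file naming pattern and User-Agent string are assumptions about e621's public db_export page) that lists the export index and loads the newest tags dump:

```python
# Hedged sketch: list e621's public db_export index, pick the newest
# tags dump, and load it with pandas. Not the repo's exact code.
import io
import re

import pandas as pd
import requests
from bs4 import BeautifulSoup

EXPORT_INDEX = "https://e621.net/db_export/"
# e621 asks for a descriptive User-Agent; this one is made up for the example.
HEADERS = {"User-Agent": "tag-list-example/0.1 (illustrative)"}

index_html = requests.get(EXPORT_INDEX, headers=HEADERS, timeout=30).text
soup = BeautifulSoup(index_html, "html.parser")

# The index lists daily dumps named like tags-2024-06-01.csv.gz;
# a lexicographic sort works because the date is zero-padded.
dumps = sorted(
    a["href"]
    for a in soup.find_all("a", href=True)
    if re.fullmatch(r"tags-\d{4}-\d{2}-\d{2}\.csv\.gz", a["href"])
)
latest = dumps[-1]

resp = requests.get(EXPORT_INDEX + latest, headers=HEADERS, timeout=120)
tags = pd.read_csv(io.BytesIO(resp.content), compression="gzip")
print(f"Loaded {len(tags)} tags from {latest}")
```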

CSV files can be found at https://github.com/DraconicDragon/dbr-e621-lists-archive or on my CivitAI profile: https://civitai.com/models/950325

The output CSV files have been tested with sd-webui-tagcomplete and the autocomplete feature of pythongosssss/ComfyUI-Custom-Scripts.
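For reference, here is a minimal sketch of that output layout, assuming the four-column CSV format those extensions read (tag, numeric category, post count, comma-separated aliases; no header row) and using made-up example rows:

```python
# Hedged sketch of the assumed tagcomplete-style CSV layout:
# tag, category id, post count, comma-separated aliases, no header row.
import csv

rows = [
    ("1girl", 0, 5_000_000, ["1girls", "sole_female"]),  # example values only
    ("hatsune_miku", 4, 100_000, []),
]

with open("danbooru_example.csv", "w", newline="", encoding="utf-8") as f:
    writer = csv.writer(f)
    for tag, category, count, aliases in rows:
        # csv.writer quotes the alias field automatically when it contains commas.
        writer.writerow([tag, category, count, ",".join(aliases)])
```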

The tag lists are automatically regenerated every two months by a GitHub Actions workflow and saved in the tag-lists folder, so they stay current even if I forget to update them manually.

Running the Script Yourself

Method 1: Local

  • Install the dependencies first: pip install pandas requests beautifulsoup4, or install them from the requirements file: pip install -r requirements.txt
  • Run the script and answer the prompts, after which processing begins.
  • You can also just press Enter through every prompt to accept the defaults, which produces both the Danbooru and e621 tag lists (including active and deleted aliases) as well as the merged list.

When the script finishes, the tag lists are saved in the same directory as the script; the terminal output prints the location of the saved files.

Method 2: GitHub Actions (requires a GitHub account)

  • SOON™
  • Fork this repo
  • Go to the "Actions" tab of the forked repo
  • Select the X on the left
  • Click X on the right to run the workflow with optional custom settings
  • Wait for the workflow to finish, then go to X and scroll down; you will be able to download the artifact there, which should contain the output CSV file(s)

The Danbooru scraping part of the code was copied from https://github.com/BetaDoggo/danbooru-tag-list (many thanks).

(I'm bad at repo titles.)

TODO

Remove the list archive as a submodule and just link the repo from this README; a submodule's purpose is different from what I thought.

Personal note regarding pandas (because I'm too lazy to actually read up on it): line 175 of dbr_e6_tag_processor.py, i.e. tag_df = pd.concat([tag_df, pd.DataFrame(data)], ignore_index=True), raises this FutureWarning: "The behavior of DataFrame concatenation with empty or all-NA entries is deprecated. In a future version, this will no longer exclude empty or all-NA columns when determining the result dtypes. To retain the old behavior, exclude the relevant entries before the concat operation."
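The warning itself suggests the fix: exclude empty or all-NA entries before the concat. A minimal sketch of that (hypothetical helper name; not code from the script):

```python
# Hedged sketch of the fix the FutureWarning suggests: skip empty or
# all-NA frames before pd.concat instead of relying on the old implicit
# behavior. Variable names mirror the warning's snippet.
import pandas as pd

def append_rows(tag_df: pd.DataFrame, data: list[dict]) -> pd.DataFrame:
    new_rows = pd.DataFrame(data)
    # Only concatenate frames that actually carry values.
    if not new_rows.empty and not new_rows.isna().all().all():
        tag_df = pd.concat([tag_df, new_rows], ignore_index=True)
    return tag_df
```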
