Vision docs 📝 #42096

merveenoyan · 2025-11-07T16:04:10Z

This is a big PR to refresh outdated parts of vision docs and add new ones for newer tasks.

Here are some action items:

add SAM2 fine-tuning to the mask-generation tutorial
add trackio to vision examples (requires extended support from Trainer)
add universal segmentation docs (@ariG23498 is taking it up)
fix the dataset in the semantic segmentation doc (the canonical one didn't load; fixed)
replace video classification with VJEPA, change video backend (torchvideo tools no longer work)
(maybe) swap models with newer ones

HuggingFaceDocBuilderDev · 2025-11-07T16:13:56Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

molbap · 2025-11-07T17:16:00Z

Subscribed, thanks for starting this! Feel free to ping me for a review.

merveenoyan added 2 commits November 5, 2025 18:44

add mask generation fine-tuning docs

02e0fd8

initial commit

eb8af93

4 participants
