[WIP] Support `auto_doctring` in Processors #42101

yonigozlan · 2025-11-07T17:31:21Z

What does this PR do?

Add support for Processors in @auto_docstring.

`@auto_docstring` pulls the custom arguments from a processor's custom kwargs and adds them to its `__doc__`. For example, for...
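To illustrate the idea only (this is not the implementation in this PR), here is a minimal, self-contained sketch of a decorator that reads the fields of a custom kwargs `TypedDict` and appends them to a class's `__doc__`. Every name in it (`toy_auto_docstring`, `ToyImagesKwargs`, `ToyProcessor`, `ARG_DESCRIPTIONS`, and the two kwargs) is hypothetical and was made up for this example; the real `@auto_docstring` in transformers may differ in how it finds the kwargs and how it formats the result.

```python
from typing import TypedDict, get_type_hints


class ToyImagesKwargs(TypedDict, total=False):
    # Hypothetical model-specific kwargs, standing in for a processor's custom kwargs.
    tile_size: int
    add_grid_token: bool


# Hypothetical lookup table of per-argument descriptions.
ARG_DESCRIPTIONS = {
    "tile_size": "Side length, in pixels, of each image tile.",
    "add_grid_token": "Whether to insert a grid token between tiles.",
}


def toy_auto_docstring(kwargs_type):
    """Return a class decorator that appends the fields of `kwargs_type` to `__doc__`."""

    def decorator(cls):
        # Start from the hand-written summary, then add one entry per custom kwarg.
        lines = [(cls.__doc__ or "").strip(), "", "Args:"]
        for name, annotation in get_type_hints(kwargs_type).items():
            type_name = getattr(annotation, "__name__", str(annotation))
            description = ARG_DESCRIPTIONS.get(name, "(undocumented)")
            lines.append(f"    {name} (`{type_name}`, *optional*):")
            lines.append(f"        {description}")
        cls.__doc__ = "\n".join(lines)
        return cls

    return decorator


@toy_auto_docstring(ToyImagesKwargs)
class ToyProcessor:
    """Toy processor wrapping an image processor and a tokenizer."""


print(ToyProcessor.__doc__)
```

The point of the pattern is that the argument documentation is derived from the kwargs definition, so it does not have to be copied by hand into every processor's docstring.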

* Change ssh runner type * Add wait step to SSH runner workflow * Rename wait step to wait2 in ssh-runner.yml * Remove wait step from ssh-runner.yml Removed the wait step from the SSH runner workflow. * Update runner type for single GPU A10 instance * Update SSH runner version to 1.90.3 * Add sha256sum to ssh-runner workflow * Update runner type and remove unused steps

…face#41931) * fix 3 failed test cases for video_llama_3 model on Intel XPU Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com> * update Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com> * adjust format Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com> * update code Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com> --------- Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>

* adding option for 2.5 * minor - arg in conversion script * getting started on modelling.py * minor - shouldve been using modular * adressing comments + fixing datatype/device _get method * minor * commiting suggestion Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * docs + first test * ruff fix * minor fix * ruff fix * model fix * model fix * fine-grained check, with a hardcoded score from the original Hf implementation. * minor ruff * update tests values with CI hardware * adding 2.5 to conversion script * Apply style fixes --------- Co-authored-by: Sahil Kabir <sahilkabir@Sahils-MacBook-Pro.local> Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* Update image_processing_owlv2_fast.py fixed padding value * fixed padding value * Change padding constant value from 0.5 to 0.0 * Fixed missed padding value in modular_owlv2.py --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

…gface#41997) * accept kwargs in image proc from_pretrained * only use kwargs that are in cls.valid_kwargs * remove specific logic for _from_auto * add image_seq_length to Images_kwargs for backward compatibility * fix missing image kwargs in pix2struct

…SmolVLM processors (huggingface#41871) * Fix default image_rows and image_cols initialization in Idefics3 and SmolVLM processors * Fix default initialization of image_rows and image_cols in Idefics3 and SmolVLM processors

* Add GLPNImageProcessorFast for torch backend
* Address review feedback
  - Simplified to_dict() method
  - Keep tensors as torch instead of converting to numpy for heterogeneous shapes
  - Removed unnecessary shape guards in post_process_depth_estimation
  - Improved variable names (tgt -> target_size, d -> resized)
  - Removed unnecessary GLPNImageProcessorKwargs class
* Address review feedback
  - Simplified to_dict() method
  - Keep tensors as torch instead of converting to numpy for heterogeneous shapes
  - Removed unnecessary shape guards in post_process_depth_estimation
  - Improved variable names (tgt -> target_size, d -> resized)
  - Removed unnecessary GLPNImageProcessorKwargs class
* commits after 2nd review
* Address all review feedback and add explicit batched test
  - Simplified to_dict() with descriptive variable names (d->output_dict)
  - Fixed resize operation: changed from crop to proper resize with interpolation
  - Added padding for heterogeneous batch shapes in both slow and fast processors
  - Fused rescale and normalize operations for efficiency
  - Improved all variable names (tgt->target_size, d->depth_4d->resized)
  - Added GLPNImageProcessorKwargs class in slow processor and imported in fast
  - Renamed test_equivalence_slow_fast to test_slow_fast_equivalence
  - Added explicit test_slow_fast_equivalence_batched test
  - All 20 tests passing
* using padding from utils
* simplify glpn image processor fast
* fix docstring

---------

Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>
Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

* added fast processor for fuyu (huggingface#36978) * updated docs for fuyu model (huggingface#36978) * updated test_image_processing and image_processing_fuyu_fast * updated fuyu.md and image_processing_fuyu_fast (huggingface#36978) * updated test_image_processing_fuyu (huggingface#36978) * formatted image_processing_fuyu_fast and test_image_processing_fuyu (huggingface#36978) * updated tests and fuyu fast image processing (huggingface#36978) * Merge branch 'fuyu-fast-image-processors' of https://github.com/DeXtAr47-oss/transformers into fuyu-fast-image-processors * fixed format (huggingface#36978) * formatted files (huggingface#36978) * formatted files * revert unnecessary changes * clean up and process by group --------- Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>

* fix * add comment * better fix * style * Update src/transformers/modeling_utils.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

github-actions · 2025-11-07T18:09:15Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: align, altclip, aria, aya_vision, bark, blip, blip_2, bridgetower, bros, chameleon, chinese_clip, clap, clip, clipseg, clvp, cohere2_vision

yonigozlan and others added 30 commits October 15, 2025 15:47

remove attributes and add all missing sub processors to their auto cl…

f48a47b

…asses

remove all mentions of .attributes

d5d5c58

cleanup

dd505b5

fix processor tests

6a1448f

fix modular

a292900

remove last attributes

63a255d

fixup

ef73759

Merge remote-tracking branch 'upstream/main' into remove-attributes-f…

b5e8b2e

…rom-processors

fixes after merge

f14ff3c

fix wrong tokenizer in auto florence2

0306430

fix missing audio_processor + nits

01cb815

Override __init__ in NewProcessor and change hf-internal-testing-repo…

49ec906

… (temporarily)

Merge remote-tracking branch 'upstream/main' into remove-attributes-f…

7dd5682

…rom-processors

fix auto tokenizer test

946cc5c

add init to markup_lm

b0cb3e0

update CustomProcessor in custom_processing

3b9e846

remove print

53de7a4

Merge branch 'main' into remove-attributes-from-processors

93d2c4d

Merge remote-tracking branch 'upstream/main' into remove-attributes-f…

feeec28

…rom-processors

nit

4a6b080

Merge branch 'remove-attributes-from-processors' of https://github.co…

02402a0

…m/yonigozlan/transformers into remove-attributes-from-processors

fix test modeling owlv2

757e1f1

fix test_processing_layoutxlm

bf763b2

Fix owlv2, wav2vec2, markuplm, voxtral issues

0799a0a

Merge remote-tracking branch 'upstream/main' into remove-attributes-f…

bf1a4b6

…rom-processors

add support for loading and saving multiple tokenizer natively

e3f130d

remove exclude_attributes from save_pretrained

cc45a7e

Run slow v2 (huggingface#41914)

6b9e7c9

* Super * Super * Super * Super --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fix detectron2 installation in docker files (huggingface#41975)

0ccb0e3

* detectron2 - part 1 * detectron2 - part 2 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fix autoawq[kernels] installation in quantization docker file (hugg…

1eeece5

…ingface#41978) fix autoawq[kernels] Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

remi-or and others added 26 commits November 6, 2025 16:04

More data in benchmarking (huggingface#41848)

55938f4

* Reduce scope of cross-generate * Rm generate_sall configs * Workflow benchmarks more * Prevent crash when FA is not installed

Fix run slow v2: empty report when there is only one model (hugging…

f639ad6

…face#42002) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

[kernels] change import time in KernelConfig (huggingface#42004)

135543a

* change import time * style

DOC Fix typo in argument name: pseudoquant (huggingface#41994)

adf6777

The correct argument name is pseudoquantization. Since there is no error on passing wrong arguments name (which is arguably an anti-pattern), this is difficult for users to debug.

Fix torch+deepspeed docker file (huggingface#41985)

f37903b

* fix * delete --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Correct syntax error in trainer.md (huggingface#42001)

6a5d5ce

A comma is missing between two parameters in the signature of compute_loss function.

Reduce the number of benchmark in the CI (huggingface#42008)

1f8ae37

Changed how benchmark cfgs are chosen

Fix continuous batching tests (huggingface#42012)

9488b26

* Fix continuous batching tests * make fixup

add back logging_dir (huggingface#42013)

0a703ee

* add back * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

[v5] Deprecate Text2Text and related pipelines (huggingface#41996)

a63b6da

* Deprecate Text2Text and related pipelines * Try a restructure * make fixup * logging -> logger

[FPQuant] MXFP8 and MXFP4 backwards support (huggingface#41897)

a3f3937

* FP-Quant backwards * fp-quant v0.3.0 docker * availability version bump * fp_quant==0.3.1 * fp_quant v0.3.2

Merge remote-tracking branch 'upstream/main' into support-auto_doctri…

5b552a9

…ng-in-processor

add working auto_docstring for processors

09d5527

add auto_docstring to processors first part

b542e95

add auto_docstring to processors part 2

552509c

modifs after review

8979645

Merge remote-tracking branch 'upstream/main' into remove-attributes-f…

6cc30f9

…rom-processors

Merge branch 'remove-attributes-from-processors' into support-auto_do…

30f1b92

…ctring-in-processor

yonigozlan force-pushed the support-auto_doctring-in-processor branch from b1300e4 to 30f1b92 Compare November 7, 2025 18:10

Merge remote-tracking branch 'upstream/main' into support-auto_doctri…

bd5aae2

…ng-in-processor

20 participants
