Fix helper fn for new processor config format #42085

zucchini-nlp · 2025-11-07T09:51:53Z

What does this PR do?

As per title, these helpers are used in vLLM and don't work if we save a processor after v5. Now all processor's subcomponents are saved in one single place in processor_config.json, so we need to check it as well when trying to locate the config file

fyi @hmellor

HuggingFaceDocBuilderDev · 2025-11-07T10:00:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

molbap

Thanks, looks fine to me! just a couple questions

molbap · 2025-11-07T13:07:30Z

src/transformers/models/auto/feature_extraction_auto.py

+    if "audio_processor" in feature_extractor_dict:
+        feature_extractor_dict = feature_extractor_dict["audio_processor"]
+    else:
+        feature_extractor_dict = feature_extractor_dict.get("feature_extractor", feature_extractor_dict)


I'm a bit confused by the logic here, can you remind me why we need to get this / why audio_processor is directly accessible?

This is to get the actual config from nested structure if we're loading from processor_config.json. In audio models we have no standardization unfortunately, and some call the attribute as audio_processor or feature_extractor

We need to get any of the two keys if available, otherwise just return

Crystal clear, thanks, actual audio processors will help a lot when they arrive, cc @eustlb for reference

Oh yes, much needed!

molbap · 2025-11-07T13:13:12Z

src/transformers/models/auto/image_processing_auto.py

        )
        return {}

+    resolved_config_file = resolved_config_files[0]


So this means if two are present we don't take the PROCESSOR, why do we iterate on both?

good point, I haven't noticed that we're prioritizing IMAGE_PROCESSOR here. Imo the priority should be as follows if we find more than one file on the hub:

PROCESSOR -> IMAGE_PROCESSOR and for videos PROCESSOR -> VIDEO_PROCESSOR -> IMAGE_PROCESSOR

Do you think this priority makes sense? I will update accordingly

Yeah it respects the encapsulation of concerns for this set of class, IMO ok!

tests/models/auto/test_processor_auto.py

github-actions · 2025-11-07T14:47:31Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

fix the helper fn for new processor config format

17c836f

zucchini-nlp requested review from molbap and yonigozlan November 7, 2025 09:51

molbap approved these changes Nov 7, 2025

View reviewed changes

change the priority order

68bebfb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix helper fn for new processor config format #42085

Fix helper fn for new processor config format #42085

zucchini-nlp commented Nov 7, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Nov 7, 2025

Uh oh!

molbap left a comment

Uh oh!

molbap Nov 7, 2025

Uh oh!

zucchini-nlp Nov 7, 2025

Uh oh!

molbap Nov 7, 2025

Uh oh!

zucchini-nlp Nov 7, 2025

Uh oh!

molbap Nov 7, 2025

Uh oh!

zucchini-nlp Nov 7, 2025

Uh oh!

molbap Nov 7, 2025

Uh oh!

Uh oh!

github-actions bot commented Nov 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix helper fn for new processor config format #42085

Are you sure you want to change the base?

Fix helper fn for new processor config format #42085

Conversation

zucchini-nlp commented Nov 7, 2025

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Nov 7, 2025

Uh oh!

molbap left a comment

Choose a reason for hiding this comment

Uh oh!

molbap Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

molbap Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

molbap Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

molbap Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Nov 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants