Handle hf_inference in single file #2766

Conversation

@Wauplin Wauplin commented Jan 20, 2025

cc @hanouticelina this is a suggestion for PR #2757

Main changes are:

  1. IMO provider_helper.prepare_payload should not have an expect_binary parameter. Since the provider_helper depends on the task, we already know whether the input must be binary or not.
  2. I've made model: Optional[str] a required parameter of provider_helper.prepare_payload for all providers and all tasks. At the moment, the model is only passed when we know the provider will use it, but in practice the InferenceClient and the provider implementations should be independent. Hence we pass it every time and let the provider decide what to do with the info.
  3. (the biggest change) Use classes (yeah, I know... 🤦‍♂️) for the hf-inference provider. This greatly reduces the complexity and duplicated code. I'm defining HFInferenceTask for generic tasks, HFInferenceBinaryInputTask for tasks that require a binary input (so no need for expect_binary: bool anymore), and HFInferenceConversational for the conversational task.
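
To make the discussion concrete, here is a minimal sketch of the class layout described in points 1–3. It is an illustration only: the method bodies, payload shapes, and exact signatures are assumptions based on this description, not the actual huggingface_hub implementation.

```python
# Hypothetical sketch of the hf-inference provider helpers described above.
# Class names follow the PR description; payload shapes are illustrative.
from pathlib import Path
from typing import Any, Dict, Optional, Union


class HFInferenceTask:
    """Helper for a generic hf-inference task."""

    def __init__(self, task: str):
        self.task = task

    # Point 2: `model` is always passed, even if the helper ignores it.
    def prepare_payload(
        self, inputs: Any, parameters: Dict[str, Any], model: Optional[str]
    ) -> Dict[str, Any]:
        return {"inputs": inputs, "parameters": parameters}


class HFInferenceBinaryInputTask(HFInferenceTask):
    """Tasks whose input is raw bytes (e.g. audio, image).

    Point 1: the task itself determines that the input is binary,
    so no `expect_binary` flag is needed at the call site.
    """

    def prepare_payload(
        self, inputs: Union[bytes, Path], parameters: Dict[str, Any], model: Optional[str]
    ) -> Dict[str, Any]:
        data = inputs.read_bytes() if isinstance(inputs, Path) else inputs
        return {"data": data}


class HFInferenceConversational(HFInferenceTask):
    """Dedicated helper for the conversational (chat-completion) payload."""

    def prepare_payload(
        self, inputs: Any, parameters: Dict[str, Any], model: Optional[str]
    ) -> Dict[str, Any]:
        return {"model": model, "messages": inputs, **parameters}
```

With this shape, the InferenceClient only needs to look up the right helper for a task and call `prepare_payload(inputs, parameters, model)` uniformly, instead of branching on `expect_binary` per task.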

Let me know what you think. We can either discuss some changes here or merge it and continue on your PR.

@Wauplin Wauplin requested a review from hanouticelina January 20, 2025 17:30

@hanouticelina hanouticelina left a comment

make sense, thanks a lot! let's merge it

@hanouticelina hanouticelina merged commit 20e8d3a into inference-providers-compatibility Jan 20, 2025
11 of 15 checks passed
@hanouticelina hanouticelina deleted the inference-providers-compatibility-suggestion branch January 20, 2025 21:11
hanouticelina added a commit that referenced this pull request Jan 23, 2025
* Add first version of third-party providers support

* add task level in model id mappings

* raise error when task is not supported by a provider + some improvements

* small (big) refactoring

* multiple fixes

* add hf inference tasks

* Handle hf_inference in single file (#2766)

* harmonize prepare_payload args and add automatic-speech-recognition task

* backward compatibility with custom urls

* first draft of tests

* InferenceClient as fixture + skip if no api_key

* give name to parametrized tests

* upload cassettes

* make quali

* download sample files from prod

* fix python3.8

* small improvement for better readability

Co-authored-by: Lucain <lucain@huggingface.co>

* make style

* fixing more tests

* test url building

* fix and record async client tests

* re-add cassettes

* fix

* add cassettes back

* fix test

* hopefully this will fix the test

* fix sentence similarity test

---------

Co-authored-by: Lucain <lucain@huggingface.co>
Co-authored-by: Lucain Pouget <lucainp@gmail.com>