Add multi api inference engine #1343

Merged

elronbandel merged 33 commits into main from multi-api-engine

Nov 19, 2024

Member

elronbandel commented Nov 12, 2024 (edited)

With the multi-API inference engine, you can pass a provider-agnostic model into your application:

    MyClass(
        model=CrossProviderInferenceEngine("llama-3.1-405b", provider="aws")
    )

You can then switch providers either by editing the engine's provider argument or through the general settings:
set unitxt.settings.default_provider = "watsonx" in code,
or run export UNITXT_DEFAULT_PROVIDER="watsonx" in the shell.
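
The fallback described above (an explicit argument first, otherwise a configured default) could look roughly like the sketch below. It is a self-contained illustration and not unitxt's implementation: the `Settings` class, the `resolve_provider` helper, and the choice to check the in-code setting before the environment variable are all assumptions made for the example.

```python
import os
from typing import Optional


class Settings:
    """Hypothetical stand-in for a settings object (not unitxt's real class)."""

    default_provider: Optional[str] = None


settings = Settings()


def resolve_provider(explicit: Optional[str] = None) -> str:
    """Pick a provider name.

    Assumed precedence: explicit argument, then the in-code setting,
    then the UNITXT_DEFAULT_PROVIDER environment variable.
    """
    if explicit is not None:
        return explicit
    if settings.default_provider is not None:
        return settings.default_provider
    env_value = os.environ.get("UNITXT_DEFAULT_PROVIDER")
    if env_value:
        return env_value
    raise ValueError("No provider given and no default provider configured")
```

With this shape, application code never names a provider unless it wants to pin one; everything else follows the process-wide default.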

elronbandel added 4 commits

November 12, 2024 17:10

Add multi api inference engine

de868ab

Signed-off-by: elronbandel <elronbandel@gmail.com>

Fix

c40d87d

Signed-off-by: elronbandel <elronbandel@gmail.com>

Set to greedy decoding

d53eb69

Signed-off-by: elronbandel <elronbandel@gmail.com>

Merge branch 'main' into multi-api-engine

e4a0799

yoavkatz reviewed

examples/evaluate_benchmark_with_custom_api.py Outdated

yoavkatz reviewed

examples/evaluate_benchmark_with_custom_api.py Outdated

yoavkatz reviewed

src/unitxt/catalog/engines/model/llama_3_8b_instruct.json Outdated

		@@ -0,0 +1,12 @@
		{
		"__type__": "multi_api_inference_engine",

Member

yoavkatz Nov 17, 2024

Not sure if adding to the catalog is needed. The API mapping can be done in a single place, and not repeated per model.
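
To make the suggestion concrete, a single central mapping might be sketched as below. This is only an illustration of the idea in the comment, not code from the PR: the table name, the helper, the provider names, and the provider-specific model ids are all hypothetical placeholders.

```python
from typing import Dict

# Hypothetical table: provider -> {generic model name -> provider-specific id}.
# The provider names and ids are placeholders, not verified identifiers.
PROVIDER_MODEL_MAP: Dict[str, Dict[str, str]] = {
    "provider-a": {"llama-3-8b-instruct": "vendor-a/llama-3-8b-instruct"},
    "provider-b": {"llama-3-8b-instruct": "llama3-8b-instruct-v1"},
}


def provider_model_id(model: str, provider: str) -> str:
    """Translate a generic model name into the id one provider expects."""
    if provider not in PROVIDER_MODEL_MAP:
        known = ", ".join(sorted(PROVIDER_MODEL_MAP))
        raise ValueError(f"Unknown provider '{provider}'. Known providers: {known}")
    models = PROVIDER_MODEL_MAP[provider]
    if model not in models:
        known = ", ".join(sorted(models))
        raise ValueError(
            f"Model '{model}' is not mapped for provider '{provider}'. "
            f"Mapped models: {known}"
        )
    return models[model]
```

Keeping the table in one place means supporting a new provider is a one-entry change, with no per-model catalog file to add.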

elronbandel added 4 commits

November 17, 2024 20:30

Merge branch 'main' into multi-api-engine

e41a0fa

Some fixes

b36b7ab

Signed-off-by: elronbandel <elronbandel@gmail.com>

Fix consistency and preparation

Signed-off-by: elronbandel <elronbandel@gmail.com>

Update

28bafa2

Signed-off-by: elronbandel <elronbandel@gmail.com>

yoavkatz reviewed

src/unitxt/catalog/system_prompts/general/be_concise.json Outdated

elronbandel added 8 commits

November 18, 2024 11:32

Merge branch 'main' into multi-api-engine

ccc72ae

Merge branch 'main' into multi-api-engine

06991ca

Fix test

3c861fb

Signed-off-by: elronbandel <elronbandel@gmail.com>

Merge branch 'multi-api-engine' of https://github.com/IBM/unitxt into multi-api-engine

086aae8

Make all args None

f9cd539

Signed-off-by: elronbandel <elronbandel@gmail.com>

Try

4165c78

Signed-off-by: elronbandel <elronbandel@gmail.com>

Fix grammar

f202c3a

Signed-off-by: elronbandel <elronbandel@gmail.com>

Fix

bd8e176

Signed-off-by: elronbandel <elronbandel@gmail.com>

elronbandel requested a review from yoavkatz

November 18, 2024 13:45

elronbandel and others added 6 commits

November 18, 2024 15:54

Change api to provider

b686f95

Signed-off-by: elronbandel <elronbandel@gmail.com>

Merge branch 'main' into multi-api-engine

b4dfe3b

Added support for param renaming.

4c91d5e

Added BAM and improved error messages.

Signed-off-by: Yoav Katz <katz@il.ibm.com>

Fix merge issues

eaead52

Signed-off-by: Yoav Katz <katz@il.ibm.com>

Updated to CrossProviderModel

4c5ba45

Signed-off-by: Yoav Katz <katz@il.ibm.com>

Update name back to InferenceEngine terminology

00dbd30

Signed-off-by: elronbandel <elronbandel@gmail.com>

yoavkatz approved these changes

elronbandel added 2 commits

November 19, 2024 10:48

Align all examples with chat api and cross provider engines

a0373f8

Signed-off-by: elronbandel <elronbandel@gmail.com>

Add vllm inference engine

4fa6f8e

Signed-off-by: elronbandel <elronbandel@gmail.com>

elronbandel and others added 7 commits

November 19, 2024 11:00

Fix blue bench to use cross provider engine

Signed-off-by: elronbandel <elronbandel@gmail.com>

Merge branch 'main' into multi-api-engine

986d268

Added watsonx-sdk to MultiProviderInferenceEngine

728fcc3

Add example to evaluate same datasets  and models with multiple providers and formats

Signed-off-by: Yoav Katz <katz@il.ibm.com>

Make hf tests deterministic

9414f54

Signed-off-by: elronbandel <elronbandel@gmail.com>

Fix llmaj with chat api

69388b5

Signed-off-by: elronbandel <elronbandel@gmail.com>

Add inference documentation

e921b01

Signed-off-by: elronbandel <elronbandel@gmail.com>

Fix examples

9946ab6

Signed-off-by: elronbandel <elronbandel@gmail.com>

yoavkatz reviewed

docs/docs/inference.rst Outdated

yoavkatz reviewed

docs/docs/inference.rst

elronbandel and others added 2 commits

November 19, 2024 17:18

Fix examples

ececf85

Signed-off-by: elronbandel <elronbandel@gmail.com>

Update docs/docs/inference.rst

71365e7

Co-authored-by: Yoav Katz <68273864+yoavkatz@users.noreply.github.com>

yoavkatz approved these changes

elronbandel merged commit ccc5338 into main

18 checks passed

elronbandel deleted the multi-api-engine branch

November 19, 2024 15:37

ShirApp pushed a commit that referenced this pull request

Add multi api inference engine (#1343)

* Add multi api inference engine

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Set to greedy decoding

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Some fixes

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix consistency and preparation

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Update

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix test

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Make all args None

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix grammar

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Change api to provider

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Added support for param renaming.

Added BAM and improved error messages.

Signed-off-by: Yoav Katz <katz@il.ibm.com>

* Fix merge issues

Signed-off-by: Yoav Katz <katz@il.ibm.com>

* Updated to CrossProviderModel

Signed-off-by: Yoav Katz <katz@il.ibm.com>

* Update name back to InferenceEngine terminology

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Align all examples with chat api and cross provider engines

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Add vllm inference engine

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix blue bench to use cross provider engine

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Added watsonx-sdk to MultiProviderInferenceEngine

Add example to evaluate same datasets  and models with multiple providers and formats

Signed-off-by: Yoav Katz <katz@il.ibm.com>

* Make hf tests deterministic

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix llmaj with chat api

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Add inference documentation

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix examples

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix examples

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Update docs/docs/inference.rst

Co-authored-by: Yoav Katz <68273864+yoavkatz@users.noreply.github.com>

---------

Signed-off-by: elronbandel <elronbandel@gmail.com>
Signed-off-by: Yoav Katz <katz@il.ibm.com>
Co-authored-by: Yoav Katz <katz@il.ibm.com>
Co-authored-by: Yoav Katz <68273864+yoavkatz@users.noreply.github.com>

