Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add multi api inference engine #1343

Open
wants to merge 27 commits into
base: main
Choose a base branch
from
Open

Add multi api inference engine #1343

wants to merge 27 commits into from

Commits on Nov 12, 2024

  1. Add multi api inference engine

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    de868ab View commit details
    Browse the repository at this point in the history
  2. Fix

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    c40d87d View commit details
    Browse the repository at this point in the history
  3. Set to greedy decoding

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 12, 2024
    Configuration menu
    Copy the full SHA
    d53eb69 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    e4a0799 View commit details
    Browse the repository at this point in the history

Commits on Nov 17, 2024

  1. Configuration menu
    Copy the full SHA
    e41a0fa View commit details
    Browse the repository at this point in the history
  2. Some fixes

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 17, 2024
    Configuration menu
    Copy the full SHA
    b36b7ab View commit details
    Browse the repository at this point in the history

Commits on Nov 18, 2024

  1. Fix consistency and preparation

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    0593788 View commit details
    Browse the repository at this point in the history
  2. Update

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    28bafa2 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    ccc72ae View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    06991ca View commit details
    Browse the repository at this point in the history
  5. Fix test

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    3c861fb View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    086aae8 View commit details
    Browse the repository at this point in the history
  7. Make all args None

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    f9cd539 View commit details
    Browse the repository at this point in the history
  8. Try

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    4165c78 View commit details
    Browse the repository at this point in the history
  9. Fix grammar

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    f202c3a View commit details
    Browse the repository at this point in the history
  10. Fix

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    bd8e176 View commit details
    Browse the repository at this point in the history
  11. Change api to provider

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    b686f95 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    b4dfe3b View commit details
    Browse the repository at this point in the history
  13. Added support for param renaming.

    Added BAM and improved error messages.
    
    Signed-off-by: Yoav Katz <katz@il.ibm.com>
    yoavkatz committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    4c91d5e View commit details
    Browse the repository at this point in the history
  14. Fix merge issues

    Signed-off-by: Yoav Katz <katz@il.ibm.com>
    yoavkatz committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    eaead52 View commit details
    Browse the repository at this point in the history
  15. Updated to CrossProviderModel

    Signed-off-by: Yoav Katz <katz@il.ibm.com>
    yoavkatz committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    4c5ba45 View commit details
    Browse the repository at this point in the history
  16. Update name back to InferenceEngine terminology

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 18, 2024
    Configuration menu
    Copy the full SHA
    00dbd30 View commit details
    Browse the repository at this point in the history

Commits on Nov 19, 2024

  1. Align all examples with chat api and cross provider engines

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 19, 2024
    Configuration menu
    Copy the full SHA
    a0373f8 View commit details
    Browse the repository at this point in the history
  2. Add vllm inference engine

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 19, 2024
    Configuration menu
    Copy the full SHA
    4fa6f8e View commit details
    Browse the repository at this point in the history
  3. Fix blue bench to use cross provider engine

    Signed-off-by: elronbandel <elronbandel@gmail.com>
    elronbandel committed Nov 19, 2024
    Configuration menu
    Copy the full SHA
    8115091 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    986d268 View commit details
    Browse the repository at this point in the history
  5. Added watsonx-sdk to MultiProviderInferenceEngine

    Add example to evaluate same datasets  and models with multiple providers and formats
    
    Signed-off-by: Yoav Katz <katz@il.ibm.com>
    yoavkatz committed Nov 19, 2024
    Configuration menu
    Copy the full SHA
    728fcc3 View commit details
    Browse the repository at this point in the history