Releases: microsoft/aoai-api-simulator

v0.6 - 2024-11-06

06 Nov 13:50
  • Add support for the translation endpoint, i.e. Whisper models (#59 @wtulloch)
    • This also includes internal changes to support non-token based rate limiting
  • Add support for specifying the dimension parameter in embeddings requests for text-embedding-3 and later models (#55 - @tanya-borisova)
  • Ensure that an API key is always generated if not provided (#56 - @lucashuet93)
  • BREAKING CHANGE: Requests for an incompatible model (e.g. chat requests for an embedding model) now fail with a 400 error (#58 - @tanya-borisova)
  • Terraform deployment option (#60 @mluker)
  • Support for ARM architecture for local Docker builds (#32 @mluker)
  • Numerous fixes and repo improvements: #24, #26, #38, #41, #42, #43, #45, #51 @martinpeck
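
As an illustration of the embeddings change above, here is a minimal sketch of a request that sets the output size. It assumes the Azure OpenAI REST conventions (a `dimensions` field in the JSON body, an `api-key` header, and a `/openai/deployments/{deployment}/embeddings` path); the base URL, deployment name, API version, and the `build_embeddings_request` helper are hypothetical and not taken from this repo.

```python
import json
import urllib.request


def build_embeddings_request(base_url, deployment, api_key, text,
                             dimensions=None, api_version="2024-02-01"):
    """Build (but do not send) an embeddings request.

    Hypothetical helper: the URL shape and field names follow the Azure
    OpenAI REST API and are assumptions about how the simulator is called.
    """
    url = (f"{base_url}/openai/deployments/{deployment}"
           f"/embeddings?api-version={api_version}")
    body = {"input": text}
    if dimensions is not None:
        # Only meaningful for text-embedding-3 and later models.
        body["dimensions"] = dimensions
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "api-key": api_key},
        method="POST",
    )


# Example values below are placeholders, not defaults from this project.
request = build_embeddings_request(
    "http://localhost:8000", "embedding", "my-api-key", "hello", dimensions=256
)
```

Passing the request to `urllib.request.urlopen` would send it. With the breaking change in this release, pointing a request like this at a deployment of the wrong model type should be expected to come back as a 400 error.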

v0.5 - 2024-08-27

06 Nov 10:11
  • Migrate to current repo from previous repo
    • BREAKING CHANGE: rename aoai-simulated-api to aoai-api-simulator in code (also for aoai_simulated_api package)
    • BREAKING CHANGE: update metric prefix from aoai-simulated-api. to aoai-api-simulator.
  • Return to sliding window rate limiting. This change moves from the limits package to a custom rate-limiting implementation to address performance with sliding windows (#51)
  • Update rate-limit handling for tokens based on experimentation (limited set of models currently - see #52)
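
For readers unfamiliar with the term, the following is a minimal sketch of sliding window rate limiting in general. It is not the simulator's implementation: the `SlidingWindowLimiter` class, its parameters, and the injectable clock are all hypothetical names chosen for illustration.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `limit` units of cost within any `window_seconds` span.

    Illustrative only; names and structure are assumptions, not this repo's code.
    """

    def __init__(self, limit, window_seconds, clock=time.monotonic):
        self.limit = limit
        self.window_seconds = window_seconds
        self.clock = clock      # injectable so the limiter can be tested
        self.events = deque()   # (timestamp, cost) pairs, oldest first
        self.used = 0           # total cost currently inside the window

    def try_acquire(self, cost=1):
        now = self.clock()
        # Drop events that have slid out of the window.
        while self.events and self.events[0][0] <= now - self.window_seconds:
            _, old_cost = self.events.popleft()
            self.used -= old_cost
        if self.used + cost > self.limit:
            return False        # a caller would typically respond with 429
        self.events.append((now, cost))
        self.used += cost
        return True
```

Here `cost` can be 1 per call for request-based limits or a token count for token-based limits. Keeping a running total alongside the deque means each check only does work for entries that have expired.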