Releases: aws-samples/foundation-model-benchmarking-tool

Multiple model copies on a single EC2 instance

10 Sep 00:34

What's Changed

Full Changelog: v2.0.5...v2.0.6

Intel CPU support with vLLM

27 Aug 16:21

What's Changed

Full Changelog: v2.0.4...v2.0.5

AMD CPU support

23 Aug 15:59

What's Changed

Full Changelog: v2.0.3...v2.0.4

EFA support for /tmp dir, Mistral on AWS Chips

22 Aug 19:26

What's Changed

Full Changelog: v2.0.2...v2.0.3

Llama3.1-8b on SageMaker, DJL serving fixes

18 Aug 15:55

What's Changed

Full Changelog: v2.0.1...v2.0.2

Model evaluations

14 Aug 14:34

What's Changed

Full Changelog: v1.0.52...v2.0.0

Llama3.1 on AWS Chips

08 Aug 15:12

What's Changed

Full Changelog: v1.0.51...v1.0.52

FMBench website, Llama3.1

06 Aug 12:33

What's Changed

Full Changelog: v1.0.50...v1.0.51

FMBench support for RAG + Model evaluations

This version contains work-in-progress code for two kinds of evaluation:

  1. Majority Vote: a panel of LLM evaluators judges whether a given candidate model output is correct or incorrect (used for RAG evaluation).
  2. Average Pooling: a panel of LLM evaluators evaluates candidate model responses against user-defined subjective evaluation criteria.
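
The two aggregation ideas above can be illustrated with a minimal Python sketch. It is not FMBench's actual implementation: the function names `majority_vote` and `average_pooling`, the `"correct"`/`"incorrect"` labels, the numeric score scale, and the tie-breaking rule are all assumptions made for illustration.

```python
from statistics import mean


def majority_vote(verdicts: list[str]) -> str:
    """Combine per-evaluator verdicts into one label.

    Assumption: each evaluator returns "correct" or "incorrect".
    Assumption: a tie (no strict majority) resolves to "incorrect".
    """
    correct_votes = sum(1 for v in verdicts if v == "correct")
    return "correct" if correct_votes > len(verdicts) / 2 else "incorrect"


def average_pooling(scores: list[float]) -> float:
    """Combine per-evaluator scores into one value by taking their mean.

    Assumption: each evaluator returns a number on a shared scale.
    """
    return mean(scores)


# Hypothetical panel of three evaluators judging one candidate response.
panel_verdicts = ["correct", "incorrect", "correct"]
panel_scores = [4, 5, 3]

final_verdict = majority_vote(panel_verdicts)
pooled_score = average_pooling(panel_scores)
```

Resolving ties to "incorrect" is a conservative choice for this sketch; a panel with an odd number of evaluators avoids ties altogether.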

Llama3-8b on EC2 inf2.48xlarge

12 Jul 16:58
Compare
Choose a tag to compare

What's Changed

Full Changelog: v1.0.49...v1.0.50