Skip to content

[TESTS] Use FP32 inference precision, FP16 KV cache precision for pipelines #22

[TESTS] Use FP32 inference precision, FP16 KV cache precision for pipelines

[TESTS] Use FP32 inference precision, FP16 KV cache precision for pipelines #22

LLM bench tests (3.11)

succeeded Jan 6, 2025 in 34m 59s