Skip to content

Actions: predibase/lorax

docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
327 workflow runs
327 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Set maximum grpc message receive size to 2GiB (#667)
docs #327: Commit a2c1fc1 pushed by noyoshi
November 7, 2024 20:10 26s main
November 7, 2024 20:10 26s
Fix sliding window + compile bug (#666)
docs #326: Commit e03f989 pushed by ajtejankar
November 6, 2024 01:37 24s main
November 6, 2024 01:37 24s
Convert to Triton Punica kernels (#658)
docs #325: Commit 902a68c pushed by tgaddair
November 5, 2024 18:37 23s main
November 5, 2024 18:37 23s
added metrics docs, updated links in main docs (#663)
docs #324: Commit b3944ad pushed by noyoshi
November 1, 2024 22:58 21s main
November 1, 2024 22:58 21s
Fix seqlen bug for sliding window models like Mistral v0.1 (#660)
docs #323: Commit bd92e52 pushed by tgaddair
October 31, 2024 17:11 22s main
October 31, 2024 17:11 22s
Support for Embeddings with XLM-RoBERTa and Adapters (#656)
docs #322: Commit 3f108f7 pushed by tgaddair
October 31, 2024 16:42 26s main
October 31, 2024 16:42 26s
Fix absent fp8_kv property on llama and qwen models (#662)
docs #321: Commit c2441e2 pushed by arnavgarg1
October 30, 2024 18:02 25s main
October 30, 2024 18:02 25s
Support FP8 KV Cache (#652)
docs #320: Commit 2ff1c71 pushed by ajtejankar
October 29, 2024 19:41 24s main
October 29, 2024 19:41 24s
Prompt prefix caching for multi-LoRA (#655)
docs #319: Commit 373c3e6 pushed by tgaddair
October 23, 2024 00:31 23s main
October 23, 2024 00:31 23s
Fix PREDIBASE_API_TOKEN env var being thrown away (#654)
docs #318: Commit 71ca771 pushed by joseph-predibase
October 22, 2024 18:37 24s main
October 22, 2024 18:37 24s
Chunked prefill (#653)
docs #317: Commit 6c5ca67 pushed by tgaddair
October 21, 2024 19:04 28s main
October 21, 2024 19:04 28s
feat: Function calling with output schema enforcement (#536)
docs #316: Commit 418b9fa pushed by tgaddair
October 16, 2024 23:37 22s main
October 16, 2024 23:37 22s
change runner 2 (#650)
docs #315: Commit d9ed1a6 pushed by magdyksaleh
October 16, 2024 21:42 25s main
October 16, 2024 21:42 25s
October 16, 2024 21:34 25s
docs
docs #313: Commit 808127d pushed by magdyksaleh
October 16, 2024 21:10 27s main
October 16, 2024 21:10 27s
Added backwards compatible field to OpenAI json_object API (#648)
docs #312: Commit 974c2b2 pushed by tgaddair
October 16, 2024 19:48 26s main
October 16, 2024 19:48 26s
try using arc runner for build (#646)
docs #311: Commit c8f361e pushed by noyoshi
October 16, 2024 18:26 20s main
October 16, 2024 18:26 20s
Enhance Structured Output Interface (#644)
docs #310: Commit 4fb4d69 pushed by tgaddair
October 16, 2024 17:39 21s main
October 16, 2024 17:39 21s
Fix compile for qwen-2.5-32b (#645)
docs #309: Commit 8ac729b pushed by tgaddair
October 16, 2024 16:55 26s main
October 16, 2024 16:55 26s
Add --disable-sgmv flag (#639)
docs #308: Commit 3818e1a pushed by joseph-predibase
October 16, 2024 00:03 46s main
October 16, 2024 00:03 46s
docs
docs #307: by tgaddair
October 15, 2024 18:01 29s main
October 15, 2024 18:01 29s
Return n choices for chat completions API (#638)
docs #306: Commit bea8834 pushed by tgaddair
October 15, 2024 16:56 22s main
October 15, 2024 16:56 22s
Look for language model lm head (#640)
docs #305: Commit 2a22063 pushed by Infernaught
October 15, 2024 16:56 25s main
October 15, 2024 16:56 25s
pass correct stuff to predibase-reporter (#635)
docs #304: Commit f1ef0ee pushed by magdyksaleh
October 8, 2024 19:09 21s main
October 8, 2024 19:09 21s
Fix cuda graph tracing without lora ranks (#634)
docs #303: Commit 0c1cec2 pushed by tgaddair
October 7, 2024 17:59 1m 35s main
October 7, 2024 17:59 1m 35s