unify xpu and cpu backend and use paged attention #5273
Triggered via pull request
December 2, 2024 02:11
Status: Cancelled
Total duration: 1m 59s
Artifacts: –
Annotations
4 errors
build (2.5.*)
Canceling since a higher priority waiting request for 'INC - Test-paged_attn' exists
build (2.5.*)
The operation was canceled.
build (2.4.0)
Canceling since a higher priority waiting request for 'INC - Test-paged_attn' exists
build (2.4.0)
The operation was canceled.