Implementation of tiled attention with bf16 and circular buffers, which reduces memory requirements by 4x for longer contexts on Gemma models. #6481

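The PR body is not included here, so as a rough illustration of the technique the title names, below is a minimal NumPy sketch of tiled attention with an online softmax, where K/V are consumed one fixed-size tile at a time instead of materializing the full score matrix. All names and shapes are hypothetical and may differ from the actual implementation; NumPy has no bf16 type, so float32 stands in, with comments marking where the real kernel would keep tiles in bf16 in a circular buffer.

```python
import numpy as np

def tiled_attention(q, k, v, tile=128):
    """Single-query attention over K/V processed tile by tile
    (online softmax), so only one tile is resident at a time.
    Hypothetical sketch: q has shape (d,), k and v have shape (T, d)."""
    d = q.shape[-1]
    scale = 1.0 / np.sqrt(d)
    m = -np.inf                          # running max of logits (numerical stability)
    l = 0.0                              # running softmax denominator
    acc = np.zeros(d, dtype=np.float32)  # running weighted sum of V
    for start in range(0, k.shape[0], tile):
        # In the real kernel these tiles would be loaded from a bf16
        # circular buffer that is reused across iterations.
        kt = k[start:start + tile].astype(np.float32)
        vt = v[start:start + tile].astype(np.float32)
        s = kt @ q * scale               # logits for this tile
        m_new = max(m, float(s.max()))
        p = np.exp(s - m_new)            # tile's softmax numerators
        correction = np.exp(m - m_new)   # rescale earlier partial results
        l = l * correction + float(p.sum())
        acc = acc * correction + p @ vt
        m = m_new
    return acc / l
```

The memory saving in such a scheme comes from never materializing the full attention score matrix and from halving the bytes per cached element with bf16 versus fp32; the specific 4x figure on long-context Gemma runs is the PR's own claim.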