Skip to content

sdpa: support attn_mask.requires_grad, support expanded number of heads in attn_mask #5327

sdpa: support attn_mask.requires_grad, support expanded number of heads in attn_mask

sdpa: support attn_mask.requires_grad, support expanded number of heads in attn_mask #5327

Annotations

1 warning

pytester (ubuntu-22.04, 3.11, latest)

succeeded Dec 17, 2024 in 24m 31s