
Commit c6b6ca3

Add enum for attention implementations. Fix inconsistency between fused and unfused TE implementations so they achieve the same performance (removing the extra dropout layer in fused layers). Also some minor wording changes.
Signed-off-by: tdophung <tdophung@nvidia.com>
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: tdophung <tdophung@nvidia.com>
1 parent 30e51bf commit c6b6ca3

2 files changed, +164 -172 lines changed

0 commit comments
