-
Notifications
You must be signed in to change notification settings - Fork 45
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
**Description** Fix some bugs for latest TE and add UT for it. 1. In TE, it only allocates fp8 weight for the first micro batch. In MS-AMP, it allocates zero size tensor for fp8 weight because `tex.fp8_cast_transpose_fused` will allocate memory for it. However, latest TE introduces a data structure `Float8Tensor` which use `_data` to store the original fp8 tensor. When comparing shape in `set_fp8_weights`, we should use the shape of `_data`. Otherwise, TE will allocate zero-size tensor for non-first micro batch. 2. Seem that when using latest TE, Megaton-LM can't converge(Test it with GPT-345M). The newest TE which can converge is v1.1, so convert it back to v1.1
- Loading branch information
Showing
5 changed files
with
8 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters