[Fix] Correct the error in docs #340
base: develop
Conversation
| ## Features Overview | ||
| UCM supports two key features: **Prefix Cache** and **GSA Sparsity**. | ||
| UCM supports two key features: **Prefix Cache** and **Sparse-attention**. |
GSA is only one of several sparse-attention algorithms, so it can serve only as an example of this feature.
| # For GPU devices, use the following command: | ||
| export CUDA_VISIBLE_DEVICES=0 | ||
| # For NPU devices, use the following command: | ||
| export ASCEND_RT_VISIBLE_DEVICES=0 |
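The two exports above differ only in the variable name, so the choice can be wrapped in a small helper. The following is a minimal sketch, not part of UCM: the function name `select_devices` and its arguments are hypothetical, and it assumes the Ascend variable is spelled `ASCEND_RT_VISIBLE_DEVICES`.

```shell
# Hypothetical helper (not part of UCM): export the device-visibility
# variable that matches the accelerator type.
#   $1: device type, "gpu" or "npu"
#   $2: comma-separated device IDs, e.g. "0" or "0,1"
select_devices() {
  case "$1" in
    gpu) export CUDA_VISIBLE_DEVICES="$2" ;;
    npu) export ASCEND_RT_VISIBLE_DEVICES="$2" ;;
    *)
      # Unknown type: report it and signal failure to the caller.
      echo "unknown device type: $1" >&2
      return 1
      ;;
  esac
}
```

For example, `select_devices npu 0` has the same effect as the NPU export shown in the diff.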
| ## Prerequisites | ||
| - UCM: Installed with reference to the Installation documentation. | ||
| - Hardware: At least 4 GPUs (At least 2 GPUs for prefiller + 2 for decoder in 2d2p setup) | ||
| - Hardware: At least 4 GPUs (At least 2 GPUs for prefiller + 2 for decoder in 2d2p setup or 2 NPUs for prefiller + 2 for decoder in 2d2p setup) |
'At least 4 GPUs' does not match 'or 2 NPUs for prefiller + 2 for decoder in 2d2p setup': the leading requirement mentions only GPUs, while the parenthetical also allows NPUs.


Purpose
Correct some errors in the docs.
Modifications
Correct some errors in the docs.
Test
Correct some errors in the docs.