DGX-A100 blog post #1

Open
wants to merge 13 commits into
base: master

Conversation

kpouget
Collaborator

@kpouget kpouget commented Jun 14, 2022

No description provided.

help of NFD labels, the GPU Operator will automatically detect the DGX
A100 node and install (via containers) the necessary drivers, services
and CUDA libraries to run GPU workload.
Do not forget to create the default `ClusterPolicy` custom resource

It would be good to explain why you need to create the default ClusterPolicy instead of just saying "don't forget to do it". Also, there is a '.' missing at the end of the sentence.

Collaborator Author

@kpouget kpouget Jun 14, 2022

Hmm, now I'm wondering whether it makes sense to keep it ... There's already a link to the documentation, so maybe it doesn't make sense to stress one particular point.

If it's a common mistake, I would keep it. Either way, I think it's good to have one sentence explaining why you are creating the default ClusterPolicy. It makes the blog post more complete and easier to understand.

Collaborator Author

changed to:

Do not forget the last step, the creation of the ClusterPolicy custom resource! This object defines the configuration of the GPU stack, and without it, nothing will be deployed on the nodes!
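
As a rough illustration of why this step matters, here is a minimal sketch of how one could check that a ClusterPolicy object actually exists. It assumes the JSON comes from something like `oc get clusterpolicy -o json`, which returns a standard Kubernetes `List` document. The function name `cluster_policy_names` and the sample object name `gpu-cluster-policy` are hypothetical, chosen only for this example.

```python
import json
from typing import List


def cluster_policy_names(list_json: str) -> List[str]:
    """Return the names of the ClusterPolicy objects in a Kubernetes List document."""
    doc = json.loads(list_json)
    return [
        item["metadata"]["name"]
        for item in doc.get("items", [])
        if item.get("kind") == "ClusterPolicy"
    ]


# Hypothetical sample, shaped like the output of a `get ... -o json` call.
sample = json.dumps({
    "apiVersion": "v1",
    "kind": "List",
    "items": [
        {
            "apiVersion": "nvidia.com/v1",
            "kind": "ClusterPolicy",
            "metadata": {"name": "gpu-cluster-policy"},
        }
    ],
})

if __name__ == "__main__":
    names = cluster_policy_names(sample)
    if names:
        print("ClusterPolicy found:", ", ".join(names))
    else:
        print("No ClusterPolicy found: the GPU stack will not be deployed.")
```

An empty result would match the situation described above: the operator is installed, but nothing gets deployed on the nodes.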

kpouget and others added 9 commits June 14, 2022 18:25
Co-authored-by: David Gray <40244437+dagrayvid@users.noreply.github.com>
Request from NVIDIA Product Management to remove the dash
Good to place the product name in the conclusion.
Fix DGX A100 name.
4 participants