Skip to content

Commit

Permalink
fix: Added GPU node selector
Browse files Browse the repository at this point in the history
This prevents a GPU destined pod ending up on the non-gpu based nodes. This has been seen in the wild even though the resources request has a GPU specified.
  • Loading branch information
drew-viles committed Jan 8, 2024
1 parent ce3b639 commit c72ede4
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion charts/dogkat/templates/gpu-pod.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -14,13 +14,14 @@ metadata:
{{- include "e2e-testing.gpu.labels" . | nindent 4 }}
spec:
restartPolicy: OnFailure
nodeSelector:
nvidia.com/gpu.present: "true"
containers:
- name: nvidia-vectoradd
image: {{.Values.gpu.image.repo}}:{{.Values.gpu.image.tag}}
terminationMessagePath: /dev/termination-log
terminationMessagePolicy: File
imagePullPolicy: IfNotPresent

{{- if .Values.gpu.resources }}
resources:
"nvidia.com/gpu": {{ .Values.gpu.numberOfGPUs }}
Expand Down

0 comments on commit c72ede4

Please sign in to comment.