-
Notifications
You must be signed in to change notification settings - Fork 431
Open
Description
Problem
The GPU Driver Upgrades documentation states:
Upgrade the driver by changing the
driver.versionvalue in the cluster policy
This works on Kubernetes (Helm) but fails on OpenShift (OLM).
Behavior
On OpenShift, change only driver.version:
spec:
driver:
version: "570.172.08"The above configuration results in:
- Invalid driver image path:
/:570.172.08-rhel9.6 - Image pull fails
- Driver pods fail to start
Required workaround:
Provide values for all driver image properties:
spec:
driver:
repository: nvcr.io/nvidia
image: driver
version: "570.172.08"Differences
- Helm: Populates default
repositoryandimagevalues from chart into ClusterPolicy - OLM: ClusterPolicy has no defaults; operator relies on static CSV environment variables
Request
I need your input before suggesting a solution. It looks like either:
- The code must be fixed: Provide default values for OLM deployments to match Helm behavior
- The docs must be fixed: Document that on OpenShift all three fields (
repository,image,version) are required
Environment
- OpenShift with GPU Operator installed via OLM
srikantt
Metadata
Metadata
Assignees
Labels
No labels