Reinstalling nvidia drivers requires a reboot
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
Containerd Subordinate Charm | Fix Released | Medium | Unassigned | |
Bug Description
When the nvidia drivers are reinstalled either through the upgrade-packages action or through the charm, the worker will need to be rebooted in order to use the GPU. We should surface this need to the users by setting a message (and possibly blocked status) in juju status.
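As a rough illustration of how the reboot requirement could be detected and surfaced, here is a minimal sketch. It assumes the charm records a timestamp when it reinstalls the drivers and compares it against the machine's boot time (derived from `/proc/uptime`). The function names, the status strings, and the message text are hypothetical and are not taken from the charm's code.

```python
from typing import Optional, Tuple


def read_boot_time(uptime_text: str, now: float) -> float:
    """Derive the boot timestamp from the content of /proc/uptime.

    The first field of /proc/uptime is the number of seconds since boot.
    """
    uptime_seconds = float(uptime_text.split()[0])
    return now - uptime_seconds


def reboot_status(reinstalled_at: Optional[float], boot_time: float) -> Tuple[str, str]:
    """Return a hypothetical (status, message) pair to show in juju status.

    reinstalled_at is the time the drivers were last reinstalled, or None
    if the charm has never reinstalled them.
    """
    if reinstalled_at is not None and reinstalled_at > boot_time:
        # Drivers changed after the last boot, so the new driver is not in use yet.
        return ("blocked", "NVIDIA drivers reinstalled; reboot required to use the GPU")
    return ("active", "")
```

In a charm, the returned pair would be passed to whatever status-setting call the framework provides; whether the unit should be blocked or only carry a message is the open question raised above.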
This requirement also highlights the issue found in LP#1982894, where changing the sources or the packages will always cause a reinstall. A check should be added to see whether what should be installed has actually changed prior to uninstalling.
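The kind of check meant here could look something like the sketch below. It assumes the charm can persist the sources and packages it last applied and compare them with the newly requested configuration. `desired_state` and `needs_reinstall` are hypothetical helpers, not existing charm functions, and treating the sources as an unordered set is an assumption.

```python
from typing import Iterable, Optional


def desired_state(sources: Iterable[str], packages: Iterable[str]) -> dict:
    """Normalize the requested sources and packages so ordering and duplicates do not matter."""
    return {
        "sources": sorted(set(sources)),
        "packages": sorted(set(packages)),
    }


def needs_reinstall(previous: Optional[dict], sources: Iterable[str], packages: Iterable[str]) -> bool:
    """Return True only when the requested state differs from what was last applied.

    previous is the state saved after the last install, or None on first install.
    """
    return previous != desired_state(sources, packages)
```

With a check like this, a config change that only reorders the package list would not trigger an uninstall, and therefore would not force a reboot.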
Additionally, we probably shouldn't be autoremoving stuff that isn't relevant to what we are trying to do. For example, I have seen a kernel get autoremoved through this functionality. We should attempt to limit the autoremoval to just what our needs are.
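One possible way to limit the removal, sketched under the assumption that the charm can obtain the list of packages apt proposes to autoremove: filter that list down to names matching driver-related prefixes before removing anything. The prefix list and the `limit_removals` helper are hypothetical and only for illustration.

```python
from typing import Iterable, List, Sequence

# Hypothetical prefixes for packages the charm is responsible for.
DRIVER_PREFIXES = ("nvidia-", "libnvidia-", "cuda-")


def limit_removals(candidates: Iterable[str], prefixes: Sequence[str] = DRIVER_PREFIXES) -> List[str]:
    """Keep only removal candidates whose names start with a driver-related prefix.

    Anything else, such as a kernel package, is left installed.
    """
    allowed = tuple(prefixes)
    return sorted(name for name in candidates if name.startswith(allowed))
```

For example, given the candidates `linux-image-5.15.0-46-generic`, `nvidia-driver-515` and `libnvidia-gl-515`, only the two NVIDIA packages would be selected for removal.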
description: | updated |
Changed in charm-containerd: | |
status: | New → Confirmed |
importance: | Undecided → Medium |
milestone: | none → 1.26 |
Changed in charm-containerd: | |
status: | Confirmed → Fix Committed |
Changed in charm-containerd: | |
status: | Fix Committed → Fix Released |
Provides a warning on upgrade that a reboot may be needed: https://github.com/charmed-kubernetes/charm-containerd/pull/80