[Summary]
UNIT                           LOAD   ACTIVE SUB    DESCRIPTION
-------------------------------------------------------------------------------------
● nvidia-fabricmanager.service loaded failed failed NVIDIA fabric manager service

× nvidia-fabricmanager.service - NVIDIA fabric manager service
     Loaded: loaded (/lib/systemd/system/nvidia-fabricmanager.service; enabled; vendor preset: enabled)
     Active: failed (Result: exit-code) since Mon 2023-07-03 03:21:53 EDT; 2h 7min ago
    Process: 2026 ExecStart=/usr/bin/nv-fabricmanager -c /usr/share/nvidia/nvswitch/fabricmanager.cfg (code=exited, status=1/FAILURE)
        CPU: 7ms

[Failure rate]
1/1

[Additional information]
CID: 201711-25989
SKU: DGX-1 Station
system-manufacturer: NVIDIA
system-product-name: DGX Station
bios-version: 0406
CPU: Intel(R) Xeon(R) CPU E5-2698 v4 @ 2.20GHz (40x)
GPU: 07:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
     08:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
     0e:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
     0f:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
nvidia-driver-version: 525.105.17
kernel-version: 5.15.0-1028-nvidia

[Stage]
Issue reported and logs collected at a later stage