Comment 0 for bug 2025614

Revision history for this message
Weichen Wu (weichenwu) wrote :

[Summary]
  UNIT LOAD ACTIVE SUB DESCRIPTION
-------------------------------------------------------------------------------------
● nvidia-fabricmanager.service loaded failed failed NVIDIA fabric manager service

× nvidia-fabricmanager.service - NVIDIA fabric manager service
     Loaded: loaded (/lib/systemd/system/nvidia-fabricmanager.service; enabled; vendor preset: enabled)
     Active: failed (Result: exit-code) since Mon 2023-07-03 03:21:53 EDT; 2h 7min ago
    Process: 2026 ExecStart=/usr/bin/nv-fabricmanager -c /usr/share/nvidia/nvswitch/fabricmanager.cfg (code=exited, status=1/FAILURE)
        CPU: 7ms

[Failure rate]
1/1

[Additional information]
CID: 201711-25989
SKU: DGX-1 Station
system-manufacturer: NVIDIA
system-product-name: DGX Station
bios-version: 0406
CPU: Intel(R) Xeon(R) CPU E5-2698 v4 @ 2.20GHz (40x)
GPU: 07:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
08:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
0e:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
0f:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
nvidia-driver-version: 525.105.17
kernel-version: 5.15.0-1028-nvidia

[Stage]
Issue reported and logs collected at a later stage