nvidia-fabricmanager.service activation failed

Bug #2025614 reported by Weichen Wu
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
fabric-manager-525 (Ubuntu)
New
Undecided
Unassigned

Bug Description

[Summary]
  UNIT LOAD ACTIVE SUB DESCRIPTION
-------------------------------------------------------------------------------------
● nvidia-fabricmanager.service loaded failed failed NVIDIA fabric manager service

× nvidia-fabricmanager.service - NVIDIA fabric manager service
     Loaded: loaded (/lib/systemd/system/nvidia-fabricmanager.service; enabled; vendor preset: enabled)
     Active: failed (Result: exit-code) since Mon 2023-07-03 03:21:53 EDT; 2h 7min ago
    Process: 2026 ExecStart=/usr/bin/nv-fabricmanager -c /usr/share/nvidia/nvswitch/fabricmanager.cfg (code=exited, status=1/FAILURE)
        CPU: 7ms

[Failure rate]
1/1

[Additional information]
CID: 201711-25989
SKU: DGX-1 Station
system-manufacturer: NVIDIA
system-product-name: DGX Station
bios-version: 0406
CPU: Intel(R) Xeon(R) CPU E5-2698 v4 @ 2.20GHz (40x)
GPU: 07:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
08:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
0e:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
0f:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:1db2] (rev a1)
nvidia-driver-version: 525.105.17
kernel-version: 5.15.0-1028-nvidia

[Stage]
Issue reported and logs collected at a later stage

Revision history for this message
Weichen Wu (weichenwu) wrote :

Automatically attached

description: updated
Changed in ubuntu:
status: Confirmed → New
Revision history for this message
Weichen Wu (weichenwu) wrote :

Automatically attached

Revision history for this message
Weichen Wu (weichenwu) wrote :

Automatically attached

Revision history for this message
Weichen Wu (weichenwu) wrote :

Automatically attached

Revision history for this message
Weichen Wu (weichenwu) wrote :

Automatically attached

Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Libera.chat.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/2025614/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Revision history for this message
Taihsiang Ho (taihsiangho) wrote :

This should be fine if the service is not up because DGX Station does not incorporate NVSwitch technology, which requires fabric-manager to support, IIRC.

I don't have the live systems for post-mortem. Maybe we can check if the service failed to up because it detects no hardware it should support.

dann frazier (dannf)
affects: ubuntu → fabric-manager-525 (Ubuntu)
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.