Third-party Mellanox OFED 5.8-3.0.7.1 fails on kernels above 5.15.0-82-generic
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Kernel SRU Workflow | New | Undecided | Unassigned | | |
Bug Description
Description:
Nvidia Mellanox MLNX_OFED_
Issue:
beegfs client kernel module refuses to be inserted (insmod, modprobe).
Syslog:
The BeeGFS client fails to load, and syslog shows:
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_resolve_addr
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_resolve_addr (err -22)
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_set_
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_set_
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_reject
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_reject (err -22)
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_disconnect
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_disconnect (err -22)
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol __rdma_
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol __rdma_
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_resolve_route
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_resolve_route (err -22)
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_bind_addr
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_bind_addr (err -22)
Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_create_qp
Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_create_qp (err -22)
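To see at a glance which symbols the module loader is rejecting, messages like the ones above can be reduced to a list of symbol names. This is a minimal sketch in Python, assuming the message format shown in this report. The function name `mismatched_symbols` and the sample lines are illustrative only and are not part of any BeeGFS or MOFED tooling.

```python
import re

# Matches kernel messages of the form
#   "<module>: disagrees about version of symbol <name>"
# The leading timestamp/host/"kernel:" prefix is ignored.
_MISMATCH = re.compile(r"(\S+): disagrees about version of symbol (\S+)")


def mismatched_symbols(lines, module="beegfs"):
    """Return the sorted, de-duplicated symbols that `module` disagrees about."""
    found = set()
    for line in lines:
        m = _MISMATCH.search(line)
        if m and m.group(1) == module:
            found.add(m.group(2))
    return sorted(found)


if __name__ == "__main__":
    # Sample lines copied from the syslog excerpt in this report.
    sample = [
        "Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_resolve_addr",
        "Sep 21 12:51:57 n017 kernel: beegfs: Unknown symbol rdma_resolve_addr (err -22)",
        "Sep 21 12:51:57 n017 kernel: beegfs: disagrees about version of symbol rdma_reject",
    ]
    for name in mismatched_symbols(sample):
        print(name)
```

On an affected node the input could come from `/var/log/syslog` or `journalctl -k`. The resulting list is the set of RDMA symbols whose version checksums differ between what the beegfs module was built against and what the loaded kernel or OFED modules export.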
Workaround:
Kernels up to and including 5.15.0-82-generic work fine.
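For anyone pinning kernels across a cluster, a small check against the last release reported as working may help. This is a sketch only, assuming Ubuntu's `<version>-<abi>-<flavour>` release naming as printed by `uname -r`. The helper names are hypothetical, and the threshold is taken from this report, not from any official compatibility statement.

```python
import re

# Last kernel release reported as working in this bug report.
LAST_KNOWN_GOOD = "5.15.0-82-generic"


def release_key(release):
    """Split a release like '5.15.0-82-generic' into a comparable integer tuple."""
    m = re.match(r"(\d+)\.(\d+)\.(\d+)-(\d+)", release)
    if m is None:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return tuple(int(part) for part in m.groups())


def is_reported_working(release, last_good=LAST_KNOWN_GOOD):
    """True if `release` is not newer than the last release reported as working."""
    return release_key(release) <= release_key(last_good)


if __name__ == "__main__":
    for rel in ("5.15.0-82-generic", "5.15.0-84-generic"):
        print(rel, is_reported_working(rel))
```

The comparison is numeric per field, so an ABI number such as 9 sorts before 82 rather than after it, as a plain string comparison would have it.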
Changes:
Looking through diffs of config-
Third party bug report:
Reported as ThinkParQ RT #12388: BeeGFS stopped working after kernel upgrade. Also confirmed by ThinkParQ.
https:/
Has anything changed in the InfiniBand/OFED code in these kernels that affects third-party MOFED modules?
--Tore