Activity log for bug #2029417

Date Who What changed Old value New value Message
2023-08-02 20:47:39 Joao Andre Simioni bug added bug
2023-08-02 20:49:08 Alan Baghumian bug added subscriber Alan Baghumian
2023-08-02 22:00:57 Mauricio Faria de Oliveira bug added subscriber Mauricio Faria de Oliveira
2023-08-03 06:33:32 Jerzy Husakowski tags bug-council
2023-08-03 17:17:12 Adam Collard summary After applying fix on LP#2027735 RPC Communitation is failing RPC failure to contact rack/region - operations on closed handler
2023-08-03 17:17:23 Adam Collard description [Problem Description] After applying the fixes proposed in LP#2027735 to MAAS 3.2.8 (taken from ppa:r00ta/maas-2027735), MAAS started to behave well, with the expected improved performance. But after around ~24 hours, provisioning of nodes started to fail, and the following traces were seen in: rackd.log: ---------- 2023-07-31 23:16:36 provisioningserver.rpc.clusterservice: [critical] Failed to contact region. (While requesting RPC info at http://10.217.0.11:5240/MAAS/, http://10.217.0.66:5240/MAAS/). Traceback (most recent call last): File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 460, in callback self._startRunCallbacks(result) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 568, in _startRunCallbacks self._runCallbacks() File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1475, in gotResult _inlineCallbacks(r, g, status) --- <exception caught here> --- File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1292, in _doUpdate eventloops, maas_url = yield self._get_rpc_info(urls) File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1549, in _get_rpc_info raise config_exc File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1520, in _get_rpc_info eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1494, in handle_responses errors[0].raiseException() File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException raise self.value.with_traceback(self.tb) File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1455, in _serial_fetch_rpc_info raise last_exc File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1447, in _serial_fetch_rpc_info response = yield self._fetch_rpc_info(url, orig_url) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks result = result.throwExceptionIntoGenerator(g) File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator return g.throw(self.type, self.value, self.tb) File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1549, in _get_rpc_info raise config_exc File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1520, in _get_rpc_info eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1494, in handle_responses errors[0].raiseException() File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException raise self.value.with_traceback(self.tb) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks result = result.throwExceptionIntoGenerator(g) File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator return g.throw(self.type, self.value, self.tb) File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1455, in _serial_fetch_rpc_info raise last_exc File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1447, in _serial_fetch_rpc_info response = yield self._fetch_rpc_info(url, orig_url) twisted.internet.error.ConnectingCancelledError: HostnameAddress(hostname=b'10.217.0.11', port=5240) 2023-07-31 23:16:36 provisioningserver.rpc.common: [debug] [RPC -> sent] AmpBox({b'_command': b'Ping'}) regiond.log: ------------ 2023-07-31 23:17:23 maasserver.dhcp: [critical] Error configuring DHCPv6 on rack controller 'pdx01-m01-c34-cpu-01 (xfhrbn)': unable to perform operation on <UVPoll closed=True 0x7f33f5cf0660>; the handler is closed Traceback (most recent call last): File "/usr/lib/python3/dist-packages/provisioningserver/prometheus/utils.py", line 127, in wrapper result = func(*args, **kwargs) File "/usr/lib/python3/dist-packages/provisioningserver/utils/twisted.py", line 127, in wrapper return func(*args, **kwargs) File "/usr/lib/python3/dist-packages/provisioningserver/rpc/common.py", line 176, in __call__ return deferWithTimeout( File "/usr/lib/python3/dist-packages/provisioningserver/utils/twisted.py", line 325, in deferWithTimeout d = maybeDeferred(func, *args, **kwargs) --- <exception caught here> --- File "/usr/lib/python3/dist-packages/maasserver/dhcp.py", line 898, in configure_dhcp yield client( File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 151, in maybeDeferred result = f(*args, **kw) File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 971, in callRemote return co._doCommand(self) File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 2000, in _doCommand d = proto._sendBoxCommand(self.commandName, File "/usr/lib/python3/dist-packages/provisioningserver/rpc/common.py", line 261, in _sendBoxCommand return super()._sendBoxCommand( File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 902, in _sendBoxCommand box._sendTo(self.boxSender) File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 723, in _sendTo proto.sendBox(self) File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 2386, in sendBox self.transport.write(box.serialize()) File "/usr/lib/python3/dist-packages/twisted/internet/_newtls.py", line 191, in write FileDescriptor.write(self, bytes) File "/usr/lib/python3/dist-packages/twisted/internet/abstract.py", line 356, in write self.startWriting() File "/usr/lib/python3/dist-packages/twisted/internet/abstract.py", line 443, in startWriting self.reactor.addWriter(self) File "/usr/lib/python3/dist-packages/twisted/internet/asyncioreactor.py", line 173, in addWriter self._asyncioEventloop.add_writer(fd, callWithLogger, writer, File "uvloop/loop.pyx", line 2399, in uvloop.loop.Loop.add_writer File "uvloop/loop.pyx", line 808, in uvloop.loop.Loop._add_writer File "uvloop/handles/poll.pyx", line 122, in uvloop.loop.UVPoll.start_writing File "uvloop/handles/poll.pyx", line 39, in uvloop.loop.UVPoll._poll_start File "uvloop/handles/handle.pyx", line 159, in uvloop.loop.UVHandle._ensure_alive builtins.RuntimeError: unable to perform operation on <UVPoll closed=True 0x7f33f5cf0660>; the handler is closed 2023-07-31 23:17:23 maasserver.rack_controller: [critical] Failed configuring DHCP on rack controller 'id:12'. Traceback (most recent call last): File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1475, in gotResult _inlineCallbacks(r, g, status) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1464, in _inlineCallbacks status.deferred.errback() File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 501, in errback self._startRunCallbacks(fail) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 568, in _startRunCallbacks self._runCallbacks() --- <exception caught here> --- File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/usr/lib/python3/dist-packages/maasserver/rack_controller.py", line 281, in <lambda> d.addErrback(lambda f: f.trap(NoConnectionsAvailable)) File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 439, in trap self.raiseException() File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException raise self.value.with_traceback(self.tb) File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1418, in _inlineCallbacks result = g.send(result) File "/usr/lib/python3/dist-packages/maasserver/dhcp.py", line 951, in configure_dhcp raise ipv4_exc File "/usr/lib/python3/dist-packages/maasserver/dhcp.py", line 869, in configure_dhcp yield client( builtins.RuntimeError: unable to perform operation on <UVPoll closed=True 0x7f33f5cf0660>; the handler is closed Ubuntu version: 20.04 MAAS: 3.2.99 (Interim version from PPA) Format: Debian PostgreSQL 12 [Problem Description] After applying the fixes proposed in LP:2027735 to MAAS 3.2.8 (taken from ppa:r00ta/maas-2027735), MAAS started to behave well, with the expected improved performance. But after around ~24 hours, provisioning of nodes started to fail, and the following traces were seen in: rackd.log: ---------- 2023-07-31 23:16:36 provisioningserver.rpc.clusterservice: [critical] Failed to contact region. (While requesting RPC info at http://10.217.0.11:5240/MAAS/, http://10.217.0.66:5240/MAAS/).  Traceback (most recent call last):    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 460, in callback      self._startRunCallbacks(result)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 568, in _startRunCallbacks      self._runCallbacks()    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks      current.result = callback(current.result, *args, **kw)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1475, in gotResult      _inlineCallbacks(r, g, status)  --- <exception caught here> ---    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1292, in _doUpdate      eventloops, maas_url = yield self._get_rpc_info(urls)    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1549, in _get_rpc_info      raise config_exc    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1520, in _get_rpc_info      eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks      current.result = callback(current.result, *args, **kw)    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1494, in handle_responses      errors[0].raiseException()    File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException      raise self.value.with_traceback(self.tb)    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1455, in _serial_fetch_rpc_info      raise last_exc    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1447, in _serial_fetch_rpc_info      response = yield self._fetch_rpc_info(url, orig_url)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks      result = result.throwExceptionIntoGenerator(g)    File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator      return g.throw(self.type, self.value, self.tb)    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1549, in _get_rpc_info      raise config_exc    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1520, in _get_rpc_info      eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks      current.result = callback(current.result, *args, **kw)    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1494, in handle_responses      errors[0].raiseException()    File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException      raise self.value.with_traceback(self.tb)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks      result = result.throwExceptionIntoGenerator(g)    File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator      return g.throw(self.type, self.value, self.tb)    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1455, in _serial_fetch_rpc_info      raise last_exc    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/clusterservice.py", line 1447, in _serial_fetch_rpc_info      response = yield self._fetch_rpc_info(url, orig_url)  twisted.internet.error.ConnectingCancelledError: HostnameAddress(hostname=b'10.217.0.11', port=5240) 2023-07-31 23:16:36 provisioningserver.rpc.common: [debug] [RPC -> sent] AmpBox({b'_command': b'Ping'}) regiond.log: ------------ 2023-07-31 23:17:23 maasserver.dhcp: [critical] Error configuring DHCPv6 on rack controller 'pdx01-m01-c34-cpu-01 (xfhrbn)': unable to perform operation on <UVPoll closed=True 0x7f33f5cf0660>; the handler is closed  Traceback (most recent call last):    File "/usr/lib/python3/dist-packages/provisioningserver/prometheus/utils.py", line 127, in wrapper      result = func(*args, **kwargs)    File "/usr/lib/python3/dist-packages/provisioningserver/utils/twisted.py", line 127, in wrapper      return func(*args, **kwargs)    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/common.py", line 176, in __call__      return deferWithTimeout(    File "/usr/lib/python3/dist-packages/provisioningserver/utils/twisted.py", line 325, in deferWithTimeout      d = maybeDeferred(func, *args, **kwargs)  --- <exception caught here> ---    File "/usr/lib/python3/dist-packages/maasserver/dhcp.py", line 898, in configure_dhcp      yield client(    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 151, in maybeDeferred      result = f(*args, **kw)    File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 971, in callRemote      return co._doCommand(self)    File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 2000, in _doCommand      d = proto._sendBoxCommand(self.commandName,    File "/usr/lib/python3/dist-packages/provisioningserver/rpc/common.py", line 261, in _sendBoxCommand      return super()._sendBoxCommand(    File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 902, in _sendBoxCommand      box._sendTo(self.boxSender)    File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 723, in _sendTo      proto.sendBox(self)    File "/usr/lib/python3/dist-packages/twisted/protocols/amp.py", line 2386, in sendBox      self.transport.write(box.serialize())    File "/usr/lib/python3/dist-packages/twisted/internet/_newtls.py", line 191, in write      FileDescriptor.write(self, bytes)    File "/usr/lib/python3/dist-packages/twisted/internet/abstract.py", line 356, in write      self.startWriting()    File "/usr/lib/python3/dist-packages/twisted/internet/abstract.py", line 443, in startWriting      self.reactor.addWriter(self)    File "/usr/lib/python3/dist-packages/twisted/internet/asyncioreactor.py", line 173, in addWriter      self._asyncioEventloop.add_writer(fd, callWithLogger, writer,    File "uvloop/loop.pyx", line 2399, in uvloop.loop.Loop.add_writer    File "uvloop/loop.pyx", line 808, in uvloop.loop.Loop._add_writer    File "uvloop/handles/poll.pyx", line 122, in uvloop.loop.UVPoll.start_writing    File "uvloop/handles/poll.pyx", line 39, in uvloop.loop.UVPoll._poll_start    File "uvloop/handles/handle.pyx", line 159, in uvloop.loop.UVHandle._ensure_alive  builtins.RuntimeError: unable to perform operation on <UVPoll closed=True 0x7f33f5cf0660>; the handler is closed 2023-07-31 23:17:23 maasserver.rack_controller: [critical] Failed configuring DHCP on rack controller 'id:12'.  Traceback (most recent call last):    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1475, in gotResult      _inlineCallbacks(r, g, status)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1464, in _inlineCallbacks      status.deferred.errback()    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 501, in errback      self._startRunCallbacks(fail)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 568, in _startRunCallbacks      self._runCallbacks()  --- <exception caught here> ---    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks      current.result = callback(current.result, *args, **kw)    File "/usr/lib/python3/dist-packages/maasserver/rack_controller.py", line 281, in <lambda>      d.addErrback(lambda f: f.trap(NoConnectionsAvailable))    File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 439, in trap      self.raiseException()    File "/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException      raise self.value.with_traceback(self.tb)    File "/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1418, in _inlineCallbacks      result = g.send(result)    File "/usr/lib/python3/dist-packages/maasserver/dhcp.py", line 951, in configure_dhcp      raise ipv4_exc    File "/usr/lib/python3/dist-packages/maasserver/dhcp.py", line 869, in configure_dhcp      yield client(  builtins.RuntimeError: unable to perform operation on <UVPoll closed=True 0x7f33f5cf0660>; the handler is closed Ubuntu version: 20.04 MAAS: 3.2.99 (Interim version from PPA) Format: Debian PostgreSQL 12
2023-08-08 09:13:52 Björn Tillenius maas: assignee Jacopo Rota (r00ta)
2023-08-08 09:13:55 Björn Tillenius maas: status New In Progress
2023-08-09 14:07:18 Jacopo Rota bug watch added https://github.com/MagicStack/uvloop/issues/552
2023-08-14 16:28:20 Björn Tillenius maas: status In Progress Incomplete
2023-08-14 18:42:34 Sai Vemulapalli bug added subscriber Sai Vemulapalli
2023-08-14 21:59:01 Jacopo Rota attachment added connectionsfull.png https://bugs.launchpad.net/maas/+bug/2029417/+attachment/5692254/+files/connectionsfull.png
2023-08-15 09:23:23 Adam Collard merge proposal linked https://code.launchpad.net/~igor-brovtsin/maas/+git/maas/+merge/449183
2023-08-17 15:37:44 Jack Lloyd-Walters maas: status Incomplete Confirmed
2023-08-17 15:43:54 Jack Lloyd-Walters merge proposal linked https://code.launchpad.net/~lloydwaltersj/maas/+git/maas/+merge/449366
2023-08-17 15:52:18 Jack Lloyd-Walters maas: status Confirmed In Progress
2023-08-24 08:03:19 Adam Collard nominated for series maas/3.3
2023-08-24 08:03:19 Adam Collard bug task added maas/3.3
2023-08-24 08:03:19 Adam Collard nominated for series maas/3.4
2023-08-24 08:03:19 Adam Collard bug task added maas/3.4
2023-08-24 08:03:19 Adam Collard nominated for series maas/3.2
2023-08-24 08:03:19 Adam Collard bug task added maas/3.2
2023-08-24 09:14:35 Jacopo Rota maas/3.2: assignee Jacopo Rota (r00ta)
2023-08-24 09:14:38 Jacopo Rota maas/3.3: assignee Jacopo Rota (r00ta)
2023-08-24 09:14:40 Jacopo Rota maas/3.4: assignee Jacopo Rota (r00ta)
2023-08-24 09:15:34 Jacopo Rota maas: milestone 3.5.0
2023-08-24 09:15:40 Jacopo Rota maas: status In Progress Fix Committed
2023-08-24 09:17:37 Jacopo Rota maas: status Fix Committed In Progress
2023-08-24 09:17:41 Jacopo Rota maas/3.2: milestone 3.2.x
2023-08-24 09:17:45 Jacopo Rota maas/3.3: milestone 3.3.x
2023-08-24 09:17:53 Jacopo Rota maas/3.4: milestone 3.4.x
2023-08-24 09:30:54 Jacopo Rota maas/3.2: status New Fix Committed
2023-08-24 09:30:59 Jacopo Rota maas: importance Undecided High
2023-08-24 09:31:01 Jacopo Rota maas/3.2: importance Undecided High
2023-08-24 09:31:04 Jacopo Rota maas/3.3: importance Undecided High
2023-08-24 09:31:05 Jacopo Rota maas/3.4: importance Undecided High
2023-08-24 09:40:35 Jacopo Rota merge proposal linked https://code.launchpad.net/~r00ta/maas/+git/maas/+merge/449828
2023-08-25 06:25:57 Jacopo Rota attachment added connectionsfull.png https://bugs.launchpad.net/maas/+bug/2029417/+attachment/5694908/+files/connectionsfull.png
2023-08-29 10:12:10 MAAS Lander maas: status In Progress Fix Committed
2023-08-30 16:16:03 Jacopo Rota maas/3.3: status New In Progress
2023-08-30 16:16:07 Jacopo Rota maas/3.4: status New In Progress
2023-08-31 08:19:09 Adam Collard merge proposal linked https://code.launchpad.net/~r00ta/maas/+git/maas/+merge/450266
2023-08-31 08:19:13 Adam Collard merge proposal linked https://code.launchpad.net/~r00ta/maas/+git/maas/+merge/450265
2023-08-31 08:19:27 Adam Collard maas/3.3: status In Progress Fix Committed
2023-08-31 08:19:28 Adam Collard maas/3.4: status In Progress Fix Committed
2023-08-31 08:20:05 Jerzy Husakowski tags bug-council
2023-09-07 08:58:17 Alberto Donato maas/3.4: milestone 3.4.x 3.4.0
2023-09-19 04:17:26 Jeffrey Chang bug added subscriber Jeffrey Chang
2023-09-21 12:16:41 Jacopo Rota merge proposal linked https://code.launchpad.net/~r00ta/maas/+git/maas/+merge/451782
2023-10-03 14:10:03 Alberto Donato merge proposal linked https://code.launchpad.net/~r00ta/maas/+git/maas/+merge/452660
2023-10-09 14:40:11 Alberto Donato maas/3.4: milestone 3.4.0-rc2 3.4.x