HA xenial/mitaka deployment failed on a subprocess call:
2017-10-31 19:52:34 DEBUG identity-service-relation-changed subprocess.CalledProcessError: Command '['reactive/designate_utils.py', 'domain-get', '--domain-name', 'mojo.serverstack.com.']' returned non-zero exit status 1
2017-10-31 19:52:34 ERROR juju.worker.uniter.operation runhook.go:107 hook "identity-service-relation-changed" failed: exit status 1
The corresponding message in the designate-api logs:
Oct 31 19:52:33 juju-f9e7e2-designate-9 designate-api[25593]: 2017-10-31 19:52:33.802 25593 ERROR designate.api.middleware MessagingTimeout: Timed out waiting for a reply to message ID cb0b0db5fc114430b253d
Oct 31 19:52:33 juju-f9e7e2-designate-9 designate-api[25593]: 2017-10-31 19:52:33.802 25593 ERROR designate.api.middleware
This looks like a race as the proceeding call ensures the api is up and running, so something could have gone done between the api check and the domain-get. The only other thing that stands out is this message from the designate-central service which looks like it has hung:
journalctl -u designate-central.service
Oct 31 17:51:16 juju-f9e7e2-designate-9 designate-central[13381]: 2017-10-31 17:51:16.118 13381 WARNING oslo_messaging.server [-] Possible hang: stop is waiting for start to complete
Oct 31 19:43:22 juju-f9e7e2-designate-9 systemd[1]: Stopping OpenStack Designate DNSaaS central...
http://lists.openstack.org/pipermail/openstack-dev/2017-August/120386.html
but it looks like retrieving the domain id does not rely on the central service
The rabbit units appear to have been up and functioning fine when the error occurred.
Another failure on HA xenial/mitaka deployment. This time looks like an issue talking to keystone:
2017-11-22 12:09:09 INFO juju-log identity- service: 80: Retrying 'ensure_ api_responding' 1 more times (delay=50) service: 80: Checking API service is responding service- relation- changed Traceback (most recent call last): service- relation- changed File "reactive/ designate_ utils.py" , line 169, in <module> service- relation- changed commands[ args.command] () service- relation- changed File "reactive/ designate_ utils.py" , line 137, in display_servers service- relation- changed for server in get_servers(): service- relation- changed File "reactive/ designate_ utils.py" , line 121, in get_servers service- relation- changed out, err = run_command(cmd) service- relation- changed File "reactive/ designate_ utils.py" , line 34, in run_command service- relation- changed cmd, p.returncode, out, err)) service- relation- changed RuntimeError: ['designate', 'server-list', '-f', 'value'] failed, status code 1 stdout b'' stderr b'/usr/ lib/python2. 7/dist- packages/ designateclient /cli/base. py:38: DeprecationWarning: The "designate" CLI is being deprecated in favour of the "openstack" CLI plugin. All designate API v2 commands are implemented there. When the v1 API is removed this CLI will stop functioning\n DeprecationWarn ing)\nERROR: Unable to establish connection to http:// 10.5.100. 2:35357/ v2.0/tokens: HTTPConnectionP ool(host= \'10.5. 100.2\' , port=35357): Max retries exceeded with url: /v2.0/tokens (Caused by NewConnectionEr ror(\'< requests. packages. urllib3. connection. HTTPConnection object at 0x7fbf7eae8d90>: Failed to establish a new connection: [Errno 113] No route to host\',))\n' designate_ utils.py' , 'server-list']' returned non-zero exit status 1 uniter. operation runhook.go:107 hook "identity- service- relation- changed" failed: exit status 1
2017-11-22 12:09:59 WARNING juju-log identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
2017-11-22 12:10:07 DEBUG identity-
<snip>Command '['reactive/
2017-11-22 12:10:07 ERROR juju.worker.