RPC component goes down, possibly due to a docker grpc error?
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
fuel-ccp | New | Undecided | Unassigned |
Bug Description
Hi everyone,
Sometimes an RPC container stops after running properly for a few hours.
I suspect this issue is caused by a docker error.
Has anyone seen a similar issue?
I am using Docker v1.13.1
I'm trying to deploy each component one by one.
The following components have been started.
# ccp status
+------
| service | pod | job | ready | links |
+------
| database | 3/3 | 0/0 | ok | http://
| etcd | 1/1 | 0/0 | ok | http://
| | | | | http://
| keystone | 1/1 | 7/7 | ok | http://
| | | | | http://
| memcached | 1/1 | 0/0 | ok | http://
| notifications | 1/1 | 0/0 | ok | http://
| nova-api | 1/1 | 20/20 | ok | http://
| | | | | http://
| nova-conductor | 1/1 | 0/0 | ok | |
| nova-consoleauth | 1/1 | 0/0 | ok | |
| nova-novncproxy | 1/1 | 0/0 | ok | http://
| rpc | 3/3 | 0/0 | ok | http://
+------
But after a few hours, one of the three rpc pods stopped, as shown below.
# ccp status
+------
| service | pod | job | ready | links |
+------
| database | 3/3 | 0/0 | ok | http://
| etcd | 1/1 | 0/0 | ok | http://
| | | | | http://
| keystone | 1/1 | 7/7 | ok | http://
| | | | | http://
| memcached | 1/1 | 0/0 | ok | http://
| notifications | 1/1 | 0/0 | ok | http://
| nova-api | 1/1 | 20/20 | ok | http://
| | | | | http://
| nova-conductor | 1/1 | 0/0 | ok | |
| nova-consoleauth | 1/1 | 0/0 | ok | |
| nova-novncproxy | 1/1 | 0/0 | ok | http://
| rpc | 2/3 | 0/0 | wip | http://
+------
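To spot this kind of regression without reading the table by eye, the status output can be scanned for services whose pod column shows fewer ready pods than desired. The following is only a minimal sketch: the function name `degraded_services` is hypothetical, and the column layout (service in the first column, pod counts in the second) is assumed from the output above rather than from any documented ccp format.

```python
def degraded_services(status_output: str) -> list[str]:
    """Return services whose pod column shows fewer ready pods than desired."""
    degraded = []
    for line in status_output.splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        # Assumed data row layout: '', service, pod, job, ready, links...
        # Borders, the header, and continuation rows are skipped here.
        if len(cells) < 5 or "/" not in cells[2]:
            continue
        ready, _, desired = cells[2].partition("/")
        if (cells[1] and ready.isdigit() and desired.isdigit()
                and int(ready) < int(desired)):
            degraded.append(cells[1])
    return degraded


sample = """\
| service | pod | job | ready | links |
| database | 3/3 | 0/0 | ok | http://
| | | | | http://
| rpc | 2/3 | 0/0 | wip | http://
"""
print(degraded_services(sample))  # ['rpc']
```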
Here is part of the output of the command "kubectl -n ccp get pod":
# kubectl -n ccp get pod
NAME READY STATUS RESTARTS AGE
*snip*
rpc-1937807526-
rpc-1937807526-
rpc-1937807526-
*snip*
I checked the log with the command "kubectl -n ccp logs rpc-1937807526-
The log has no entries after "2017-06-07 20:57:39".
"2017-06-07 20:57:39" corresponds to 2017-06-08 05:57:39 in JST (a 9-hour offset).
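The time conversion above can be double-checked with a short script. This is a sketch under the assumption that the container log timestamps are in UTC; the 9-hour offset matches, but the log itself does not state its timezone. The helper name `utc_to_jst` is made up for illustration.

```python
from datetime import datetime, timedelta, timezone

# JST is a fixed UTC+9 offset with no daylight saving time.
JST = timezone(timedelta(hours=9))


def utc_to_jst(stamp: str) -> str:
    """Convert a 'YYYY-MM-DD HH:MM:SS' string, assumed to be UTC, to JST."""
    utc_time = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S").replace(
        tzinfo=timezone.utc
    )
    return utc_time.astimezone(JST).strftime("%Y-%m-%d %H:%M:%S")


print(utc_to_jst("2017-06-07 20:57:39"))  # 2017-06-08 05:57:39
```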
# kubectl -n ccp logs rpc-1937807526-
*snip*
[readiness:29629] DIAGNOSTICS
[readiness:29629] ===========
[readiness:29629]
[readiness:29629] attempted to contact: ['rabbit@
[readiness:29629] * connected to epmd (port 4369) on 172.30.31.6
[readiness:29629] * node rabbit@172.30.31.6 up, 'rabbit' application running
[readiness:29629]
[readiness:29629] current node details:
[readiness:29629] - node name: '<email address hidden>'
[readiness:29629] - home dir: .
[readiness:29629] - cookie hash: cPeI/H+
I also checked syslog and found the following docker errors:
Jun 8 05:57:08 cent-ccp01 dockerd: time="2017-
Jun 8 05:57:08 cent-ccp01 dockerd: time="2017-
Jun 8 05:57:08 cent-ccp01 dockerd: time="2017-
These docker errors were logged at about the same time the RPC component log stopped (05:57:08 versus 05:57:39 JST).
My guess is that the docker error brought the RPC container down.
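To make the timing comparison concrete, the gap between a syslog entry and the last container log entry can be computed as follows. This is a rough sketch with several assumptions: syslog timestamps are in JST and carry no year, so the year is passed in explicitly; the container log is assumed to be in UTC; and the helper names `syslog_time` and `gap_seconds` are hypothetical. Month parsing with `%b` also assumes an English locale.

```python
from datetime import datetime, timedelta, timezone

JST = timezone(timedelta(hours=9))  # assumed timezone of the syslog host


def syslog_time(line: str, year: int) -> datetime:
    """Parse the 'Mon D HH:MM:SS' prefix of a syslog line as JST."""
    month, day, clock = line.split()[:3]
    parsed = datetime.strptime(f"{year} {month} {day} {clock}",
                               "%Y %b %d %H:%M:%S")
    return parsed.replace(tzinfo=JST)


def gap_seconds(syslog_line: str, last_log_utc: str, year: int) -> float:
    """Seconds from the syslog entry to the last container log entry."""
    last_log = datetime.strptime(last_log_utc, "%Y-%m-%d %H:%M:%S").replace(
        tzinfo=timezone.utc
    )
    return (last_log - syslog_time(syslog_line, year)).total_seconds()


gap = gap_seconds("Jun 8 05:57:08 cent-ccp01 dockerd:",
                  "2017-06-07 20:57:39", 2017)
print(gap)  # 31.0
```

A gap of about half a minute is consistent with the two events being related, but it does not prove that the docker error caused the container to stop.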
Have you seen a similar issue?
Thanks.