System tests: 'ceph_ha_one_controller_compact' and 'migrate_vm_backed_with_ceph', on CI: http://jenkins-product.srt.mirantis.net:8080/job/6.1.system_test.centos.thread_1/32/
When Ceph service started on the controller, one of two ceph-osd processes wasn't started:
[root@node-2 ~]# service ceph status === mon.node-2 === mon.node-2: running {"version":"0.80.7"} === osd.1 === osd.1: not running. === osd.0 === osd.0: running {"version":"0.80.7"} [root@node-2 ~]# echo $? 3
[root@node-1 ~]# ceph status cluster 75098087-bb84-4fe5-8be6-55fc900ef80d health HEALTH_OK monmap e1: 1 mons at {node-2=10.108.17.4:6789/0}, election epoch 1, quorum 0 node-2 osdmap e22: 6 osds: 5 up, 5 in pgmap v46: 1728 pgs, 6 pools, 12859 kB data, 5 objects 10470 MB used, 236 GB / 246 GB avail 1728 active+clean
As was found in the ceph logs, the command 'osd crush create-or-move' never appeared for osd.1:
==== osd.0 is starting: Feb 5 21:53:26 node-2 ceph-mon: 2015-02-05 18:53:26.559788 7eff644ef700 0 mon.node-2@0(leader) e1 handle_command mon_command({"prefix": "auth add", "entity": "osd.0", "caps": ["osd", "allow *", "mon", "allow profile osd"]} v 0) v1 Feb 5 21:53:27 node-2 ceph-mon: 2015-02-05 18:53:27.392325 7eff644ef700 0 mon.node-2@0(leader) e1 handle_command mon_command({"prefix": "osd crush create-or-move", "args": ["host=node-2", "root=default"], "id": 0, " weight": 0.050000000000000003} v 0) v1 Feb 5 21:53:27 node-2 ceph-mon: 2015-02-05 18:53:27.392494 7eff644ef700 0 mon.node-2@0(leader).osd e4 create-or-move crush item name 'osd.0' initial_weight 0.05 at location {host=node-2,root=default}
==== osd.1 is starting: Feb 5 21:53:32 node-2 ceph-mon: 2015-02-05 18:53:32.797480 7eff644ef700 0 mon.node-2@0(leader) e1 handle_command mon_command({"prefix": "auth add", "entity": "osd.1", "caps": ["osd", "allow *", "mon", "allow profile osd"]} v 0) v1 Feb 5 21:53:32 node-2 puppet-user[18429]: (/Stage[main]/Ceph/Service[ceph]) Triggered 'refresh' from 1 events Feb 5 21:53:32 node-2 puppet-user[18429]: (/Stage[main]/Ceph/Service[ceph]) Evaluated in 11.57 seconds Feb 5 21:53:32 node-2 puppet-user[18429]: (Class[Ceph]) Starting to evaluate the resource Feb 5 21:53:33 node-2 puppet-user[18429]: (Class[Ceph]) Evaluated in 0.06 seconds Feb 5 21:53:33 node-2 puppet-user[18429]: (Stage[main]) Starting to evaluate the resource Feb 5 21:53:33 node-2 puppet-user[18429]: (Stage[main]) Evaluated in 0.05 seconds
System tests: 'ceph_ha_ one_controller_ compact' and 'migrate_ vm_backed_ with_ceph' , on CI: http:// jenkins- product. srt.mirantis. net:8080/ job/6.1. system_ test.centos. thread_ 1/32/
When Ceph service started on the controller, one of two ceph-osd processes wasn't started:
[root@node-2 ~]# service ceph status :"0.80. 7"} :"0.80. 7"}
=== mon.node-2 ===
mon.node-2: running {"version"
=== osd.1 ===
osd.1: not running.
=== osd.0 ===
osd.0: running {"version"
[root@node-2 ~]# echo $?
3
[root@node-1 ~]# ceph status bb84-4fe5- 8be6-55fc900ef8 0d 10.108. 17.4:6789/ 0}, election epoch 1, quorum 0 node-2
1728 active+clean
cluster 75098087-
health HEALTH_OK
monmap e1: 1 mons at {node-2=
osdmap e22: 6 osds: 5 up, 5 in
pgmap v46: 1728 pgs, 6 pools, 12859 kB data, 5 objects
10470 MB used, 236 GB / 246 GB avail
As was found in the ceph logs, the command 'osd crush create-or-move' never appeared for osd.1:
==== osd.0 is starting: 2@0(leader) e1 handle_command mon_command( {"prefix" : "auth add", "entity": "osd.0", "caps": ["osd", "allow *", "mon", "allow profile 2@0(leader) e1 handle_command mon_command( {"prefix" : "osd crush create-or-move", "args": ["host=node-2", "root=default"], "id": 0, " 00003} v 0) v1 2@0(leader) .osd e4 create-or-move crush item name 'osd.0' initial_weight 0.05 at location {host=node- 2,root= default}
Feb 5 21:53:26 node-2 ceph-mon: 2015-02-05 18:53:26.559788 7eff644ef700 0 mon.node-
osd"]} v 0) v1
Feb 5 21:53:27 node-2 ceph-mon: 2015-02-05 18:53:27.392325 7eff644ef700 0 mon.node-
weight": 0.0500000000000
Feb 5 21:53:27 node-2 ceph-mon: 2015-02-05 18:53:27.392494 7eff644ef700 0 mon.node-
==== osd.1 is starting: 2@0(leader) e1 handle_command mon_command( {"prefix" : "auth add", "entity": "osd.1", "caps": ["osd", "allow *", "mon", "allow profile main]/Ceph/ Service[ ceph]) Triggered 'refresh' from 1 events main]/Ceph/ Service[ ceph]) Evaluated in 11.57 seconds
Feb 5 21:53:32 node-2 ceph-mon: 2015-02-05 18:53:32.797480 7eff644ef700 0 mon.node-
osd"]} v 0) v1
Feb 5 21:53:32 node-2 puppet-user[18429]: (/Stage[
Feb 5 21:53:32 node-2 puppet-user[18429]: (/Stage[
Feb 5 21:53:32 node-2 puppet-user[18429]: (Class[Ceph]) Starting to evaluate the resource
Feb 5 21:53:33 node-2 puppet-user[18429]: (Class[Ceph]) Evaluated in 0.06 seconds
Feb 5 21:53:33 node-2 puppet-user[18429]: (Stage[main]) Starting to evaluate the resource
Feb 5 21:53:33 node-2 puppet-user[18429]: (Stage[main]) Evaluated in 0.05 seconds