instance is using wrong scsi disk after a stop/start
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Cinder |
Incomplete
|
High
|
Unassigned |
Bug Description
instance is using wrong scsi disk after a stop/start.
- We have one instance running, booting from
cinder:
- We rebooted the instance and we observed from the console it was
ubuntu (with the same volume ID) which was running since 2 months as a
windows. It is surprising.
- When we create the snapshot of the same volume and launch an instance
then it created a windows VM!
- user dont install OS, we use image to launch the instance using it. we
dont give raw disk for installation of the instance...
Questions:
1. How is this possible that new volume snapshot is creating window
instance but same volume showing Ubuntu VM?
2. How is it possible that volume ID is same but after reboot it
converted from window to ubuntu.
=======
- We found 2 instances using same backend /dev/sdn :
~~~
#grep -ir sdn sos_commands/
sos_commands/
sos_commands/
~~~
instance : bbbbbbbb-
instance : cccccccc-
- instance cccccccc-
~~~
lrwxrwxrwx. 1 0 0 9 Jan 25 14:22 ip-X.X.
~~~
This was supposed to be a 250GiB volume but is 100 GiB.
~~~
$ egrep -i "sdn|iscsi" sos_commands/
localhost iscsid[4235]: Connection-1:0 to [target: iqn.2002-
localhost kernel: sd 12:0:0:6: [sdn] 83886080 512-byte logical blocks: (42.9 GB/40.0 GiB)
localhost kernel: sd 12:0:0:6: [sdn] 4096-byte physical blocks
localhost kernel: sd 12:0:0:6: [sdn] Write Protect is off
localhost kernel: sd 12:0:0:6: [sdn] Mode Sense: 8f 00 00 08
localhost kernel: sd 12:0:0:6: [sdn] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
localhost kernel: sdn: sdn1 sdn2
localhost kernel: sd 12:0:0:6: [sdn] Attached SCSI disk
localhost lvm[38901]: WARNING: PV Y4RnUu-
localhost kernel: iscsi: registered transport (bnx2i)
localhost iscsid[4235]: Connection-1:0 to [target: iqn.2002-
localhost iscsid[4235]: Connection-1:0 to [target: iqn.2002-
localhost kernel: sd 14:0:0:10: [sdn] 209715200 512-byte logical blocks: (107 GB/100 GiB)
localhost kernel: sd 14:0:0:10: [sdn] 4096-byte physical blocks
localhost kernel: sd 14:0:0:10: [sdn] Write Protect is off
localhost kernel: sd 14:0:0:10: [sdn] Mode Sense: 8f 00 00 08
localhost kernel: sd 14:0:0:10: [sdn] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
localhost kernel: sdn: sdn1
localhost kernel: sd 14:0:0:10: [sdn] Attached SCSI disk
localhost kernel: sd 14:0:0:10: [sdn] 209715200 512-byte logical blocks: (107 GB/100 GiB)
localhost kernel: sd 14:0:0:10: [sdn] 4096-byte physical blocks
localhost kernel: sd 14:0:0:10: [sdn] Write Protect is off
localhost kernel: sd 14:0:0:10: [sdn] Mode Sense: 8f 00 00 08
localhost kernel: sd 14:0:0:10: [sdn] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
localhost kernel: sdn: sdn1
localhost kernel: sd 14:0:0:10: [sdn] Attached SCSI disk
localhost kernel: sd 14:0:0:10: [sdn] 209715200 512-byte logical blocks: (107 GB/100 GiB)
localhost kernel: sd 14:0:0:10: [sdn] 4096-byte physical blocks
localhost kernel: sd 14:0:0:10: [sdn] Write Protect is off
localhost kernel: sd 14:0:0:10: [sdn] Mode Sense: 8f 00 00 08
localhost kernel: sd 14:0:0:10: [sdn] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
localhost kernel: sdn: sdn1
localhost kernel: sd 14:0:0:10: [sdn] Attached SCSI disk
localhost kernel: sdn: sdn1
Note I found one disk in scsi layer which is 250 gb and not used by any instance, mostly this is what instance should use :
~~~
localhost kernel: sd 14:0:0:1: [sdr] 524288000 512-byte logical blocks: (268 GB/250 GiB)
localhost kernel: sd 14:0:0:1: [sdr] 524288000 512-byte logical blocks: (268 GB/250 GiB)
localhost kernel: sd 14:0:0:1: [sdr] 524288000 512-byte logical blocks: (268 GB/250 GiB)
localhost kernel: sd 14:0:0:1: [sdr] 524288000 512-byte logical blocks: (268 GB/250 GiB)
~~~
Interestingly for instance : bbbbbbbb-
is what customer reported i.e. after a stop start instance got changed
from windows to ubuntu.
$ nova instance-
+------
| Action | Request_ID | Message | Start_Time |
+------
| create | req-753c2b04-
| live-migration | req-2300d864-
| live-migration | req-af552fc7-
| live-migration | req-d0db117e-
| stop | req-99bb7212-
| start | req-489aa37e-
| stop | req-7e93ee3e-
| start | req-410fe17d-
| stop | req-b57acf82-
| start | req-89d78197-
+------
So basically from what I understand the instance is using wrong scsi disk.
DB query output
MariaDB [(none)]> use nova ;
MariaDB [nova]> select * from instances where uuid='cccccccc-
*******
display_
default_
default_
shutdown_
ephemeral
1 row in set (0.00 sec)
ERROR: No query specified
MariaDB [nova]> select * from block_device_
*******
delete_
connectio
destinatio
1 row in set (0.00 sec)
ERROR: No query specified
MariaDB [nova]> select * from instances where uuid='bbbbbbbb-
*******
display_
default_
default_
shutdown_
ephemeral
1 row in set (0.00 sec)
ERROR: No query specified
MariaDB [nova]> select * from block_device_
*******
delete_
connectio
destinatio
1 row in set (0.00 sec)
Changed in cinder: | |
status: | New → Incomplete |
Hi Jon
Are you using master o another release?
Cheers,
Sofia