Restore playbook is not reentrant
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX | New | Low | Unassigned | | |
Bug Description
Brief Description
-----------------
The restore_
Severity
--------
Minor: System/Feature is usable with minor issue
Steps to Reproduce
------------------
- Run restore with any config that will make bootstrap fail
Expected Behavior
------------------
Playbook fails but can be re-executed
Actual Behavior
----------------
The playbook refuses to run again and fails with "Restore is already in progress!", forcing either re-installation of the system or manual removal of the flag file (for experts)
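The failure mode can be illustrated with a minimal Python sketch of a flag-file guard. This is not the playbook's actual code: the function names, the flag file name, and the cleanup-on-failure step are all hypothetical, chosen only to show how a leftover flag blocks a second run and how removing it on failure would make the operation re-executable. What happens to the flag after a successful run is outside this sketch.

```python
import tempfile
from pathlib import Path


class RestoreInProgress(RuntimeError):
    """Raised when the in-progress flag is already present."""


def run_restore(flag: Path, bootstrap) -> None:
    # Guard: refuse to start if a previous run left the flag behind.
    if flag.exists():
        raise RestoreInProgress("Restore is already in progress!")
    flag.touch()
    try:
        bootstrap()
    except Exception:
        # Hypothetical fix: drop the flag on failure so a retry can start.
        flag.unlink(missing_ok=True)
        raise
    # On success the flag is left in place (assumption for this sketch).


def failing_bootstrap():
    # Stand-in for the bootstrap failure seen in the first run.
    raise ValueError("'target_backup_dir' is undefined")


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        flag = Path(tmp) / "restore_flag"  # hypothetical file name
        try:
            run_restore(flag, failing_bootstrap)
        except ValueError:
            pass
        cleaned_after_failure = not flag.exists()
        # The second run is accepted because the flag was cleaned up.
        run_restore(flag, lambda: None)
        return cleaned_after_failure, flag.exists()
```

Without the `except` branch, the second call in `demo()` would raise `RestoreInProgress`, which mirrors the behavior reported here.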
Reproducibility
---------------
10/10 when the restore fails during bootstrap tasks
System Configuration
-------
AIO-SX (Might happen on all configs)
Branch/Pull Time/Commit
-------
2022-08-22
Last Pass
---------
N/A
Timestamp/Logs
--------------
### 1st run failed because of this task
2022-08-23 20:16:43,861 p=1209313 u=tbrito n=ansible | TASK [bootstrap/
2022-08-23 20:16:43,874 p=1209313 u=tbrito n=ansible | fatal: [lab_vbox_2]: FAILED! => {
"msg": "The task includes an option with an undefined variable. The error was: 'target_backup_dir' is undefined\n\nThe error appears to be in '/home/
}
2022-08-23 20:16:43,876 p=1209313 u=tbrito n=ansible | PLAY RECAP *******
2022-08-23 20:16:43,876 p=1209313 u=tbrito n=ansible | lab_vbox_2 : ok=299 changed=122 unreachable=0 failed=1 skipped=311 rescued=0 ignored=0
### 2nd run, after defining target_backup_dir, failed here:
2022-08-24 09:54:16,896 p=1338321 u=tbrito n=ansible | TASK [restore-
2022-08-24 09:54:16,908 p=1338754 u=tbrito n=ansible | <10.127.130.10> ESTABLISH SSH CONNECTION FOR USER: sysadmin
2022-08-24 09:54:16,912 p=1338754 u=tbrito n=ansible | <10.127.130.10> SSH: EXEC sshpass -d11 ssh -C -o ControlMaster=auto -o ControlPersist=60s -o Port=10200 -o 'User="sysadmin"' -o ConnectTimeout=10 -o ControlPath=
2022-08-24 09:54:17,008 p=1338754 u=tbrito n=ansible | <10.127.130.10> (0, b'/home/
2022-08-24 09:54:17,010 p=1338754 u=tbrito n=ansible | <10.127.130.10> ESTABLISH SSH CONNECTION FOR USER: sysadmin
2022-08-24 09:54:17,012 p=1338754 u=tbrito n=ansible | <10.127.130.10> SSH: EXEC sshpass -d11 ssh -C -o ControlMaster=auto -o ControlPersist=60s -o Port=10200 -o 'User="sysadmin"' -o ConnectTimeout=10 -o ControlPath=
2022-08-24 09:54:17,168 p=1338754 u=tbrito n=ansible | <10.127.130.10> (0, b'ansible-
2022-08-24 09:54:17,173 p=1338754 u=tbrito n=ansible | Using module file /tmp/tbrito_
2022-08-24 09:54:17,175 p=1338754 u=tbrito n=ansible | <10.127.130.10> PUT /home/tbrito/
2022-08-24 09:54:17,177 p=1338754 u=tbrito n=ansible | <10.127.130.10> SSH: EXEC sshpass -d11 sftp -o BatchMode=no -b - -C -o ControlMaster=auto -o ControlPersist=60s -o Port=10200 -o 'User="sysadmin"' -o ConnectTimeout=10 -o ControlPath=
2022-08-24 09:54:17,526 p=1338754 u=tbrito n=ansible | <10.127.130.10> (0, b'sftp> put /home/tbrito/
2022-08-24 09:54:17,528 p=1338754 u=tbrito n=ansible | <10.127.130.10> ESTABLISH SSH CONNECTION FOR USER: sysadmin
2022-08-24 09:54:17,530 p=1338754 u=tbrito n=ansible | <10.127.130.10> SSH: EXEC sshpass -d11 ssh -C -o ControlMaster=auto -o ControlPersist=60s -o Port=10200 -o 'User="sysadmin"' -o ConnectTimeout=10 -o ControlPath=
2022-08-24 09:54:17,634 p=1338754 u=tbrito n=ansible | <10.127.130.10> (0, b'', b'')
2022-08-24 09:54:17,635 p=1338754 u=tbrito n=ansible | <10.127.130.10> ESTABLISH SSH CONNECTION FOR USER: sysadmin
2022-08-24 09:54:17,635 p=1338754 u=tbrito n=ansible | <10.127.130.10> SSH: EXEC sshpass -d11 ssh -C -o ControlMaster=auto -o ControlPersist=60s -o Port=10200 -o 'User="sysadmin"' -o ConnectTimeout=10 -o ControlPath=
2022-08-24 09:54:17,821 p=1338754 u=tbrito n=ansible | Escalation succeeded
2022-08-24 09:54:18,207 p=1338754 u=tbrito n=ansible | <10.127.130.10> (0, b'\r\n\
2022-08-24 09:54:18,210 p=1338754 u=tbrito n=ansible | <10.127.130.10> ESTABLISH SSH CONNECTION FOR USER: sysadmin
2022-08-24 09:54:18,212 p=1338754 u=tbrito n=ansible | <10.127.130.10> SSH: EXEC sshpass -d11 ssh -C -o ControlMaster=auto -o ControlPersist=60s -o Port=10200 -o 'User="sysadmin"' -o ConnectTimeout=10 -o ControlPath=
2022-08-24 09:54:18,313 p=1338754 u=tbrito n=ansible | <10.127.130.10> (0, b'', b'')
2022-08-24 09:54:18,317 p=1338321 u=tbrito n=ansible | ok: [lab_vbox_2] => {
"changed": false,
"invocation": {
"path": "/etc/platform/
}
},
"stat": {
"atime": 1661284731.32077,
],
"blocks": 0,
"charset": "binary",
"checksum": "da39a3ee5e6b4b
"ctime": 1661284731.32077,
"dev": 2052,
"exists": true,
"gid": 0,
"gr_name": "root",
"inode": 1053275,
"isblk": false,
"ischr": false,
"isdir": false,
"isfifo": false,
"isgid": false,
"islnk": false,
"isreg": true,
"issock": false,
"isuid": false,
"mimetype": "inode/x-empty",
"mode": "0644",
"mtime": 1661284731.32077,
"nlink": 1,
"path": "/etc/platform/
"pw_name": "root",
"readable": true,
"rgrp": true,
"roth": true,
"rusr": true,
"size": 0,
"uid": 0,
"version": "3903128544",
"wgrp": false,
"woth": false,
"wusr": true,
"xgrp": false,
"xoth": false,
"xusr": false
}
}
2022-08-24 09:54:18,324 p=1338321 u=tbrito n=ansible | Read vars_file 'vars/common/
2022-08-24 09:54:18,325 p=1338321 u=tbrito n=ansible | Read vars_file 'host_vars/
2022-08-24 09:54:18,326 p=1338321 u=tbrito n=ansible | TASK [restore-
2022-08-24 09:54:18,349 p=1338321 u=tbrito n=ansible | fatal: [lab_vbox_2]: FAILED! => {
"changed": false,
"msg": " Restore is already in progress!"
}
2022-08-24 09:54:18,351 p=1338321 u=tbrito n=ansible | PLAY RECAP *******
2022-08-24 09:54:18,352 p=1338321 u=tbrito n=ansible | lab_vbox_2 : ok=22 changed=6 unreachable=0 failed=1 skipped=28 rescued=0 ignored=0
Test Activity
-------------
Developer Testing
Workaround
----------
- sudo rm /etc/platform/
- Retry running restore_
Changed in starlingx:
importance: Undecided → Low
tags: added: stx.update