Robustify against transient worker failures
Bug #1474734 reported by
Martin Pitt
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Auto Package Testing |
Fix Released
|
Medium
|
Martin Pitt |
Bug Description
Workers can die on transient issues like bug 1474729. They should be auto-restarted when they don't run for an hour, with cron jobs and lock files (until we get systemd which makes this simpler). It should also send out a notification mail to the admins (me, Adam, Iain).
This requires setting up mail on the worker boxes.
Changed in auto-package-testing: | |
status: | New → Triaged |
importance: | Undecided → Medium |
assignee: | nobody → Martin Pitt (pitti) |
description: | updated |
To post a comment you must log in.
https:/ /git.launchpad. net/~pitti/ +git/autopkgtes t-cloud/ commit/ ?id=3d5264f545 provides the auto-restart via upstartification.