Workers can die on transient issues such as bug 1474729. They should be restarted automatically when they have not run for an hour, using cron jobs and lock files (until we have systemd, which makes this simpler).
This requires setting up mail on the worker boxes.
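
A minimal sketch of the cron-plus-lock-file approach, in Python. It assumes the worker touches a heartbeat file each time it runs; that file, the lock path, the restart command, and the function names are all hypothetical and not taken from the existing worker setup. The lock keeps two overlapping cron runs from restarting the worker at the same time.

```python
import fcntl
import os
import subprocess
import time

STALE_AFTER_SECONDS = 60 * 60  # "an hour", per the description above


def is_stale(heartbeat_path, now=None, max_age=STALE_AFTER_SECONDS):
    """Return True if the heartbeat file is missing or older than max_age."""
    now = time.time() if now is None else now
    try:
        mtime = os.path.getmtime(heartbeat_path)
    except FileNotFoundError:
        # Assumption: no heartbeat at all means the worker never ran.
        return True
    return now - mtime > max_age


def restart_if_stale(heartbeat_path, lock_path, restart_cmd, now=None):
    """Run restart_cmd if the worker looks dead.

    Returns "locked" if another watchdog run holds the lock, "ok" if the
    heartbeat is fresh, and "restarted" if restart_cmd was run.
    """
    with open(lock_path, "w") as lock_file:
        try:
            # Non-blocking exclusive lock; released when the file is closed.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return "locked"
        if not is_stale(heartbeat_path, now=now):
            return "ok"
        # restart_cmd is a placeholder for whatever starts the worker.
        subprocess.run(restart_cmd, check=True)
        return "restarted"
```

A cron job could call `restart_if_stale` every few minutes from a small wrapper script and print a message only when it returns "restarted". Since cron normally mails any output a job produces, that message would then arrive as a notification once mail is set up on the boxes.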