Job errors with "ErrorMessage: None", possibly due to job timeout
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
LAVA Dispatcher |
Confirmed
|
Medium
|
Tyler Baker | ||
linaro-networking |
Won't Fix
|
Medium
|
Unassigned |
Bug Description
The following job:
https:/
reports back with:
Dispatcher error or test failure
ErrorMessage: None
The job itself eventually runs the `jtreg' tests which can be seen in
the output starting at:
11:50:45 AM INFO: [ACTION-B] lava_test_shell is started with {u'timeout': 180000, u'testdef_repos': [{u'git-repo': u'git...
and the timeout for that test is 50 hours which, as an aside, I'm
suspecting is not long enough for this particular test/job. I would have
expected LAVA to fail with a `timeout error' but I don't see that.
And looking through the test runs I see that test 'lava_test_shell #13'
fails with:
Lava failed at action lava_test_shell with error:None
Traceback (most recent call last):
File "/srv/lava<email address hidden>
action.
File "/srv/lava<email address hidden>
while self._keep_
File "/srv/lava<email address hidden>
idx = runner.
File "/srv/lava<email address hidden>
return super(logging_
File "/srv/lava/
return self.expect_
File "/srv/lava/
return self.expect_
File "/srv/lava/
c = self.read_
File "/srv/lava/
r,w,e = self.__
File "/srv/lava/
return select.select (iwtd, owtd, ewtd, timeout)
KeyboardInterrupt
Where is KeyboardInterrupt coming from?
The other thing I notice is that the job itself seems to run for exactly
24 hours (and not 50 hours as specified for the jtreg test case).
From:
Started at: Sept. 15, 2013, 7:34 a.m.
Finished at: Sept. 16, 2013, 7:34 a.m.
summary: |
- Job errors with "ErrorMessage: None", possibly due to job timeot + Job errors with "ErrorMessage: None", possibly due to job timeout |
description: | updated |
Changed in linaro-networking: | |
status: | New → Confirmed |
importance: | Undecided → Medium |
Changed in lava-dispatcher: | |
assignee: | Dave Pigott (dpigott) → Tyler Baker (tyler-baker) |
Changed in linaro-networking: | |
status: | Confirmed → Won't Fix |
I can't find anywhere in the message which particular target is in effect
when this happens. Any clue?
On 16 September 2013 08:20, Andrew McDermott <email address hidden>wrote:
> ** Description changed: /validation. linaro. org/scheduler/ job/72876 /validation. linaro. org/scheduler/ job/72876 /validation. linaro. org/dashboard/ streams/ public/ team/linaro/ pre- 4/bundles/ 0a75f2ca9c78f77 fbadf327f8371ea 7e5f7287c5 1dd9-11e3- 8b3b-001e671608 cd/result/ 13/ /lava_dispatche r/job.py" , run(**params) /lava_dispatche r/actions/ lava_test_ shell.py" , running( runner, timeout, signal_director): /lava_dispatche r/actions/ lava_test_ shell.py" , _connection. expect( patterns, timeout=timeout) /lava_dispatche r/utils. py", spawn, self).expect(*args, **kw) .cache/ eggs/pexpect- 2.4-py2. 7.egg/pexpect. py", line list(compiled_ pattern_ list, timeout, .cache/ eggs/pexpect- 2.4-py2. 7.egg/pexpect. py", line loop(searcher_ re(pattern_ list), timeout, .cache/ eggs/pexpect- 2.4-py2. 7.egg/pexpect. py", line nonblocking (self.maxread, timeout) .cache/ eggs/pexpect- 2.4-py2. 7.egg/pexpect. py", line select( [self.child_ fd], [], [], timeout) .cache/ eggs/pexpect- 2.4-py2. 7.egg/pexpect. py", line
>
> The following job:
>
> - https:/
> + https:/
>
> reports back with:
>
> - Dispatcher error or test failure
> + Dispatcher error or test failure
>
> - ErrorMessage: None
> + ErrorMessage: None
>
> The job itself eventually runs the `jtreg' tests which can be seen in
> the output starting at:
>
> - 11:50:45 AM INFO: [ACTION-B] lava_test_shell is started with
> + 11:50:45 AM INFO: [ACTION-B] lava_test_shell is started with
> {u'timeout': 180000, u'testdef_repos': [{u'git-repo': u'git...
>
> and the timeout for that test is 50 hours which, as an aside, I'm
> suspecting is not long enough for this particular test/job. I would have
> expected LAVA to fail with a `timeout error' but I don't see that.
>
> And looking through the test runs I see that test 'lava_test_shell #13'
> fails with:
>
> + https:/
> + built-vexpress6
> + /344362fe-
> +
> Lava failed at action lava_test_shell with error:None
> Traceback (most recent call last):
> - File
> "/srv/lava<email address hidden>
> line 254, in run
> - action.
> - File
> "/srv/lava<email address hidden>
> line 559, in run
> - while self._keep_
> - File
> "/srv/lava<email address hidden>
> line 574, in _keep_running
> - idx = runner.
> - File
> "/srv/lava<email address hidden>
> line 206, in expect
> - return super(logging_
> - File "/srv/lava/
> 1316, in expect
> - return self.expect_
> searchwindowsize)
> - File "/srv/lava/
> 1330, in expect_list
> - return self.expect_
> searchwindowsize)
> - File "/srv/lava/
> 1383, in expect_loop
> - c = self.read_
> - File "/srv/lava/
> 820, in read_nonblocking
> - r,w,e = self.__
> - File "/srv/lava/
> 1554, in __select
> - return se...