With rc2 result. It looks like there is a noticeable difference between v4.12-rc3 and v4.12-rc4.
@Joseph, can you please start looking into diffs? I'm keeping one dedicated node just for this testing, so I can run the same script one by one for more bisections.
v4.12-rc1 - bad (3 of 3)
v4.12-rc2 - bad (15 of 53 - 28.3%)
v4.12-rc3 - bad (24 of 90 - 26.6%)
v4.12-rc4 - relatively good (1 of 70 - 1.4%)
v4.12 - relatively good (5 out of 68 - 7.4%)
v4.13 - good (0 out of 41 - 0%)
FWIW, I will run the same test with xenial's 4.4 kernel to make sure around 30% is the base line of "bad".
With rc2 result. It looks like there is a noticeable difference between v4.12-rc3 and v4.12-rc4.
@Joseph, can you please start looking into diffs? I'm keeping one dedicated node just for this testing, so I can run the same script one by one for more bisections.
v4.12-rc1 - bad (3 of 3)
v4.12-rc2 - bad (15 of 53 - 28.3%)
v4.12-rc3 - bad (24 of 90 - 26.6%)
v4.12-rc4 - relatively good (1 of 70 - 1.4%)
v4.12 - relatively good (5 out of 68 - 7.4%)
v4.13 - good (0 out of 41 - 0%)
FWIW, I will run the same test with xenial's 4.4 kernel to make sure around 30% is the base line of "bad".