Suppressing some tests which fail in CI #3799

jglick · 2018-12-10T22:19:09Z

Various tests which have prevented us from having a stable master build since Nov 27, which is intolerable. Exempting this DisablePluginCommandTest test where it just looks like all hell broke loose, not the fault of the test.

core/src/test/java/hudson/FilePathSEC904Test.java

jglick · 2018-12-10T22:19:47Z

core/src/test/java/hudson/UtilTest.java

        assertFalse("f1 exists", f1.exists());
    }

+    @Ignore("TODO often fails in CI")


Has long been flaky. #3787 forgot this one.

Perhaps better to assume not running in a CI environment? https://github.com/auchenberg/volkswagen/ light?

Already in progress for this one in #3721. Otherwise we will have 2 @Ignore and the compilation will fail. (Sorry "ignore" user for the ping)

Well, would just be a trivial merge conflict.

jglick · 2018-12-10T22:20:05Z

test/src/test/java/hudson/PluginSEC925Test.java

    @Rule
    public JenkinsRule r = new JenkinsRule();

+    @Ignore("TODO observed to fail in CI with 404")


Some sort of network outage, it seems.

This one seems unnecessary, haven't seen it fail a substantial number of times.

But I have seen it fail for reasons which were clearly tied to issues completely unrelated to core code. There are other UC-dependent tests which have been suppressed for the same reason.

jglick · 2018-12-10T22:20:26Z

test/src/test/java/hudson/cli/CLITest.java

        }
    }

+    @Ignore("TODO sometimes fails, in CI & locally")


#3708 does not seem to have worked reliably.

jglick · 2018-12-10T22:20:39Z

test/src/test/java/hudson/util/AtomicFileWriterPerfTest.java

     * So using slightly more than the worse value obtained above should avoid making this flaky and still catch
     * <strong>really</strong> bad performance regressions.
     */
+    @Ignore("TODO often fails in CI")


#3797 (review)

As above, perhaps just assume not running in CI?

I like that idea, because this test is unlikely to ever fail in a normal environment (I mean something built after ~1995), and at the same time I feel a bit bad if tests run locally are not the same as in CI. I see potential interesting headaches for some people if some tests fail locally but not in CI (though I suppose a comment would help about this...)

this test is unlikely to ever fail in a normal environment

Possibly so, but performance tests are not suitable as merge/deploy gates—I could be patching README.md and the build might be marked as a failure for reasons unrelated to my change. Or, as in #3778, I could be trying to get a binary published for some unrelated fix and be stymied by this flake.

Better to have these in a separate job (repo?) where they are run say on a daily basis with some kind of tooling to track trends and alert developers to consistent regressions.

daniel-beck · 2018-12-10T23:38:16Z

In general I'm not positive tooling picks up TODO as part of strings, would prefer an additional // TODO comment on the same line to ensure as much visibility as possible.

Other than that and inline suggestions for a few of these, 👍

Fixed in jenkinsci#3795

batmat

Despite a few ignores are probably too aggressive, as Daniel says some have not failed since some time, I think we can move forward here.
I think given the current situation where we didn't get a green build on the master branch since quite long, possibly a radical fix like this one is the way to go, then we take the time to fix it/re-enable tests one by one.

Overall, I think we should define a process more common for this. Flakes are a pain, and we should probably just go some way like either @Ignore them as soon as they behave flakily, and work on these to see if they can be later reintroduced, or move them to a special profile that would be run in a dedicated test run that would not fail the build, but would help us have a look afterwards if needed.

@jglick if this gets more approval and gets merged, could you please file a JIRA to work on fixing these flakes and ping me there? I'd like to see what we can do to improve the situation, once the master branch has durably gotten back to green, which is my first goal approving here.

Thanks!

jglick · 2018-12-11T14:34:08Z

we should probably just go some way like either @Ignore them as soon as they behave flakily, and [various options]

Yes, that would be my preference. Get the master build blue ASAP, then work at our leisure on reintroducing test coverage which still seems relevant.

jglick · 2018-12-11T15:28:57Z

Not waiting for build 4 since build 2 was stable and there have been only trivial changes since then (which could only affect test compilation, which has already passed).

jglick · 2018-12-11T15:32:42Z

could you please file a JIRA to work on fixing these flakes

@batmat JENKINS-55122 and thanks!

This reverts commit 2656f7b, reversing changes made to d60a544.

daniel-beck · 2019-04-09T17:24:40Z

test/src/test/java/hudson/PluginSEC925Test.java

    @Rule
    public JenkinsRule r = new JenkinsRule();

+    @Ignore("TODO observed to fail in CI with 404 due to external UC issues")


Should perhaps get a workaround applied similar to #3962?

Possibly. Does not look like I thought to record a copy of the failure message. Probably best to rewrite tests like these to not attempt to make an Internet connection to begin with, and instead use some hard-coded example JSON data.

Suppressing some tests which fail in CI.

50df563

jglick requested review from Wadeck, batmat and fcojfernandez December 10, 2018 22:19

jglick commented Dec 10, 2018

View reviewed changes

This was referenced Dec 10, 2018

[JENKINS-26677] Avoid using ServletException from SlaveComputer since it can break calls to SlaveComputer.getChannelToMaster #3778

Merged

Bump to latest 1.50 parent pom release #3783

Merged

Un-ignore test that was fixed

0ee685e

Fixed in jenkinsci#3795

batmat approved these changes Dec 11, 2018

View reviewed changes

Revert the whitespace addition as well

99835d6

Wadeck approved these changes Dec 11, 2018

View reviewed changes

batmat mentioned this pull request Dec 11, 2018

@Ignore AtomicFileWriterPerfTest #3797

Closed

1 task

Clarifying reason for suppression.

25823d9

jglick merged commit 2656f7b into jenkinsci:master Dec 11, 2018

jglick deleted the failing-tests branch December 11, 2018 15:29

jglick added a commit to jglick/jenkins that referenced this pull request Dec 12, 2018

Revert "Merge pull request jenkinsci#3799 from jglick/failing-tests"

9f86a60

This reverts commit 2656f7b, reversing changes made to d60a544.

daniel-beck reviewed Apr 9, 2019

View reviewed changes

Uh oh!

Suppressing some tests which fail in CI #3799

Suppressing some tests which fail in CI #3799

Uh oh!

Conversation

jglick commented Dec 10, 2018

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Wadeck Dec 11, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

daniel-beck commented Dec 10, 2018

Uh oh!

batmat left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jglick commented Dec 11, 2018

Uh oh!

jglick commented Dec 11, 2018

Uh oh!

jglick commented Dec 11, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Wadeck Dec 11, 2018 •

edited

Loading

batmat left a comment •

edited

Loading