[CI] Retry headers check and threading tests in case of failure. by serban-nicusor-toptal · Pull Request #2982 · stan-dev/math

serban-nicusor-toptal · 2023-12-06T14:35:54Z

Summary

Retry headers check and threading tests in case of failure during CI.

Tests

Side Effects

Are there any side effects that we should be aware of?
No

Release notes

Checklist

Copyright holder: (fill in copyright holder information)

The copyright holder is typically you or your assignee, such as a university or company. By submitting this pull request, the copyright holder is agreeing to the license the submitted work under the following licenses:
- Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
- Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
the basic tests are passing
- unit tests pass (to run, use: ./runTests.py test/unit)
- header checks pass, (make test-headers)
- dependencies checks pass, (make test-math-dependencies)
- docs build, (make doxygen)
- code passes the built in C++ standards checks (make cpplint)
the code is written in idiomatic C++ and changes are documented in the doxygen
the new changes are tested

serban-nicusor-toptal · 2023-12-06T14:47:10Z

Hey @WardBrian the change for headers check is quite straightforward, just retry once.
About your earlier question:

The other thing that sometimes randomly fails is the thread tests sometimes don't build.
If this is due to some sort of runner incompatibility, will the retry pick up the same runner again?

The retry in it's simplest form seems to handle simple blocks, tho it seems to be possible to use it for a stage-wide failure and retry https://community.jenkins.io/t/how-to-retry-a-jenkins-pipeline-stage-with-an-agent-condition/3667/4
TL;DR; detecting agent issues and if that's the case, find a new agent and try again.

Now while that's possible I don't think it might be best to use in our case, let me explain.
We leverage Docker to ship our CI images with all the dependencies, so irrelevant of the agent we will always run the same thing. The only case where differences can occur is when the code is trying to read the kernel or some low-level CPU instructions (docker containers share the host kernel). Rember we had an issue at the beginning with Flatiron that an Intel CPU was not supporting some CPU instructions ? (or was it a GPU?)

Knowing this, I think it might be more simple and straightforward to detect on which host the threading tests fail and simply exclude it in an agent label conditional, thus always running on the valid one.
Could these threading failures occur because of two concurrent runs on the same host ? a develop building running at the same time with a PR build.

Because of the above train of thought I put it in a simple retry block for now, please let me know what do you think about this! Thanks!

syclik

LGTM

Retry headers check and threading tests in case of failure.

0fceb6a

serban-nicusor-toptal requested a review from WardBrian December 6, 2023 14:47

syclik approved these changes Dec 15, 2023

View reviewed changes

serban-nicusor-toptal merged commit 56cc817 into develop Dec 17, 2023

syclik pushed a commit that referenced this pull request Jan 3, 2024

Retry headers check and threading tests in case of failure. (#2982)

c9df636

WardBrian deleted the ci-retry-stages branch August 5, 2024 00:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CI] Retry headers check and threading tests in case of failure.#2982

[CI] Retry headers check and threading tests in case of failure.#2982
serban-nicusor-toptal merged 1 commit into
developfrom
ci-retry-stages

serban-nicusor-toptal commented Dec 6, 2023

Uh oh!

serban-nicusor-toptal commented Dec 6, 2023

Uh oh!

syclik left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

serban-nicusor-toptal commented Dec 6, 2023

Summary

Tests

Side Effects

Release notes

Checklist

Uh oh!

serban-nicusor-toptal commented Dec 6, 2023

Uh oh!

syclik left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants