Skip to content

e2e/gpu: mark TestGPUK8sSuiteUbuntu2204 as flaky#51171

Open
pducolin wants to merge 1 commit into
mainfrom
oncall/flake-gpu-k8s-ubuntu2204
Open

e2e/gpu: mark TestGPUK8sSuiteUbuntu2204 as flaky#51171
pducolin wants to merge 1 commit into
mainfrom
oncall/flake-gpu-k8s-ubuntu2204

Conversation

@pducolin
Copy link
Copy Markdown
Collaborator

What does this PR do?

Marks TestGPUK8sSuiteUbuntu2204 as a known-flaky test using flake.Mark(t).

The CI job new-e2e-gpu: [--run TestGPUK8sSuiteUbuntu2204] is currently failing due to an external dependency being down. This change prevents the failure from blocking the pipeline while the dependency is restored.

Motivation

On-call mitigation. The test itself is sound; the failure is caused by a transient external outage.

External dependency is down causing the job to fail. Mark the test as
flaky so CI retries it rather than hard-failing the pipeline.
@github-actions
Copy link
Copy Markdown
Contributor

@codex review

@pducolin pducolin added changelog/no-changelog No changelog entry needed qa/done QA done before merge and regressions are covered by tests labels May 21, 2026
@dd-octo-sts dd-octo-sts Bot added internal Identify a non-fork PR team/ebpf-platform labels May 21, 2026
@github-actions github-actions Bot added the short review PR is simple enough to be reviewed quickly label May 21, 2026
@pducolin pducolin marked this pull request as ready for review May 21, 2026 15:24
@pducolin pducolin requested a review from a team as a code owner May 21, 2026 15:24
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4e415aee14

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

// TestGPUK8sSuiteUbuntu2204 runs tests for the VM interface to ensure its implementation is correct.
// Not to be run in parallel, as some tests wait until the checks are available.
func TestGPUK8sSuiteUbuntu2204(t *testing.T) {
flake.Mark(t)
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Limit flaky marking to outage-specific failures

Calling flake.Mark(t) at suite entry marks TestGPUK8sSuiteUbuntu2204 as flaky unconditionally, so any future failure in this suite (including real regressions unrelated to the current external outage) is treated as a flaky failure by the test washer (is_flaky_failure returns true on the marker string and process_result then allows the job to succeed). Because this masks all failures rather than the known dependency error, it creates a CI blind spot for GPU Kubernetes coverage until someone removes the marker.

Useful? React with 👍 / 👎.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

changelog/no-changelog No changelog entry needed internal Identify a non-fork PR qa/done QA done before merge and regressions are covered by tests short review PR is simple enough to be reviewed quickly team/ebpf-platform

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants