test(api-core): re-enable the three rack-handler tests against current behavior by chet · Pull Request #2735 · NVIDIA/infra-controller

chet · 2026-06-22T05:51:36Z

The three rack state-controller tests surfaced by #2694 were #[ignore]d because they assert an old state-machine vocabulary the handlers no longer use -- not because of any handler bug (confirmed: created.rs/discovering.rs are correct, corroborated by adjacent passing tests). They now pin current behavior and run again:

test_expected_incomplete_device_counts_stays → asserts Wait (was DoNothing).
test_discovering_waits_for_compute_ready → renamed test_discovering_waits_when_compute_not_ready, asserts Wait (was Err).
test_expected_more_discovered_than_expected_transitions → now seeds two hosts against the single-compute profile so over-discovery genuinely exceeds expected, keeping the Transition(Discovering) assertion; dropped the dead mac1 line + stale comment.

No production change. Verified: cargo test -p carbide-api-core --lib -- rack_state_controller::handler → 42 passed, 0 ignored; clippy --locked --all-targets --all-features + nightly rustfmt clean.

Addresses #2715.

…t behavior Three rack state-controller tests were `#[ignore]`d because they asserted an old state-machine vocabulary the handlers no longer use -- not because of any handler bug. They now pin the handlers' actual, intended behavior and run again. `created.rs` returns `Wait` (not `DoNothing`) while device counts are below the profile's expectation and transitions once they meet or exceed it; `discovering.rs` waits for compute to become ready rather than faulting on a missing host. So `test_expected_incomplete_device_counts_stays` now asserts `Wait`; `test_discovering_waits_for_compute_ready` becomes `test_discovering_waits_when_compute_not_ready` and asserts `Wait`; and `test_expected_more_discovered_than_expected_transitions` now seeds two hosts against the single-compute profile so it genuinely exercises the over-discovery transition it always claimed to test. No production change -- the handlers were already correct, corroborated by the adjacent passing tests. Addresses NVIDIA#2715. Signed-off-by: Chet Nichols III <chetn@nvidia.com>

chet · 2026-06-22T05:51:39Z

@coderabbitai PTAL, thanks!

copy-pr-bot · 2026-06-22T05:51:39Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

coderabbitai · 2026-06-22T05:51:43Z

Important

Review skipped

No new commits to review since the last review.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 67322df3-ccf3-413b-b414-de2309713ebb

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

Walkthrough

Three previously ignored rack state-controller handler tests are re-enabled by removing their #[ignore] attributes. Their assertions are updated to expect StateHandlerOutcome::Wait in place of prior error or stale outcomes, and the over-discovery test receives a reworked fixture that seeds two managed host records to correctly exceed the expected compute minimum.

Changes

Rack State-Controller Test Fixes

Layer / File(s)	Summary
Re-enable incomplete device counts test `crates/api-core/src/tests/rack_state_controller/handler.rs`	Removes `#[ignore]` from `test_expected_incomplete_device_counts_stays` and updates the assertion to `StateHandlerOutcome::Wait { .. }` with "wait in Created" messaging.
Re-enable over-discovery test with fixture rework `crates/api-core/src/tests/rack_state_controller/handler.rs`	Removes `#[ignore]` from `test_expected_more_discovered_than_expected_transitions`, seeds two managed host fixtures under the "Single" rack profile to exceed the expected compute minimum, and removes a stale ordering-branch comment.
Rename and fix Discovering-state wait test `crates/api-core/src/tests/rack_state_controller/handler.rs`	Renames the test to `test_discovering_waits_when_compute_not_ready`, removes `#[ignore]`, clarifies the rack setup intent by documenting absence of Ready/Assigned compute states, and replaces the `result.is_err()` assertion with `StateHandlerOutcome::Wait { .. }`.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Possibly related issues

Re-enable or correct 3 rack-handler state-transition tests that fail against current behavior #2715: The three re-enabled tests and corrected assertions directly address the failing rack state-controller tests identified in this issue.
Restore coverage from four ignored rack-handler tests and an always-true assertion #2694: The removal of #[ignore] attributes and the updated expectations align precisely with the scope specified in this issue.

Possibly related PRs

NVIDIA/infra-controller#2716: Modifies the same rack state-controller test file with analogous changes to #[ignore] removal and expected transition assertion updates.

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately describes the primary change: re-enabling three previously ignored rack-handler tests with updated behavior assertions.
Description check	✅ Passed	The description comprehensively details the rationale, specific test changes, and validation results, clearly relating to the changeset.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai · 2026-06-22T05:52:47Z

Tip

For best results, initiate chat on the files or code changes.

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
set -euo pipefail

echo "=== PR Details ==="
gh pr view 2735 --repo NVIDIA/infra-controller --json title,body,files,additions,deletions,state

echo ""
echo "=== PR Diff ==="
gh pr diff 2735 --repo NVIDIA/infra-controller

Length of output: 7570

@chet: The changes are clean and well-reasoned. Here is my assessment:

Overall: Approved — test-only, no production risk.

Per-test review

Test	Assessment
`test_expected_incomplete_device_counts_stays`	✅ Straightforward `DoNothing → Wait` correction. Error message updated to match.
`test_expected_more_discovered_than_expected_transitions`	✅ Significant improvement. The old path called `db_rack::update` on a connection that had already gone out of meaningful scope and never actually seeded compute hosts — the test was asserting nothing real. Two `new_host()` calls now genuinely exercise the over-discovery threshold. The explicit `drop(txn)` before those calls is correct.
`test_discovering_waits_when_compute_not_ready`	✅ The rename is more precise. Switching from `result.is_err()` to `.await?` + `matches!(outcome, Wait)` is a tighter contract: if the handler faults, the test now fails with an error rather than silently passing as a false positive.

One minor observation

In test_discovering_waits_when_compute_not_ready, the assertion diagnostic uses:

"Discovering should wait when compute hosts are not yet ready, got {:?}",
std::mem::discriminant(&outcome)

std::mem::discriminant yields an opaque integer, not a variant name. If StateHandlerOutcome derives Debug, replacing this with {outcome:?} would give a more actionable failure message. Non-blocking — the test logic itself is correct.

[approve]

chet · 2026-06-22T06:25:06Z

@coderabbitai review

coderabbitai · 2026-06-22T06:25:13Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

chet · 2026-06-22T06:49:44Z

@coderabbitai full review

coderabbitai · 2026-06-22T06:49:50Z

✅ Action performed

Full review finished.

chet · 2026-06-22T07:41:27Z

@coderabbitai review

coderabbitai · 2026-06-22T07:41:34Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

github-actions · 2026-06-22T07:59:56Z

🔍 Container Scan Summary

Service	Total	Critical	High	Medium	Low	Other
boot-artifacts-aarch64	3	0	0	3	0	0
boot-artifacts-x86_64	3	0	0	3	0	0
forge-admin-cli-x86_64	264	6	23	99	6	130
machine-validation-runner	704	34	183	258	35	194
machine_validation	704	34	183	258	35	194
nvmetal-carbide	704	34	183	258	35	194
TOTAL	2382	108	572	879	111	712

Per-CVE detail lives in the per-service grype-* artifacts (JSON + SARIF). Severity counts only — no CVE IDs published here.

chet marked this pull request as ready for review June 22, 2026 06:35

chet requested a review from a team as a code owner June 22, 2026 06:35

bcavnvidia approved these changes Jun 22, 2026

View reviewed changes

chet merged commit 4acacdb into NVIDIA:main Jun 22, 2026
55 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(api-core): re-enable the three rack-handler tests against current behavior#2735

test(api-core): re-enable the three rack-handler tests against current behavior#2735
chet merged 1 commit into
NVIDIA:mainfrom
chet:gh-issue-2715

chet commented Jun 22, 2026

Uh oh!

chet commented Jun 22, 2026

Uh oh!

copy-pr-bot Bot commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

Review skipped

Uh oh!

coderabbitai Bot commented Jun 22, 2026

Uh oh!

chet commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

Uh oh!

chet commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

Uh oh!

chet commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chet commented Jun 22, 2026

Uh oh!

chet commented Jun 22, 2026

Uh oh!

copy-pr-bot Bot commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Possibly related issues

Possibly related PRs

Uh oh!

coderabbitai Bot commented Jun 22, 2026

Per-test review

One minor observation

Uh oh!

chet commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chet commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chet commented Jun 22, 2026

Uh oh!

coderabbitai Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 22, 2026

🔍 Container Scan Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading

coderabbitai Bot commented Jun 22, 2026 •

edited

Loading