Skip to content

[Test] Fix intermittent failure in test_dynamic_file_system_update.#7418

Open
gmarciani wants to merge 1 commit into
aws:developfrom
gmarciani:wip/mgiacomo/3160/test-dfms-fix-0602-1
Open

[Test] Fix intermittent failure in test_dynamic_file_system_update.#7418
gmarciani wants to merge 1 commit into
aws:developfrom
gmarciani:wip/mgiacomo/3160/test-dfms-fix-0602-1

Conversation

@gmarciani
Copy link
Copy Markdown
Contributor

@gmarciani gmarciani commented Jun 2, 2026

Description of changes

Fix intermittent failure in test_dynamic_file_system_update.
The test could fail because the external EFS has chances to deploy mount targets in AZs that are not used by the cluster.
With this fix we deploy the storage in the private subnets used by the compute nodes (one per AZ), so that the external EFS has a mount target in every AZ used by the cluster.

Tests

ONGOING

test-suites:
  update:
    test_update.py::test_dynamic_file_systems_update:
      dimensions:
        - instances:
            - c5.xlarge
          oss:
            - alinux2023
          regions:
            - us-east-1
          schedulers:
            - slurm

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

 The test could fail because the external shared storage could
 randomly be deployed on AZs that were not used by the cluster.

 With this fix, we ensure that the shared storage is deployed
 on the AZs that the cluster would certainly land on.
@gmarciani gmarciani added skip-changelog-update Disables the check that enforces changelog updates in PRs 3.x Test labels Jun 2, 2026
@gmarciani gmarciani marked this pull request as ready for review June 2, 2026 15:43
@gmarciani gmarciani requested review from a team as code owners June 2, 2026 15:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

3.x skip-changelog-update Disables the check that enforces changelog updates in PRs Test

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant