[PyTorch][Training][EC2][SageMaker] PyTorch 2.10 Currency Release#5746
Merged
bhanutejagk merged 4 commits intoaws:masterfrom Mar 17, 2026
Merged
[PyTorch][Training][EC2][SageMaker] PyTorch 2.10 Currency Release#5746bhanutejagk merged 4 commits intoaws:masterfrom
bhanutejagk merged 4 commits intoaws:masterfrom
Conversation
- Add CPU and GPU Dockerfiles with SM SDK v3, fastai, Python 3.13, CUDA 13.0 - Add buildspecs for EC2 and SageMaker - Add EC2 test file for PyTorch 2.10 - Update conftest.py with pytorch_training___2__10 fixture and version regex fix - Update SageMaker conftest.py skip_smppy_test for 2.10 - Add mlflow/skops CVEs to SM allowlists - Add sanity test fixes for SM SDK v3 in-image (utility install, remote function, pip check) - Configure dlc_developer_config.toml for PyTorch training build
jinyan-li1
approved these changes
Mar 17, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Purpose
Currency release of PyTorch 2.10.0 training DLC images (CPU and GPU) to keep AWS Deep Learning Containers up to date with the latest stable PyTorch release. This enables customers to use PyTorch 2.10 with Python 3.13, CUDA 13.0, and SageMaker SDK v3 on both EC2 and SageMaker.
Test Plan
Testing the following on images built from this PR. Tests enabled via dlc_developer_config.toml:
sanity_tests = true — pip check, utility installation, remote function compatibility, pre-release checks
security_tests = true — ECR scan, safety report with CVE allowlists
ecs_tests = true
eks_tests = true
ec2_tests = true — EC2 training via test_pytorch_training_2_10.py
ec2_benchmark_tests = true
ec2_tests_on_heavy_instances = true
sagemaker_local_tests = true
sagemaker_remote_tests = true
sagemaker_efa_tests = true
sagemaker_rc_tests = true
sagemaker_benchmark_tests = true
Test Result
EC2 image -
d4aa71f - passing ec2, ecs, sanity, security and sanity tests
cc85ac4 - passing eks test
SM image -
179e610 - passing all sm related tests
Toggle if you are merging into master Branch
By default, docker image builds and tests are disabled. Two ways to run builds and tests:
How to use the helper utility for updating dlc_developer_config.toml
Assuming your remote is called
origin(you can find out more withgit remote -v)...python src/prepare_dlc_dev_environment.py -b </path/to/buildspec.yml> -cp originpython src/prepare_dlc_dev_environment.py -b </path/to/buildspec.yml> -t sanity_tests -cp originpython src/prepare_dlc_dev_environment.py -rcp originNOTE: If you are creating a PR for a new framework version, please ensure success of the local, standard, rc, and efa sagemaker tests by updating the dlc_developer_config.toml file:
sagemaker_remote_tests = truesagemaker_efa_tests = truesagemaker_rc_tests = truesagemaker_local_tests = trueHow to use PR description
Use the code block below to uncomment commands and run the PR CodeBuild jobs. There are two commands available:# /buildspec <buildspec_path># /buildspec pytorch/training/buildspec.yml# /tests <test_list># /tests sanity security ec2sanity, security, ec2, ecs, eks, sagemaker, sagemaker-local.Toggle if you are merging into main Branch
PR Checklist
pre-commit run --all-fileslocally before creating this PR. (Read DEVELOPMENT.md for details).