Skip to content

Optimize cuda::std::saturating_(add|sub) for 120f+#8026

Merged
davebayer merged 1 commit intoNVIDIA:mainfrom
davebayer:optimize_add_sat
Apr 7, 2026
Merged

Optimize cuda::std::saturating_(add|sub) for 120f+#8026
davebayer merged 1 commit intoNVIDIA:mainfrom
davebayer:optimize_add_sat

Conversation

@davebayer
Copy link
Copy Markdown
Contributor

@davebayer davebayer commented Mar 13, 2026

PTX ISA 9.2 introduced some .sat more variants of add and sub PTX instructions for 120-family architectures. We can use them in cuda::std::saturating_add and cuda::std::saturating_sub

@davebayer davebayer requested a review from a team as a code owner March 13, 2026 12:10
@davebayer davebayer requested a review from pciolkosz March 13, 2026 12:10
@github-project-automation github-project-automation Bot moved this to Todo in CCCL Mar 13, 2026
@cccl-authenticator-app cccl-authenticator-app Bot moved this from Todo to In Review in CCCL Mar 13, 2026
@github-actions

This comment has been minimized.

@davebayer davebayer marked this pull request as draft March 13, 2026 13:27
@copy-pr-bot
Copy link
Copy Markdown
Contributor

copy-pr-bot Bot commented Mar 13, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@cccl-authenticator-app cccl-authenticator-app Bot moved this from In Review to In Progress in CCCL Mar 13, 2026
Comment thread libcudacxx/include/cuda/std/__numeric/saturating_add.h
@davebayer
Copy link
Copy Markdown
Contributor Author

/ok to test 0da4377

@github-actions

This comment has been minimized.

@davebayer
Copy link
Copy Markdown
Contributor Author

/ok to test c942dbe

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 3, 2026

🥳 CI Workflow Results

🟩 Finished in 1h 41m: Pass: 100%/108 | Total: 1d 15h | Max: 1h 13m | Hits: 97%/281562

See results here.

@davebayer davebayer changed the title Optimize cuda::std::(add|sub)_sat for 120f+ Optimize cuda::std::saturating_(add|sub) for 120f+ Apr 7, 2026
@davebayer davebayer marked this pull request as ready for review April 7, 2026 20:10
@cccl-authenticator-app cccl-authenticator-app Bot moved this from In Progress to In Review in CCCL Apr 7, 2026
@davebayer davebayer merged commit 12652bc into NVIDIA:main Apr 7, 2026
128 of 131 checks passed
@github-project-automation github-project-automation Bot moved this from In Review to Done in CCCL Apr 7, 2026
gonidelis pushed a commit to gonidelis/cccl that referenced this pull request Apr 8, 2026
jainishmehta pushed a commit to jainishmehta/cccl that referenced this pull request Apr 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

2 participants