Skip to content

[BACKPORT] Fix bert performance regressions on release branch#2327

Closed
umangyadav wants to merge 3 commits into
release/rocm-rel-7.2from
release/rocm-rel-7.2-perf-fix
Closed

[BACKPORT] Fix bert performance regressions on release branch#2327
umangyadav wants to merge 3 commits into
release/rocm-rel-7.2from
release/rocm-rel-7.2-perf-fix

Conversation

@umangyadav
Copy link
Copy Markdown
Member

@umangyadav umangyadav commented Apr 2, 2026

Motivation

Cherry pick some fixes to fix performance on bert models
More details at ticket SWDEV-580287

This cherry pick brings back the performance and does a little better compared to QA's baseline

justinrosner and others added 3 commits April 2, 2026 20:19
* Fix barrier placement for scheduleVersion = 1.
It should appear before LDSRead and not before GlobalLoad.
@umangyadav umangyadav requested a review from causten as a code owner April 2, 2026 21:02
@umangyadav umangyadav self-assigned this Apr 2, 2026
@umangyadav umangyadav changed the title Release/rocm rel 7.2 perf fix [BACKPORT] Fix bert performance regressions on release branch Apr 2, 2026
Copy link
Copy Markdown
Contributor

@justinrosner justinrosner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just cherry-picking some changes. Looks good to me

@umangyadav
Copy link
Copy Markdown
Member Author

Closing this one for #2331

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants