Skip to content

make training and sweeps work on amd gpu hip/rocm#562

Draft
sozforex wants to merge 1 commit into
PufferAI:4.0from
sozforex:rocm_for_40
Draft

make training and sweeps work on amd gpu hip/rocm#562
sozforex wants to merge 1 commit into
PufferAI:4.0from
sozforex:rocm_for_40

Conversation

@sozforex
Copy link
Copy Markdown
Contributor

Allows training and sweeps to run on amd gpu [hip/rocm],
cuda specific code is hipified during build and is placed in build/hip/src.

There has been a problem with sweep_obj being passed between processes when protein sweep is on gpu [apparently special synchronization handling is implemented for cuda, but not rocm in torch], so this PR contains a change related to that.

Leaving this as a draft as I have not tested this with an nvidia gpu, and only run a small sweep of breakout.

@jsuarez5341 please let me know if you care to merge amd gpu support in a shape similar to this draft PR

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant