[jax-inference-offloading] Bump to CUDA 13.0 by yhtang · Pull Request #1807 · NVIDIA/JAX-Toolbox

yhtang · 2025-12-02T07:19:22Z

This draft PR is intended to validate the latest vLLM+cu130 wheel. At the moment, the wheel is only available for x86_64 and does not yet support aarch64. I plan to formalize this PR once both amd64 and arm64 wheels are published to PyPI, or explore alternative build options if benchmarks with the experimental CUDA 13.0 containers reveal a significant performance gap.

yhtang added 5 commits December 1, 2025 21:46

bump to CUDA 13.0

ca22ed5

pin vllm CUDA 13.0 wheel

dd8c943

update base image in CI

bc77eec

x86 only

2b4ba4a

Merge branch 'main' into yhtang/jio-bump-cuda-13.0

c91a7f3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[jax-inference-offloading] Bump to CUDA 13.0#1807

[jax-inference-offloading] Bump to CUDA 13.0#1807
yhtang wants to merge 5 commits into
mainfrom
yhtang/jio-bump-cuda-13.0

yhtang commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yhtang commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant