fix: use multi-GPU RWKV strategy from visible CUDA devices by Chessing234 · Pull Request #3882 · lm-sys/FastChat

Chessing234 · 2026-05-30T10:17:09Z

Bug

Launching an RWKV model worker with --num-gpus / --gpus still loads with strategy="cuda fp16", so RWKV ignores the requested GPU list and runs on a single device.

Root cause

RwkvModel.__init__ hardcodes strategy="cuda fp16" instead of building a per-device strategy string for multiple visible GPUs.

Why this fix is correct

When more than one CUDA device is visible (including after CUDA_VISIBLE_DEVICES is set by the worker), build cuda:0 fp16 -> cuda:1 fp16 -> ... as RWKV expects. Single-GPU behavior stays cuda fp16.

Made with Cursor

Use multi-GPU RWKV strategy when more than one GPU is visible via --gpus / --num-gpus instead of hardcoding single-device cuda fp16. Co-authored-by: Cursor <cursoragent@cursor.com>

fix: build RWKV strategy from visible CUDA device count

f1251e9

Use multi-GPU RWKV strategy when more than one GPU is visible via --gpus / --num-gpus instead of hardcoding single-device cuda fp16. Co-authored-by: Cursor <cursoragent@cursor.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use multi-GPU RWKV strategy from visible CUDA devices#3882

fix: use multi-GPU RWKV strategy from visible CUDA devices#3882
Chessing234 wants to merge 1 commit into
lm-sys:mainfrom
Chessing234:fix/rwkv-multi-gpu-strategy

Chessing234 commented May 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Chessing234 commented May 30, 2026

Bug

Root cause

Why this fix is correct

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant