Skip to content

[BUG] EXL3 reserve allocations ignored for device IDs > 0 #405

@figuernd

Description

@figuernd

OS

Linux

GPU Library

CUDA 12.x

Python version

3.12

Describe the bug

Autosplit reserve (and the equivalent for tensor parallel) ONLY works for device ID 0.

Reproduction steps

In a system with multiple GPUs try something like autosplit_reserve: [1024,1024]. Device ID 1 will not reserve anything.

Expected behavior

Should reserve for each device as described in the documentation.

Logs

No response

Additional context

No response

Acknowledgements

  • I have looked for similar issues before submitting this one.
  • I have read the disclaimer, and this issue is related to a code bug. If I have a question, I will use the Discord server.
  • I understand that the developers have lives and my issue will be answered when possible.
  • I understand the developers of this program are human, and I will ask my questions politely.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions