Skip to content

[WIP] EasyMP vllm-omni model definition#15741

Open
vklimkov-nvidia wants to merge 8 commits into
NVIDIA-NeMo:easymp_voiceagentfrom
vklimkov-nvidia:easymp_vllm_omni
Open

[WIP] EasyMP vllm-omni model definition#15741
vklimkov-nvidia wants to merge 8 commits into
NVIDIA-NeMo:easymp_voiceagentfrom
vklimkov-nvidia:easymp_vllm_omni

Conversation

@vklimkov-nvidia
Copy link
Copy Markdown
Member

EasyMP model defnition, where backbone and LT are compiled into a single cuda graph for uniform batches.
Loads real weights, doesn't produce valid acoustic tokens at this point.

@vklimkov-nvidia vklimkov-nvidia requested a review from a team as a code owner June 1, 2026 16:59
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented Jun 1, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@github-actions github-actions Bot added the TTS label Jun 1, 2026
…cate speaker encoder application

Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
…ition of Easy Magpie

Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
…embeddings and prepare prefill embeddings

Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
…eckpoint to vllm omni one

Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
… prediction processing

Signed-off-by: Viacheslav Klimkov <vklimkov@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant