Skip to content

Auto-detect NeMo model type during conversion#26

Open
cathrynlavery wants to merge 1 commit intoFrikallo:mainfrom
cathrynlavery:fix/auto-detect-convert-nemo-model
Open

Auto-detect NeMo model type during conversion#26
cathrynlavery wants to merge 1 commit intoFrikallo:mainfrom
cathrynlavery:fix/auto-detect-convert-nemo-model

Conversation

@cathrynlavery
Copy link
Copy Markdown

Summary

  • make convert_nemo.py auto-detect the Parakeet model preset from checkpoint tensor shapes by default
  • keep explicit --model overrides for all existing presets
  • update README wording for the new default

Why

Converting a 600M TDT checkpoint with the old 110M default can produce a safetensors file with the wrong joint tensor split. The file can look valid but then fail at runtime because the expected joint weights are not initialized.

Validation

  • python3 -m py_compile scripts/convert_nemo.py
  • python3 scripts/convert_nemo.py --help
  • synthetic infer_model_type checks for 600m-tdt, 110m-tdt-ctc, eou-120m, and sortformer

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant