diff --git a/docs/source/asr/asr_checkpoints.rst b/docs/source/asr/asr_checkpoints.rst
index 0d6ce1d0478f..bd7a6cd4b2ba 100644
--- a/docs/source/asr/asr_checkpoints.rst
+++ b/docs/source/asr/asr_checkpoints.rst
@@ -7,6 +7,8 @@ ASR Model Checkpoints
 This page lists all supported ASR model checkpoints released by NVIDIA NeMo.
 Benchmark scores for each model can be found on its `HuggingFace model card `__.
 
+For community fine-tunes built on these checkpoints, see :doc:`Featured Community Checkpoints <./featured_community_checkpoints>`.
+
 Glossary
 --------
 
diff --git a/docs/source/asr/featured_community_checkpoints.rst b/docs/source/asr/featured_community_checkpoints.rst
new file mode 100644
index 000000000000..7c0eb5ad1737
--- /dev/null
+++ b/docs/source/asr/featured_community_checkpoints.rst
@@ -0,0 +1,49 @@
+.. _featured-community-checkpoints:
+
+Featured Community Checkpoints
+==============================
+
+Community fine-tunes built on NVIDIA NeMo ASR checkpoints and published on Hugging Face.
+For NVIDIA-published checkpoints, see :doc:`./asr_checkpoints` and the `NVIDIA Hugging Face organization `__.
+
+.. note::
+
+   Community checkpoints are maintained by their authors, not by the NeMo team.
+   Use each model's Hugging Face model card and the framework project linked below for up-to-date setup and inference instructions.
+
+.. list-table::
+   :header-rows: 1
+   :widths: 28 52 20
+
+   * - Checkpoint
+     - What's special
+     - Framework
+   * - `akera/parakeet-tdt-salt `__
+     - SALT multilingual ASR for 10 East African languages. Hybrid TDT+CTC FastConformer (600M), fine-tuned from `parakeet-tdt-0.6b-v3 `__.
+     - NeMo
+   * - `johannhartmann/parakeet_de_med `__
+     - German medical documentation ASR (PEFT). WER 11.73% → 3.28% on a 122-sample medical eval set.
+     - NeMo
+   * - `qenneth/parakeet-tdt-0.6b-v3-finetuned-for-ATC `__
+     - ATC English ASR on `jacktol/ATC-ASR-Dataset `__. Test WER 5.99%.
+     - NeMo
+   * - `KasuleTrevor/parakeet-0.6b-cv-sw-5hr_v9 `__
+     - Swahili ASR fine-tune on ~5 hours of Common Voice data.
+     - NeMo
+   * - `NeurologyAI/neuro-parakeet-mlx `__
+     - German medical/neurology ASR for Apple Silicon. WER 1.04% on the author's medical validation set.
+     - MLX
+   * - `cstr/parakeet-tdt-0.6b-v3-GGUF `__
+     - Quantised Parakeet TDT (Q4_K ~467 MB). 25 EU languages, word-level timestamps.
+     - GGUF (`CrispASR `__)
+   * - `cstr/canary-1b-v2-GGUF `__
+     - Quantised Canary 1B (Q4_K ~673 MB). Multilingual ASR and speech translation.
+     - GGUF (`CrispASR `__)
+
+
+.. _submit-a-community-checkpoint:
+
+Submit a Community Checkpoint
+-----------------------------
+
+To suggest a checkpoint for this page, open a `GitHub issue `__ with the Hugging Face model link, NeMo base checkpoint, task, languages, evaluation results, and inference framework.
diff --git a/docs/source/asr/intro.rst b/docs/source/asr/intro.rst
index c43fee6da7c6..0f5140662a8d 100644
--- a/docs/source/asr/intro.rst
+++ b/docs/source/asr/intro.rst
@@ -72,3 +72,4 @@ Further Reading
    asr_language_modeling_and_customization
    configs
    api
+   featured_community_checkpoints
diff --git a/docs/source/starthere/choosing_a_model.rst b/docs/source/starthere/choosing_a_model.rst
index 1abcc74d6fb4..c1c1a0c2fb05 100644
--- a/docs/source/starthere/choosing_a_model.rst
+++ b/docs/source/starthere/choosing_a_model.rst
@@ -132,6 +132,7 @@ All pretrained NeMo models are available on:
 
 - `HuggingFace Hub (nvidia) `_ — search for "nemo" or specific model names
 - `NGC Model Catalog `_ — NVIDIA's model registry
+- :doc:`Featured Community Checkpoints ` — fine-tunes from external users
 
 See :doc:`../checkpoints/intro` for instructions on loading pretrained models.
 