Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 3 items • Updated 2 days ago • 18
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System Paper • 2310.12378 • Published Oct 18, 2023
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer Paper • 2306.08753 • Published Jun 14, 2023 • 1
Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach Paper • 2309.05248 • Published Sep 11, 2023
nvidia/stt_ru_fastconformer_hybrid_large_pc Automatic Speech Recognition • Updated 24 days ago • 1.42k • 7