How to Build Ultra-low-latency Voice Agents With NVIDIA Cache-aware Streaming ASRThis post accompanies the launch of NVIDIA Nemotron Speech ASR on Hugging Face. Read the full model announcement here. In this post, we’ll build a voice agent using three NVIDIA open models: The new Nemotron Speech ASR modelNemotron 3 Nano LLMA preview checkpoint of the upcoming NVIDIA Magpie text-to-speech modelThis

