ONNX cache-aware streaming ASR Nemo (Conformer-RNNT) [EN-0.16s]
Device: CPU
Language: English
Latency: 160ms (1 + 1 future context chunks; 1 chunk is 8 frames; 1 frame is 10ms)
Model origin: https://huggingface.co/nvidia/nemotron-speech-streaming-en-0.6b
ONNX origin: https://github.com/istupakov/onnx-asr
By: Patrick Lumbantobing
Copyright@VertoX-AI
- Downloads last month
- 2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for pltobing/nemo-asr-cache-aware-streaming-160ms-en-onnx
Base model
nvidia/nemotron-speech-streaming-en-0.6b