pltobing
/

nemo-asr-cache-aware-streaming-160ms-en-onnx

nemo-conformer-rnnt

Model card Files Files and versions

ONNX cache-aware streaming ASR Nemo (Conformer-RNNT) [EN-0.16s]

Device: CPU
Language: English
Latency: 160ms (1 + 1 future context chunks; 1 chunk is 8 frames; 1 frame is 10ms)
Model origin: https://huggingface.co/nvidia/nemotron-speech-streaming-en-0.6b
ONNX origin: https://github.com/istupakov/onnx-asr

By: Patrick Lumbantobing

Copyright@VertoX-AI

Downloads last month: 2

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pltobing/nemo-asr-cache-aware-streaming-160ms-en-onnx

Base model

nvidia/nemotron-speech-streaming-en-0.6b

Quantized

(8)

this model