KittenTTS-Mini-v0.8-ONNX

ONNX export of KittenML/kitten-tts-mini-0.8 for browser inference with Transformers.js and ONNX Runtime Web.

Model Details

Architecture StyleTTS 2
Parameters 80M
Precision Int8 quantized
Sample Rate 24kHz
Runtime WASM only (int8 quantized ops not supported on WebGPU)
Voices Bella, Jasper, Luna, Bruno, Rosie, Hugo, Kiki, Leo

Note: This model uses int8 quantized operations (MatMulInteger, ConvInteger, DynamicQuantizeLSTM) that are not supported by the WebGPU execution provider in ONNX Runtime Web. A FP32 version is needed for WebGPU acceleration. For WebGPU support, use KittenTTS-Nano-v0.8-ONNX (15M FP32).

Demo

KittenTTS WebGPU Demo โ€” runs entirely in your browser.

Credits

Downloads last month
5
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for onnx-community/KittenTTS-Mini-v0.8-ONNX

Quantized
(1)
this model

Space using onnx-community/KittenTTS-Mini-v0.8-ONNX 1