novateur/WavTokenizer

Text-to-Speech
audio-feature-extraction
speech-language-models
gpt4-o
tokenizer
codec-representation
automatic-speech-recognition
WavTokenizer
3.17 GB
  • 1 contributor
History: 18 commits
novateur
Update README.md
917d513 verified about 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    5.99 kB
    Update README.md about 1 year ago
  • WavTokenizer_small_320_24k_4096.ckpt
    1.58 GB
    xet
    Upload WavTokenizer_small_320_24k_4096.ckpt over 1 year ago
  • WavTokenizer_small_600_24k_4096.ckpt
    1.59 GB
    xet
    Upload WavTokenizer_small_600_24k_4096.ckpt over 1 year ago
  • result.png
    285 kB
    Upload result.png over 1 year ago
  • wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml
    2.78 kB
    Update wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml over 1 year ago
  • wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml
    2.86 kB
    Update wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml over 1 year ago