Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yifan Peng's picture
10 10 21

Yifan Peng

pyf98
regisss's profile picture manmademoon's profile picture Alphonsce's profile picture
Β·
https://pyf98.github.io
  • pyf98

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

Organizations

NVIDIA's profile picture ESPnet's profile picture Blog-explorers's profile picture YODAS Sharing inc's profile picture Nvidia Data&Tools team's profile picture Carnegie Mellon University's profile picture

pyf98 's collections 1

Open Whisper-style Speech Models (OWSM)
Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
  • Running on Zero
    9

    OWSM V4 Demo

    🌍
    9

    This is a demo for OWSM-V4 CTC and medium model.

  • Runtime error
    Featured
    55

    OWSM Demo

    πŸ”Š
    55

  • espnet/yodas_owsmv4

    Viewer β€’ Updated Sep 1, 2025 β€’ 4 β€’ 7.24k β€’ 15
  • espnet/owsm_ctc_v4_1B

    Automatic Speech Recognition β€’ Updated Aug 30, 2025 β€’ 955 β€’ 5
Open Whisper-style Speech Models (OWSM)
Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
  • Running on Zero
    9

    OWSM V4 Demo

    🌍
    9

    This is a demo for OWSM-V4 CTC and medium model.

  • Runtime error
    Featured
    55

    OWSM Demo

    πŸ”Š
    55

  • espnet/yodas_owsmv4

    Viewer β€’ Updated Sep 1, 2025 β€’ 4 β€’ 7.24k β€’ 15
  • espnet/owsm_ctc_v4_1B

    Automatic Speech Recognition β€’ Updated Aug 30, 2025 β€’ 955 β€’ 5
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs