Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Yifan Peng's picture

Yifan Peng

pyf98

regisss's profile picture

manmademoon's profile picture

Alphonsce's profile picture

·

https://pyf98.github.io

pyf98

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

Organizations

pyf98 's collections 1

Open Whisper-style Speech Models (OWSM)

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/

Running on Zero

9

OWSM V4 Demo

🌍

9

This is a demo for OWSM-V4 CTC and medium model.
Runtime error

Featured

55

OWSM Demo

🔊

55
espnet/yodas_owsmv4

Viewer • Updated Sep 1, 2025 • 4 • 7.24k • 15
espnet/owsm_ctc_v4_1B

Automatic Speech Recognition • Updated Aug 30, 2025 • 955 • 5

Open Whisper-style Speech Models (OWSM)

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/

Running on Zero

9

OWSM V4 Demo

🌍

9

This is a demo for OWSM-V4 CTC and medium model.
Runtime error

Featured

55

OWSM Demo

🔊

55
espnet/yodas_owsmv4

Viewer • Updated Sep 1, 2025 • 4 • 7.24k • 15
espnet/owsm_ctc_v4_1B

Automatic Speech Recognition • Updated Aug 30, 2025 • 955 • 5

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs