Yifan Yang

yfyeung

20 10 24

https://yfyeung.github.io

yfyeung

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

GigaSpeechBench: A Real-World Multilingual Speech-to-Text Benchmark

new activity 6 days ago

speechcolab/gigaspeech-test:[bot] Conversion to Parquet

updated a collection 7 days ago

GigaSpeech Series

View all activity

Organizations

authored a paper 1 day ago

GigaSpeechBench: A Real-World Multilingual Speech-to-Text Benchmark

Paper • 2606.28884 • Published 6 days ago

New activity in speechcolab/gigaspeech-test 6 days ago

[bot] Conversion to Parquet

#1 opened 6 days ago by

parquet-converter

updated a collection 7 days ago

GigaSpeech Series

Collection

Evolving, Large-Scale, and Multi-domain ASR Corpus • 6 items • Updated 7 days ago

updated a dataset 7 days ago

speechcolab/gigaspeech-test

Viewer • Updated 6 days ago • 19.9k • 38

published a dataset 7 days ago

speechcolab/gigaspeech-test

Viewer • Updated 6 days ago • 19.9k • 38

New activity in speechcolab/gigaspeech2 7 days ago

Can't load the `dev` and `test`

#8 opened 3 months ago by

sabilmakbar

updated a dataset 14 days ago

yfyeung/FCaps

Preview • Updated 14 days ago • 84 • 5

updated a model 14 days ago

yfyeung/CLSP

0.7B • Updated 14 days ago • 1.44k • 4

authored a paper 25 days ago

MMAE: A Massive Multitask Audio Editing Benchmark

Paper • 2606.07229 • Published 28 days ago • 46

upvoted a paper 25 days ago

MMAE: A Massive Multitask Audio Editing Benchmark

Paper • 2606.07229 • Published 28 days ago • 46

authored a paper 28 days ago

UAT: Unified Audio-Text Diffusion for Audio Generation, Editing, and Captioning

Paper • 2606.04939 • Published 30 days ago

authored a paper about 2 months ago

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

Paper • 2605.09413 • Published May 10 • 5

New activity in typhoon-ai/gigaspeech2-typhoon about 2 months ago

Fix citation of GigaSpeech 2

#4 opened about 2 months ago by

yfyeung

upvoted a collection about 2 months ago

Typhoon ASR

Collection

A speech to text model for Thai • 9 items • Updated Jan 28 • 5

upvoted a paper about 2 months ago

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

Paper • 2605.09413 • Published May 10 • 5

authored a paper about 2 months ago

WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling

Paper • 2605.06407 • Published May 7

upvoted a paper 2 months ago

SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations

Paper • 2510.25955 • Published Oct 29, 2025 • 1

upvoted a paper 3 months ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 69

authored a paper 3 months ago

Representation-Regularized Convolutional Audio Transformer for Audio Understanding

Paper • 2601.21612 • Published Jan 29 • 1

New activity in speechcolab/gigaspeech2 3 months ago

[Help Wanted] Support for GigaSpeech 2 Splits

#4 opened about 2 years ago by

ruby11dog