bigpon

🎯

Focusing

Yi-Chiao WU bigpon

🎯

Focusing

Research Scientist @ Meta Reality Labs Research topics: Neural codec Voice Conversion, Speech Synthesis, Speech Enhancement.

77 followers · 9 following

Meta
New York City, NY, US
15:31 (UTC -05:00)
https://bigpon.github.io/

Achievements

Stars

sp-uhh / ears_benchmark

Generation scripts for EARS-WHAM and EARS-Reverb

Python 21 3 Updated Sep 16, 2024

mdeff / fma

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,241 439 Updated Jan 5, 2023

BradyFU / Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

402 12 Updated Jun 18, 2024

soham97 / PAM

PAM is a no-reference audio quality metric for audio generation tasks

Python 48 5 Updated Jul 19, 2024

gabrielmittag / NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 680 117 Updated Mar 8, 2024

unilight / sheet

Speech Human Evaluation Estimation Toolkit (SHEET)

Python 32 2 Updated Nov 7, 2024

kyutai-labs / moshi

Python 6,694 508 Updated Oct 31, 2024

dmlc / decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,881 161 Updated Jul 17, 2024

microsoft / fadtk

A simple library for Fréchet Audio Distance (FAD) calculation

Python 145 21 Updated Oct 13, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,292 777 Updated Nov 8, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,760 120 Updated Nov 10, 2024

mira-space / MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 366 9 Updated Sep 2, 2024

archinetai / cqt-pytorch

An invertible and differentiable implementation of the Constant-Q Transform (CQT).

Python 54 3 Updated Dec 9, 2022

v-iashin / Synchformer

Efficient synchronization from sparse cues

Python 28 4 Updated Apr 25, 2024

atong01 / conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Python 1,203 98 Updated Oct 9, 2024

facebookresearch / ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

Python 130 7 Updated Jun 25, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,069 87 Updated Aug 6, 2024

bfs18 / rfwave

Python 100 8 Updated Oct 7, 2024

slhck / ffmpeg-normalize

Audio Normalization for Python/ffmpeg

Python 1,274 117 Updated Oct 22, 2024

audiolabs / webMUSHRA

a MUSHRA compliant web audio API based experiment software

JavaScript 351 137 Updated Aug 9, 2024

LAION-AI / audio-dataset

Audio Dataset for training CLAP and other models

Python 632 53 Updated Feb 5, 2024

AudiogenAI / agc

Audiogen Codec

Python 127 11 Updated Jul 9, 2024

luferrer / ConfidenceIntervals

Confidence interval computation for evaluation in machine learning using the bootstrapping approach

Jupyter Notebook 66 8 Updated Apr 5, 2024

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 2,704 256 Updated Nov 5, 2024

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

Python 3,878 658 Updated Jul 30, 2024

Wataru-Nakata / miipher

Unofficial implementation of miipher

Python 111 15 Updated Apr 19, 2024

Kevin-thu / DiffMorpher

Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)

Python 416 43 Updated Apr 24, 2024

XingangPan / DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,706 3,454 Updated May 18, 2024

sp-uhh / storm

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Python 179 24 Updated Sep 13, 2024

Fraunhofer-IIS / ODAQ

Python 34 3 Updated Oct 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yi-Chiao WU bigpon

Achievements

Achievements

Block or report bigpon

Stars

sp-uhh / ears_benchmark

mdeff / fma

BradyFU / Video-MME

soham97 / PAM

gabrielmittag / NISQA

unilight / sheet

kyutai-labs / moshi

dmlc / decord

microsoft / fadtk

pyannote / pyannote-audio

NUS-HPC-AI-Lab / VideoSys

mira-space / MiraData

archinetai / cqt-pytorch

v-iashin / Synchformer

atong01 / conditional-flow-matching

facebookresearch / ears_dataset

Alpha-VLLM / Lumina-T2X

bfs18 / rfwave

slhck / ffmpeg-normalize

audiolabs / webMUSHRA

LAION-AI / audio-dataset

AudiogenAI / agc

luferrer / ConfidenceIntervals

Stability-AI / stable-audio-tools

metavoiceio / metavoice-src

Wataru-Nakata / miipher

Kevin-thu / DiffMorpher

XingangPan / DragGAN

sp-uhh / storm

Fraunhofer-IIS / ODAQ