Skip to content
View lwang114's full-sized avatar

Block or report lwang114

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 310 21 Updated Sep 3, 2024

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…

641 42 Updated Aug 9, 2024

Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).

Python 99 12 Updated Jun 14, 2023

[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383

Python 401 35 Updated Oct 28, 2022

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,492 1,548 Updated May 23, 2024

Word Discovery in Visually Grounded, Self-Supervised Speech Models

Jupyter Notebook 25 7 Updated Dec 4, 2023

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,538 6,407 Updated Oct 18, 2024

Phoneme segmentation using pre-trained speech models

Python 53 10 Updated Nov 4, 2022

American Sign Language to Speech Application.

Python 91 24 Updated Sep 30, 2020
Python 981 250 Updated Jun 28, 2020

speech self-supervised representations

Python 467 38 Updated Apr 27, 2023

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Python 142 23 Updated Sep 1, 2020
Shell 5 1 Updated Sep 21, 2021

Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)

Python 137 31 Updated Aug 5, 2022

Global Rhythm Style Transfer Without Text Transcriptions

Python 261 35 Updated Oct 23, 2024

Bottom-up features extractor implemented in PyTorch.

Python 71 19 Updated Dec 5, 2019
Jupyter Notebook 7 1 Updated Oct 26, 2020
Python 35 12 Updated Jun 12, 2023

An open-source NLP research library, built on PyTorch.

Python 11,759 2,253 Updated Nov 22, 2022

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Python 9,038 1,835 Updated Apr 22, 2022

Unsupervised word segmentation and clustering of speech

Python 13 6 Updated Feb 17, 2017

💬 Command-line translator using Google Translate, Bing Translator, Yandex.Translate, etc.

Awk 6,996 393 Updated Mar 27, 2024

Pitman-Yor processes in python

Python 25 11 Updated Apr 18, 2014

Data and code for grapheme-to-phoneme transducers in lots of languages

HTML 130 19 Updated Apr 5, 2024
Python 12 11 Updated Feb 26, 2018