yhzhouowo Mortyzhou-Shef-BIT

🪐

Working from home

Living with attention is all we need.

39 followers · 465 following

UoS -> NUS & BIT
https://mortyzaigc.netlify.app/

Achievements

Block or Report

Block or report Mortyzhou-Shef-BIT

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Awesome-Transformer-Attention Public
Forked from cmhungsteve/Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

1 Updated Jun 30, 2023
Awesome-Multimodal-Research Public
Forked from Eurus-Holmes/Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Python MIT License Updated Jun 22, 2023
awesome-embodied-vision Public
Forked from ChanganVR/awesome-embodied-vision

Reading list for research topics in embodied vision

MIT License Updated May 31, 2023
TTS Public
Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python Mozilla Public License 2.0 Updated May 6, 2023
tango Public
Forked from declare-lab/tango

Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"

Python Other Updated Apr 28, 2023
AudioLDM Public
Forked from haoheliu/AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python Other Updated Feb 12, 2023
dialog_evaluation_paper_list Public
Forked from pygongnlp/dialog_evaluation_paper_list

Dialog Evaluation Paper List: include multiple different dialog tasks

Updated Nov 30, 2022
reentry Public
Forked from zexupan/reentry

Python Updated Aug 30, 2022
Speech-Resources Public
Forked from ddlBoJack/Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

1 MIT License Updated Feb 16, 2022
SpeechTransProgress Public
Forked from kahne/SpeechTransProgress

Tracking the progress in end-to-end speech translation

Creative Commons Zero v1.0 Universal Updated Feb 15, 2022
TalkNet-ASD Public
Forked from TaoRuijie/TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Python MIT License Updated Feb 13, 2022
Awesome-Cloud-Edge-AI Public
Forked from swagshaw/Awesome-Cloud-Edge-AI

A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper notes are also provided.

MIT License Updated Jan 4, 2022
DYGANVC Public
Forked from MingjieChen/DYGANVC

source code for "DYGAN-VC: IMPROVING SPEECH CONTENT PRESERVATION FOR GAN VOICE CONVERSION USING DYNAMIC CONVOLUTION"

Python Updated Oct 8, 2021
StarGANv2-VC Public
Forked from yl4579/StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Python MIT License Updated Aug 29, 2021
espnet_model_zoo Public
Forked from espnet/espnet_model_zoo

ESPnet Model Zoo

Python Apache License 2.0 Updated Jul 12, 2021
FastVocoder Public
Forked from xcmyz/FastVocoder

Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

Python MIT License Updated Jul 2, 2021
ppg-vc Public
Forked from liusongxiang/ppg-vc

PPG-Based Voice Conversion

Python 1 Apache License 2.0 Updated Jun 28, 2021
VQMIVC Public
Forked from Wendison/VQMIVC

Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021

Python MIT License Updated Jun 21, 2021
HiSD Public
Forked from imlixinyang/HiSD

Official pytorch implementation of paper "Image-to-image Translation via Hierarchical Style Disentanglement" (CVPR 2021 Oral).

Python Other Updated Jun 15, 2021
Pytorch-MBNet Public
Forked from sky1456723/Pytorch-MBNet

A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Python Updated Jun 11, 2021
crank Public
Forked from k2kobayashi/crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Python MIT License Updated May 28, 2021
gdown Public
Forked from wkentaro/gdown

Download a large file from Google Drive (curl/wget fails because of the security notice).

Python MIT License Updated May 11, 2021
CMU-MultimodalSDK Public
Forked from Jie-Xie/CMU-MultimodalDataSDK

CMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.

Python Other Updated Apr 29, 2021
Talking-Face_PC-AVS Public
Forked from Wendison/Talking-Face_PC-AVS

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

Python Creative Commons Attribution 4.0 International Updated Apr 28, 2021
speechmetrics Public
Forked from aliutkus/speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python Updated Apr 26, 2021
speech-synthesis-paper Public
Forked from wenet-e2e/speech-synthesis-paper

List of speech synthesis papers.

MIT License Updated Apr 19, 2021
transformers Public
Forked from huggingface/transformers

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

Python Apache License 2.0 Updated Apr 15, 2021
s3prl Public
Forked from s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Python MIT License Updated Apr 14, 2021
fairseq Public
Forked from facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python MIT License Updated Apr 8, 2021
diffwave Public
Forked from lmnt-com/diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python Apache License 2.0 Updated Apr 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yhzhouowo Mortyzhou-Shef-BIT

Achievements

Achievements

Block or report Mortyzhou-Shef-BIT

Awesome-Transformer-Attention Public

Awesome-Multimodal-Research Public

awesome-embodied-vision Public

TTS Public

tango Public

AudioLDM Public

dialog_evaluation_paper_list Public

reentry Public

Speech-Resources Public

SpeechTransProgress Public

TalkNet-ASD Public

Awesome-Cloud-Edge-AI Public

DYGANVC Public

StarGANv2-VC Public

espnet_model_zoo Public

FastVocoder Public

ppg-vc Public

VQMIVC Public

HiSD Public

Pytorch-MBNet Public

crank Public

gdown Public

CMU-MultimodalSDK Public

Talking-Face_PC-AVS Public

speechmetrics Public

speech-synthesis-paper Public

transformers Public

s3prl Public

fairseq Public

diffwave Public