- UoS -> NUS & BIT
- https://mortyzaigc.netlify.app/
Block or Report
Block or report Mortyzhou-Shef-BIT
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
Awesome-Transformer-Attention Public
Forked from cmhungsteve/Awesome-Transformer-AttentionAn ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
1 UpdatedJun 30, 2023 -
Awesome-Multimodal-Research Public
Forked from Eurus-Holmes/Awesome-Multimodal-ResearchA curated list of Multimodal Related Research.
Python MIT License UpdatedJun 22, 2023 -
awesome-embodied-vision Public
Forked from ChanganVR/awesome-embodied-visionReading list for research topics in embodied vision
MIT License UpdatedMay 31, 2023 -
TTS Public
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python Mozilla Public License 2.0 UpdatedMay 6, 2023 -
tango Public
Forked from declare-lab/tangoCodes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
Python Other UpdatedApr 28, 2023 -
AudioLDM Public
Forked from haoheliu/AudioLDMAudioLDM: Generate speech, sound effects, music and beyond, with text.
Python Other UpdatedFeb 12, 2023 -
dialog_evaluation_paper_list Public
Forked from pygongnlp/dialog_evaluation_paper_listDialog Evaluation Paper List: include multiple different dialog tasks
UpdatedNov 30, 2022 -
-
Speech-Resources Public
Forked from ddlBoJack/Speech-Resources语音方向实验室/公司/资源/实习等,欢迎推荐或自荐
-
SpeechTransProgress Public
Forked from kahne/SpeechTransProgressTracking the progress in end-to-end speech translation
Creative Commons Zero v1.0 Universal UpdatedFeb 15, 2022 -
TalkNet-ASD Public
Forked from TaoRuijie/TalkNet-ASDACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Python MIT License UpdatedFeb 13, 2022 -
Awesome-Cloud-Edge-AI Public
Forked from swagshaw/Awesome-Cloud-Edge-AIA curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper notes are also provided.
MIT License UpdatedJan 4, 2022 -
DYGANVC Public
Forked from MingjieChen/DYGANVCsource code for "DYGAN-VC: IMPROVING SPEECH CONTENT PRESERVATION FOR GAN VOICE CONVERSION USING DYNAMIC CONVOLUTION"
Python UpdatedOct 8, 2021 -
StarGANv2-VC Public
Forked from yl4579/StarGANv2-VCStarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Python MIT License UpdatedAug 29, 2021 -
espnet_model_zoo Public
Forked from espnet/espnet_model_zooESPnet Model Zoo
Python Apache License 2.0 UpdatedJul 12, 2021 -
FastVocoder Public
Forked from xcmyz/FastVocoderInclude Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Python MIT License UpdatedJul 2, 2021 -
ppg-vc Public
Forked from liusongxiang/ppg-vcPPG-Based Voice Conversion
-
VQMIVC Public
Forked from Wendison/VQMIVCOfficial implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
Python MIT License UpdatedJun 21, 2021 -
HiSD Public
Forked from imlixinyang/HiSDOfficial pytorch implementation of paper "Image-to-image Translation via Hierarchical Style Disentanglement" (CVPR 2021 Oral).
Python Other UpdatedJun 15, 2021 -
Pytorch-MBNet Public
Forked from sky1456723/Pytorch-MBNetA pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
Python UpdatedJun 11, 2021 -
crank Public
Forked from k2kobayashi/crankA toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Python MIT License UpdatedMay 28, 2021 -
gdown Public
Forked from wkentaro/gdownDownload a large file from Google Drive (curl/wget fails because of the security notice).
Python MIT License UpdatedMay 11, 2021 -
CMU-MultimodalSDK Public
Forked from Jie-Xie/CMU-MultimodalDataSDKCMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.
Python Other UpdatedApr 29, 2021 -
Talking-Face_PC-AVS Public
Forked from Wendison/Talking-Face_PC-AVSCode for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
Python Creative Commons Attribution 4.0 International UpdatedApr 28, 2021 -
speechmetrics Public
Forked from aliutkus/speechmetricsA wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Python UpdatedApr 26, 2021 -
speech-synthesis-paper Public
Forked from wenet-e2e/speech-synthesis-paperList of speech synthesis papers.
MIT License UpdatedApr 19, 2021 -
transformers Public
Forked from huggingface/transformers🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Python Apache License 2.0 UpdatedApr 15, 2021 -
s3prl Public
Forked from s3prl/s3prlSelf-Supervised Speech Pre-training and Representation Learning Toolkit.
Python MIT License UpdatedApr 14, 2021 -
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedApr 8, 2021 -
diffwave Public
Forked from lmnt-com/diffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Python Apache License 2.0 UpdatedApr 2, 2021