Skip to content
View hopingZ's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report hopingZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,019 445 Updated Oct 10, 2024

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Python 334 61 Updated Jul 21, 2024

Implementation of the proposed minGRU in Pytorch

Python 156 8 Updated Oct 14, 2024

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 111 10 Updated Oct 12, 2024

Real-time Speech-Text Foundation Model Toolkit (wip)

Python 104 10 Updated Oct 14, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,498 301 Updated Oct 14, 2024

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 138 8 Updated Aug 25, 2024
Python 6,277 476 Updated Oct 14, 2024

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jupyter Notebook 119 10 Updated Nov 12, 2022

The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems

Python 251 19 Updated Oct 10, 2023

An Open-Sourced LLM-empowered Foundation TTS System

Python 304 15 Updated Sep 25, 2024

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,409 99 Updated Aug 7, 2024

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning

Python 346 36 Updated Oct 10, 2024

Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"

Python 41 5 Updated May 19, 2023

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 700 40 Updated Sep 21, 2024

flow mirror models from JZX AI Labs

Python 38 2 Updated Sep 30, 2024

Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Python 48 7 Updated Oct 10, 2024
Python 28 4 Updated Jun 13, 2024

Preprocess and segement audio files from ami-corpus

Python 2 1 Updated Aug 25, 2022

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,854 259 Updated Sep 25, 2024

Python implementation of pre-processing for End-to-End speech recognition

Python 69 23 Updated Feb 19, 2018

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 223 43 Updated Apr 29, 2024

Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformers

Python 25 Updated Feb 26, 2023

[ACMMM'2024] Generative Expressive Conversational Speech Synthesis

18 1 Updated Aug 20, 2024

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 51,197 11,408 Updated Oct 14, 2024

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Python 151 10 Updated Jul 12, 2024

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Jupyter Notebook 305 39 Updated May 10, 2024

DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability

Python 86 6 Updated Jul 10, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 247 19 Updated Oct 15, 2024
Next