Skip to content
View IMYBo's full-sized avatar
Block or Report

Block or report IMYBo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 93 3 Updated Jul 15, 2024

Expressive Anechoic Recordings of Speech (EARS)

Python 107 6 Updated Jun 25, 2024

Predicts the level of noise and reverberation on your audiofiles

Jupyter Notebook 122 22 Updated May 22, 2024

Some comprehensive papers about speaker diarization

170 3 Updated Mar 26, 2024

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

266 11 Updated Jul 30, 2024

Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models

Python 16 2 Updated Sep 21, 2023

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,073 989 Updated Aug 1, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,048 2,002 Updated Jul 25, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,152 2,324 Updated Aug 3, 2024

Python package for combining diarization system outputs.

Python 73 13 Updated Oct 12, 2023

Variational Bayes HMM over x-vectors diarization

Python 244 57 Updated Jan 15, 2024

Structured state space sequence models

Jupyter Notebook 2,297 280 Updated Jul 17, 2024

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Python 193 26 Updated Jul 14, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,525 224 Updated Jul 8, 2024

VB Diarization with Eigenvoice and HMM Priors, refactored

Python 14 3 Updated Jul 27, 2021

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,056 93 Updated Jul 11, 2024

CHIME-7 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 56 4 Updated May 17, 2024

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Python 198 43 Updated Apr 8, 2021

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML 461 142 Updated Jul 1, 2024

BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION

Python 50 1 Updated Jul 8, 2024

Official repository of NeXt-TDNN for speaker verification

Python 43 2 Updated Apr 6, 2024

kmeans using PyTorch

Jupyter Notebook 458 74 Updated May 9, 2023

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Python 31,944 3,336 Updated Jul 20, 2024

Python package to add text to images, textures and different backgrounds

Python 149 20 Updated Jul 30, 2024

(N=1,2,3)-dimensional unfold (im2col) and fold (col2im) in PyTorch

Python 82 7 Updated Jun 14, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,526 459 Updated Aug 3, 2024
Shell 44 7 Updated May 11, 2024

BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab

Python 817 54 Updated Apr 22, 2024
Next