A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,152 2,324 Updated Aug 3, 2024

desh2608 / dover-lap

Python package for combining diarization system outputs.

Python 73 13 Updated Oct 12, 2023

BUTSpeechFIT / VBx

Variational Bayes HMM over x-vectors diarization

Python 244 57 Updated Jan 15, 2024

state-spaces / s4

Structured state space sequence models

Jupyter Notebook 2,297 280 Updated Jul 17, 2024

Kuray107 / S4ND-U-Net_speech_enhancement

Python 27 3 Updated May 17, 2024

slp-rl / aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Python 193 26 Updated Jul 14, 2024

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,525 224 Updated Jul 8, 2024

wq2012 / VB_diarization

VB Diarization with Eigenvoice and HMM Priors, refactored

Python 14 3 Updated Jul 27, 2021

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,056 93 Updated Jul 11, 2024

liyunlongaaa / NSD-MS2S

CHIME-7 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 56 4 Updated May 17, 2024

rishikksh20 / hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Python 198 43 Updated Apr 8, 2021

microsoft / MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML 461 142 Updated Jul 1, 2024

yuguochencuc / BAE-Net

BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION

Python 50 1 Updated Jul 8, 2024

dmlguq456 / NeXt_TDNN_ASV

Official repository of NeXt-TDNN for speaker verification

Python 43 2 Updated Apr 6, 2024

subhadarship / kmeans_pytorch

kmeans using PyTorch

Jupyter Notebook 458 74 Updated May 9, 2023

LC044 / WeChatMsg

提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手

Python 31,944 3,336 Updated Jul 20, 2024

pythad / nider

Python package to add text to images, textures and different backgrounds

Python 149 20 Updated Jul 30, 2024

AkojimaSLP / Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming

speech-enhacement

Python 46 16 Updated Nov 5, 2019

f-dangel / unfoldNd

(N=1,2,3)-dimensional unfold (im2col) and fold (col2im) in PyTorch

Python 82 7 Updated Jun 14, 2024

01-ai / Yi

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,526 459 Updated Aug 3, 2024

BUTSpeechFIT / EEND_dataprep

Shell 44 7 Updated May 11, 2024

vivo-ai-lab / BlueLM

BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab

Python 817 54 Updated Apr 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cyril Lv IMYBo

Achievements

Achievements

Block or report IMYBo

Stars

haoheliu / SemantiCodec-inference

facebookresearch / ears_dataset

marianne-m / brouhaha-vad

DongKeon / Awesome-Speaker-Diarization

metame-ai / awesome-audio-plaza

JusperLee / S4M

PKU-YuanGroup / Open-Sora-Plan

hpcaitech / Open-Sora

NVIDIA / NeMo