zxf-icpc

2 followers · 0 following

Stars

lucidrains / ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch

Python 461 27 Updated Aug 15, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,137 70 Updated Aug 13, 2024

Sreyan88 / GAMA

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Python 70 6 Updated Jul 27, 2024

Stability-AI / stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 145 16 Updated Jul 25, 2024

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,747 954 Updated Aug 23, 2024

CrazyBoyM / llama3-Chinese-chat

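Llama3 and Llama3.1 Chinese repository (being written alongside a book... interesting weights from fine-tuned and heavily modified versions by community members and vendors, plus tutorial videos and docs on training, inference, evaluation, and deployment)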

Python 3,968 325 Updated Sep 16, 2024

jaeyeonkim99 / EnCLAP

Official Implementation of EnCLAP (ICASSP 2024)

Python 88 5 Updated Jun 2, 2024

declare-lab / tango

A family of diffusion models for text-to-audio generation.

Python 1,000 79 Updated Jul 3, 2024

thuhcsi / SECap

Python 131 11 Updated Jul 9, 2024

happylittlecat2333 / Auffusion

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 146 12 Updated Mar 25, 2024

ms-dot-k / TMT

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Jupyter Notebook 14 Updated May 23, 2024

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,132 95 Updated Aug 18, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,502 387 Updated Oct 10, 2024