Skip to content
View zxf-icpc's full-sized avatar

Block or report zxf-icpc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 461 27 Updated Aug 15, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,137 70 Updated Aug 13, 2024

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Python 70 6 Updated Jul 27, 2024

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 145 16 Updated Jul 25, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,747 954 Updated Aug 23, 2024

Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

Python 3,968 325 Updated Sep 16, 2024

Official Implementation of EnCLAP (ICASSP 2024)

Python 88 5 Updated Jun 2, 2024

A family of diffusion models for text-to-audio generation.

Python 1,000 79 Updated Jul 3, 2024
Python 131 11 Updated Jul 9, 2024

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 146 12 Updated Mar 25, 2024

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Jupyter Notebook 14 Updated May 23, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,132 95 Updated Aug 18, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,502 387 Updated Oct 10, 2024