Skip to content
View Zth9730's full-sized avatar
🥬
Ataraxy
🥬
Ataraxy
  • Computer of Science and Technology Beijing

Highlights

  • Pro
Block or Report

Block or report Zth9730

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Python 107 15 Updated Jul 7, 2024

Reference-aware automatic speech evaluation toolkit

Python 80 5 Updated Feb 22, 2024

Multilingual Voice Understanding Model

Python 1,390 121 Updated Jul 17, 2024

LLM全栈优质资源汇总

Shell 261 28 Updated Jun 2, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,748 333 Updated Jul 17, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,281 210 Updated Mar 20, 2024

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 535 77 Updated Dec 27, 2023

Ongoing research training transformer models at scale

Python 9,420 2,118 Updated Jul 16, 2024

Audio Codec Speech processing Universal PERformance Benchmark

Python 184 22 Updated Jun 19, 2024

Awesome Papers related to Mamba.

980 50 Updated Jul 8, 2024

A generative speech model for daily dialogue.

Python 27,860 3,022 Updated Jul 16, 2024

Official code for Wav2Seq

Python 93 11 Updated Jul 19, 2022

Speech, Language, Audio, Music Processing with Large Language Model

Python 409 33 Updated Jul 3, 2024

Voice Face Association Learning Paper List

13 1 Updated May 20, 2023

Finetune VITS and MMS using HuggingFace's tools

Python 98 21 Updated Mar 31, 2024

FaRL for Facial Representation Learning [Official, CVPR 2022]

Python 351 21 Updated Jun 9, 2023

The official Meta Llama 3 GitHub site

Python 23,292 2,493 Updated Jul 17, 2024

NeMo text processing for ASR and TTS

Python 246 80 Updated Jul 17, 2024

A curated list of papers in Test-time Adaptation, Test-time Training and Source-free Domain Adaptation

443 42 Updated Jun 23, 2024

base_espnet

Shell 3 1 Updated Jul 10, 2023

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

83 12 Updated Jul 15, 2024

欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。

Jupyter Notebook 224 27 Updated Apr 10, 2024

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 219 14 Updated Mar 14, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,278 365 Updated Jul 15, 2024

Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch

Python 257 23 Updated Jun 17, 2024

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 578 46 Updated Sep 13, 2023

A collection of AWESOME things about mixture-of-experts

845 62 Updated Jun 25, 2024

BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing

Python 39 8 Updated Mar 11, 2024
Next