TianHao Zhang Zth9730

🥬

Ataraxy

University of Science and Technology Beijing

4 followers · 13 following

Computer of Science and Technology Beijing

Stars

showlab / videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Python · 107 stars · 15 forks · Updated Jul 7, 2024

Takaaki-Saeki / DiscreteSpeechMetrics

Reference-aware automatic speech evaluation toolkit

Python · 80 stars · 5 forks · Updated Feb 22, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python · 1,390 stars · 121 forks · Updated Jul 17, 2024

liguodongiot / llm-resource

A curated collection of high-quality full-stack LLM resources

Shell · 261 stars · 28 forks · Updated Jun 2, 2024

microsoft / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python · 1,748 stars · 333 forks · Updated Jul 17, 2024

bigscience-workshop / Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python · 1,281 stars · 210 forks · Updated Mar 20, 2024

yangdongchao / AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python · 535 stars · 77 forks · Updated Dec 27, 2023

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python · 9,420 stars · 2,118 forks · Updated Jul 16, 2024

voidful / Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Python · 184 stars · 22 forks · Updated Jun 19, 2024

yyyujintang / Awesome-Mamba-Papers

Awesome Papers related to Mamba.

980 stars · 50 forks · Updated Jul 8, 2024

multimodal-art-projection / MAP-NEO

Python · 757 stars · 72 forks · Updated Jun 21, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python · 27,860 stars · 3,022 forks · Updated Jul 16, 2024

asappresearch / wav2seq

Official code for Wav2Seq

Python · 93 stars · 11 forks · Updated Jul 19, 2022

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python · 409 stars · 33 forks · Updated Jul 3, 2024

my-yy / vfal_papers

Voice Face Association Learning Paper List

13 stars · 1 fork · Updated May 20, 2023

ylacombe / finetune-hf-vits

Finetune VITS and MMS using HuggingFace's tools

Python · 98 stars · 21 forks · Updated Mar 31, 2024

FacePerceiver / FaRL

FaRL for Facial Representation Learning [Official, CVPR 2022]

Python · 351 stars · 21 forks · Updated Jun 9, 2023

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python · 23,292 stars · 2,493 forks · Updated Jul 17, 2024

NVIDIA / NeMo-text-processing

NeMo text processing for ASR and TTS

Python · 246 stars · 80 forks · Updated Jul 17, 2024

YuejiangLIU / awesome-source-free-test-time-adaptation

A curated list of papers in Test-time Adaptation, Test-time Training and Source-free Domain Adaptation

443 stars · 42 forks · Updated Jun 23, 2024

bytedance / music_source_separation

Python · 1,238 stars · 192 forks · Updated Apr 18, 2024

GabrielHaoHao / Interformer_espnet

base_espnet

Shell · 3 stars · 1 fork · Updated Jul 10, 2023

HumanAIGC / MaTe3D

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

83 stars · 12 forks · Updated Jul 15, 2024

Glanvery / LLM-Travel

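Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the technologies, principles, and applications related to large models.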

Jupyter Notebook · 224 stars · 27 forks · Updated Apr 10, 2024

TXH-mercury / VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook · 219 stars · 14 forks · Updated Mar 14, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python · 4,278 stars · 365 forks · Updated Jul 15, 2024

lucidrains / st-moe-pytorch

Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch

Python · 257 stars · 23 forks · Updated Jun 17, 2024

lucidrains / mixture-of-experts

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python · 578 stars · 46 forks · Updated Sep 13, 2023

XueFuzhao / awesome-mixture-of-experts

A collection of AWESOME things about mixture-of-experts

845 stars · 62 forks · Updated Jun 25, 2024

cwang621 / blsp

BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing

Python · 39 stars · 8 forks · Updated Mar 11, 2024