Mickey-Stone

Mickey Mickey-Stone

5 followers · 5 following

Achievements

Stars

xfyun / webapi-demo

Use for saving demo of web-api

C 11 13 Updated May 20, 2022

YooLiuXiao / FDRL-MHAD

Dataset

2 Updated Oct 11, 2024

facebookresearch / cc_net

Tools to download and cleanup Common Crawl data

Python 971 142 Updated Apr 25, 2023

alibaba / arthas

Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas

Java 35,677 7,500 Updated Nov 14, 2024

kyutai-labs / moshi

Python 6,741 523 Updated Oct 31, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,558 172 Updated Nov 14, 2024

facebookresearch / UnsupervisedMT

Phrase-Based & Neural Unsupervised Machine Translation

Python 1,506 262 Updated Sep 15, 2021

formiel / fairseq

Forked from facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 19 Updated Aug 1, 2024

ReneeYe / ConST

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Python 62 6 Updated May 25, 2022

MooreThreads / MooER

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not …

Python 160 11 Updated Nov 5, 2024

backspacetg / simul_whisper

Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection

Python 45 4 Updated Nov 13, 2024

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,078 252 Updated Nov 15, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 6,300 674 Updated Nov 15, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 3,433 311 Updated Oct 18, 2024

Tele-AI / TeleSpeech-ASR

Python 542 47 Updated Jun 7, 2024

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 576 52 Updated Nov 15, 2024

ga642381 / SpeechGen

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

74 5 Updated Jun 9, 2023

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,932 1,058 Updated Nov 14, 2024

voidful / asrp

ASR text preprocessing utility

Python 20 5 Updated Aug 5, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 34,379 4,239 Updated Nov 16, 2024

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 7,916 870 Updated Nov 16, 2024

HumanAIGC / EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,500 913 Updated Aug 21, 2024

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,483 107 Updated Jul 5, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,645 596 Updated Nov 11, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…

Python 4,248 376 Updated Nov 16, 2024

microsoft / TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Python 371 37 Updated Oct 11, 2024

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 4,977 314 Updated Oct 18, 2023

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,361 425 Updated Nov 13, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,182 2,550 Updated Nov 9, 2024

huawei-noah / bolt

Bolt is a deep learning library with high performance and heterogeneous flexibility.

C++ 917 159 Updated Jul 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mickey Mickey-Stone

Achievements

Achievements

Block or report Mickey-Stone

Stars

xfyun / webapi-demo

YooLiuXiao / FDRL-MHAD

facebookresearch / cc_net

alibaba / arthas

kyutai-labs / moshi

ictnlp / LLaMA-Omni

facebookresearch / UnsupervisedMT

formiel / fairseq

ReneeYe / ConST

MooreThreads / MooER

backspacetg / simul_whisper

ufal / whisper_streaming

FunAudioLLM / CosyVoice

FunAudioLLM / SenseVoice

Tele-AI / TeleSpeech-ASR

X-LANCE / SLAM-LLM

ga642381 / SpeechGen

facebookresearch / seamless_communication

voidful / asrp

hiyouga / LLaMA-Factory

axolotl-ai-cloud / axolotl

HumanAIGC / EMO

QwenLM / Qwen-Audio

QwenLM / Qwen2.5

modelscope / ms-swift

microsoft / TransformerCompression

snakers4 / silero-models

snakers4 / silero-vad

microsoft / unilm

huawei-noah / bolt