Skip to content
View Mickey-Stone's full-sized avatar

Block or report Mickey-Stone

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Use for saving demo of web-api

C 11 13 Updated May 20, 2022

Dataset

2 Updated Oct 11, 2024

Tools to download and cleanup Common Crawl data

Python 971 142 Updated Apr 25, 2023

Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas

Java 35,677 7,500 Updated Nov 14, 2024
Python 6,741 523 Updated Oct 31, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,558 172 Updated Nov 14, 2024

Phrase-Based & Neural Unsupervised Machine Translation

Python 1,506 262 Updated Sep 15, 2021

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 19 Updated Aug 1, 2024

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Python 62 6 Updated May 25, 2022

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not …

Python 160 11 Updated Nov 5, 2024

Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection

Python 45 4 Updated Nov 13, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,078 252 Updated Nov 15, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 6,300 674 Updated Nov 15, 2024

Multilingual Voice Understanding Model

Python 3,433 311 Updated Oct 18, 2024
Python 542 47 Updated Jun 7, 2024

Speech, Language, Audio, Music Processing with Large Language Model

Python 576 52 Updated Nov 15, 2024

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

74 5 Updated Jun 9, 2023

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,932 1,058 Updated Nov 14, 2024

ASR text preprocessing utility

Python 20 5 Updated Aug 5, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 34,379 4,239 Updated Nov 16, 2024

Go ahead and axolotl questions

Python 7,916 870 Updated Nov 16, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,500 913 Updated Aug 21, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,483 107 Updated Jul 5, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,645 596 Updated Nov 11, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…

Python 4,248 376 Updated Nov 16, 2024

For releasing code related to compression methods for transformers, accompanying our publications

Python 371 37 Updated Oct 11, 2024

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 4,977 314 Updated Oct 18, 2023

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,361 425 Updated Nov 13, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,182 2,550 Updated Nov 9, 2024

Bolt is a deep learning library with high performance and heterogeneous flexibility.

C++ 917 159 Updated Jul 30, 2024
Next