Skip to content
View lvzhiqiang's full-sized avatar

Block or report lvzhiqiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Train transformer language models with reinforcement learning.

Python 9,715 1,221 Updated Oct 12, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 308 29 Updated Oct 11, 2024

On-device AI across mobile, embedded and edge for PyTorch

C++ 1,918 312 Updated Oct 12, 2024

Implementation of the proposed minGRU in Pytorch

Python 144 7 Updated Oct 6, 2024

The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 370 28 Updated Oct 6, 2024

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 107 9 Updated Oct 12, 2024

Malfunctioning Industrial Machine Investigation and Inspection

1 Updated Sep 26, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 283 14 Updated Sep 25, 2024

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 353 39 Updated Sep 25, 2024
Python 6,220 466 Updated Oct 11, 2024

Text-to-Music Generation with Rectified Flow Transformers

Python 1,553 120 Updated Sep 6, 2024

nanobind: tiny and efficient C++/Python bindings

C++ 2,314 193 Updated Oct 8, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,815 256 Updated Sep 25, 2024

Papers for LLM and foundation models for time series analytics

12 Updated Sep 30, 2024

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Python 216 11 Updated Aug 26, 2024

Inference and training library for high-quality TTS models.

Python 4,346 440 Updated Sep 23, 2024

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 415 16 Updated Sep 25, 2024

Turns Data and AI algorithms into production-ready web applications in no time.

Python 13,509 1,483 Updated Oct 12, 2024

Realtime Web Apps and Dashboards for Python and R

Python 3,991 326 Updated Oct 3, 2024

LLM101n: Let's build a Storyteller

29,300 1,602 Updated Aug 1, 2024

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python 1,185 149 Updated Oct 8, 2024

The open source code for LLM-Codec

Python 112 4 Updated Aug 18, 2024

uSherpaServer 给Unity提供流式语音识别的websocket服务

C# 3 Updated Jun 25, 2024

Transform datasets at scale. Optimize datasets for fast AI model training.

Python 340 39 Updated Oct 12, 2024

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,564 573 Updated Jul 2, 2024

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 122 10 Updated Jul 25, 2024

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 17,834 2,170 Updated Feb 4, 2024

A generative speech model for daily dialogue.

Python 31,424 3,412 Updated Oct 10, 2024

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 13,919 1,336 Updated Oct 11, 2024
Next