-
The Chinese University of Hong Kong, Shenzhen
- Shenzhen, China
- wabyking.github.io/old.html
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
A curated list of resources for using LLMs to develop more competitive grant applications.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…
An Large Language Model Framework for Fast Web User Experience Deficiencies Detection
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
LLMs Could Autonomously Learn Without External Supervision. (An Autonomous Learning Method)
An interpretable large language model (LLM) for medical diagnosis.
Scaling Diffusion Transformers with Mixture of Experts
PodGPT: A multilingual audio-augmented large language model for research and education
The official repository for the paper Multilingual Mathematical Autoformalization
Pandora: Towards General World Model with Natural Language Actions and Video States
Open source real-time translation app for Android that runs locally
Recent research papers about Foundation Models for Combinatorial Optimization