
Organizations

@TJUIRLAB


ACoustic Llama (ACLlama) training code

Python 2 Updated Oct 25, 2024

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team

Python 4,920 518 Updated Oct 27, 2024

A curated list of resources for using LLMs to develop more competitive grant applications.

Python 2,488 331 Updated Mar 1, 2024

2D Game Animation in God Mode

Python 119 15 Updated Aug 19, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,878 271 Updated Oct 23, 2024

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…

TypeScript 43,593 9,795 Updated Oct 28, 2024

A Large Language Model Framework for Fast Web User Experience Deficiencies Detection

Python 5 Updated Oct 13, 2024
Python 735 21 Updated Oct 17, 2024
Python 440 44 Updated Oct 28, 2024
Python 159 11 Updated Sep 24, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,850 2,127 Updated Jul 18, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 672 27 Updated Sep 27, 2024

LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…

Python 40 4 Updated Oct 2, 2024

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Python 171 10 Updated Oct 12, 2024

Open-source multimodal large language model that can hear and talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,998 266 Updated Oct 16, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 962 42 Updated Oct 27, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,279 53 Updated Aug 15, 2024

LLMs Could Autonomously Learn Without External Supervision. (An Autonomous Learning Method)

Python 5 1 Updated Jun 14, 2024

An interpretable large language model (LLM) for medical diagnosis.

Python 85 1 Updated Sep 12, 2024

Scaling Diffusion Transformers with Mixture of Experts

Python 199 8 Updated Sep 9, 2024

PodGPT: A multilingual audio-augmented large language model for research and education

Python 25 1 Updated Sep 25, 2024

Large language model for ophthalmology consultations

Python 74 13 Updated Jul 16, 2024

The official repository for the paper Multilingual Mathematical Autoformalization

32 1 Updated May 20, 2024

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 476 33 Updated Sep 23, 2024

Kolors Team

Python 3,771 258 Updated Sep 4, 2024

Open source real-time translation app for Android that runs locally

C++ 6,715 506 Updated Sep 27, 2024

Recent research papers about Foundation Models for Combinatorial Optimization

135 11 Updated Oct 23, 2024
4 Updated Jun 5, 2024