Starred repositories
A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
DSPy: The framework for programming—not prompting—foundation models
Web-based tool converts GitHub repository contents into a single formatted text file
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
A programming framework for agentic AI 🤖
AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process: profile generation, web search, and report writing.
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Open-source vector similarity search for Postgres
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
RetinaFace: Deep Face Detection Library for Python
SoftVC VITS Singing Voice Conversion
Ergonomic and modular web framework built with Tokio, Tower, and Hyper
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Stream audio from your ESP32 to a computer