Stars
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Multilingual Voice Understanding Model
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.
StockBot powered by Groq: Lightning Fast AI Chatbot that Responds With Live Interactive Stock Charts, Financials, News, Screeners, and More. Powered by Llama3-70b on Groq, Vercel AI SDK, and Tradin…
This Repo is the official implementation of AgentCoder and AgentCoder+.
StoryMaker: Towards consistent characters in text-to-image generation
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
Chat first code editor. To download the packaged app:
Rapid Exploration with Multiple Unmanned Aerial Vehicles (UAV)
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Robust Speech Recognition via Large-Scale Weak Supervision
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Tools for merging pretrained large language models.
Run Mixtral-8x7B models in Colab or consumer desktops
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A programming framework for agentic AI 🤖
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
SEED-Story: Multimodal Long Story Generation with Large Language Model
Policy Search for Model Predictive Control with Application to Agile Drone Flight
[ICRA'24 Best UAV Paper Award Finalist] An Efficient Gloabl Planner for Aerial Coverage
H2-Mapping: Real-time Dense Mapping Using Hierarchical Hybrid Representation (2023 RAL Best Paper Award)
UAV swarm, Cooperative search
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"