Stars
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Rembg is a tool to remove images background
AirLLM 70B inference with single 4GB GPU
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Port of OpenAI's Whisper model in C/C++
Repository for the Paper "Multi-LoRA Composition for Image Generation"
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Zero-Shot Speech Editing and Text-to-Speech in the Wild
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
Telegram clone is a web site-based two-way real-time chat communication application.
A new one shot face swap approach for image and video domains
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.
Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.
A lightweight framework for building LLM-based agents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
cicimmmmm / GITM
Forked from OpenGVLab/GITMGhost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Humanoid Agents: Platform for Simulating Human-like Generative Agents
A programming framework for agentic AI 🤖