Lists (8)
Sort Name descending (Z-A)
Starred repositories
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.
An autoagentic AGI that is self-evolving and modular.
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Vim-fork focused on extensibility and usability
Open-source Next.js template for building apps that are fully generated by AI. By E2B.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory".
Source code and demo for memory bank and SiliconFriend
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
ProAgent: Building Proactive Cooperative Agents with Large Language Models
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…
An open-source LLM based automatically daily news collecting workflow showcase powered by Agently AI application development framework.
Web UI for AutoGen (A Framework Multi-Agent LLM Applications)
Run MemGPT-AutoGEN-Local LLM Together
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Enjoy the magic of Diffusion models!
A WhatsApp client library for NodeJS that connects through the WhatsApp Web browser app