γEnglish | Chineseγ
AgentVerse offers a versatile framework that streamlines the process of creating custom multi-agent environments for large language models (LLMs). Designed to facilitate swift development and customization with minimal effort, our framework empowers researchers to concentrate on their research, rather than being bogged down by implementation details.
-
π₯³ Efficient Environment Building: Our framework provides a collection of essential building blocks for effortlessly creating a multi-agent environment. With only a few lines in a configuration file, you can easily construct basic environments such as a chat room for LLMs. This process entails defining the environment's settings and prompts for LLMs, enabling researchers like you to concentrate on experimentation and analysis.
-
βοΈ Customizable Components: AgentVerse simplifies the multi-agent environment by dividing it into five functional modules and defining their respective interfaces. For complex environments that cannot be constructed directly using the basic modules offered in AgentVerse, you can customize one or more of the interfaces within these five functional modules to efficiently create your own multi-agent environment according to your requirements.
-
π Tools (Plugins) Utilization: AgentVerse supports the multi-agent environments with tools. Currently, AgentVerse supports tools provided in BMTools.
- [2023/8/22] π We're excited to share our work-in-progress paper AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents related to this repository.
You could refer the stay-tuned code in this branch.
- [2023/6/5] π We are thrilled to present an array of demos, including NLP Classroom, Prisoner Dilemma, Software Design, Database Administrator, and a simple H5 Pokemon Game that enables the interaction with the characters in Pokemon! Try out these demos and have fun!
- [2023/5/1] π AgentVerse is officially launched!
AgentVerse is on a mission to revolutionize the multi-agent environment for large language models, and we're eagerly looking for passionate collaborators to join us on this exciting journey.
-
Code Development: If you're an engineer, help us refine, optimize, and expand the current framework. We're always looking for talented developers to enhance our existing features and develop new modules.
-
Documentation and Tutorials: If you have a knack for writing, help us improve our documentation, create tutorials, or write blog posts to make AgentVerse more accessible to the broader community.
-
Application Exploration: If you're intrigued by multi-agent applications and are eager to experiment using AgentVerse, we'd be thrilled to support your journey and see what you create!
-
Feedback and Suggestions: Use AgentVerse and provide us with feedback. Your insights can lead to potential improvements and ensure that our framework remains top-notch.
Also, if you're passionate about advancing the frontiers of multi-agent environments and are eager to dive deeper into research, we invite you to join our team at THUNLP. To explore this exciting opportunity and embark on a collaborative journey with us, please reach out to [email protected] and [email protected] and express your interest. We're keen to welcome motivated individuals like you to our lab!
- Code release of our paper
- Add documentation
- Support more sophisticated memory for conversation history
- Add support for local LLM
- Auto-generate UI based on the given multi-agent environment
We demonstrate the following cases that are expertly crafted by AgentVerse.
In the NLP class, the professor and students engage in interactive communication. When students have a question, they raise their hands and patiently wait for the professor to call on them. Only after being called on by the professor, can students speak and ask their questions.
Use the following command to launch the NLP Classroom example:
python main_demo.py --task nlp_classroom_9players
classroom.mp4
A prisoner's Dilemma is a thought experiment that challenges two completely rational agents to a dilemma: they can cooperate with their partner for mutual benefit or betray their partner ("defect") for individual reward.
Use the following command to launch the Prisoner Dilemma example:
python main_demo.py --task prisoner_dilemma
prison.mp4
In the Software Design example, a code writer, a code tester and a code reviewer collaborate on the code generation problem. Given a problem, the code writer first composes the code implementation. The code tester runs the unit tests and provides the feedback. The code viewer then generates a review. After collecting the test feedback and review, the code writer iteratively refines the code.
Use the following command to launch the Software Design example:
python main_demo.py --task sde_team/sde_team_2players
sde.mp4
In the database diagnosis scenario, the Chief DBA monitors the system anomalies (e.g., slow queries, locks, crash down). If detected, the domain experts are alerted to analyze root causes, share insights, and suggest optimization solutions together. The Chief DBA then provides a summarized report to the user.
python main_demo.py --task db_diag
db_diag.mp4
In the context of the text evaluation scenario, we recommend users explore the ChatEval repo. They've implemented a multi-agent referee team on AgentVerse to assess the quality of text generated by different models. When given two distinct pieces of text, roles within ChatEval can autonomously debate the nuances and disparities, drawing upon their assigned personas, and subsequently provide their judgments. Experiments indicate that their referee team, enriched with diverse roles specified in config.yaml, aligns more closely with human evaluations. This demo is built upon the Fastchat repo, and we'd like to express our appreciation for their foundational work.
chateval.mp4
In the game, agents can visit shops, train their PokΓ©mon at the gym, and interact with one another. As a player, you take on the role of an agent and can engage with others at any time. There are 6 characters in the PokΓ©mon environment who appeared in Pokemon Emerald: May, Professor Birch, Steven Stone, Maxie, Archie and Joseph.
To launch the Pokemon game, first launch a local server with the following command:
uvicorn pokemon_server:app --reload --port 10002
Then open another terminal in the project's root path and run the following command:
cd ui
# If you do not have npm installed, you need to install it before running the following commands
# https://docs.npmjs.com/downloading-and-installing-node-js-and-npm
# We have tested on [email protected], [email protected]
npm install
npm run watch
Wait for the compilation to complete, and have fun! (WASD for moving around, and SPACE for launching a conversation.)
pokemon.mp4
- β¨ Features
- π° What's New
- π Join Us!
- π Coming Soon
- πΎ Simple Demo Video
- Contents
- π Getting Started
- π‘ Philosophy
- βοΈ Customize Your Own Environment
- π Examples
- Star History
- Citation
- Contact
pip install -U agentverse
Or you can install the package by manually cloning the latest repository
git clone https://github.com/OpenBMB/AgentVerse.git --depth 1
cd AgentVerse
pip install -r requirements.txt
Some users have reported problems installing the orjson
required by gradio
. One simple workaround is to install it with Anaconda conda install -c conda-forge orjson
.
You also need to export your OpenAI API key as follows
# Export your OpenAI API key
export OPENAI_API_KEY="your_api_key_here"
If you want to use the tools provided by BMTools, you need to install BMTools as follows:
git clone git+https://github.com/OpenBMB/BMTools.git
cd BMTools
pip install -r requirements.txt
python setup.py develop
You can create a multi-agent environments provided by us. Using the classroom scenario as an example. In this scenario, there are nine agents, one playing the role of a professor and the other eight as students.
python3 main.py --task nlp_classroom_9players
We also provide a local website demo for this environment. You can launch it with
python3 main_demo.py --task nlp_classroom_9players
After successfully launching the local server, you can visit https://127.0.0.1:7860/ to view the classroom environment.
At the core of our framework is the environment, which plays a crucial role in enabling researchers to study the behavior of agents under different conditions. We believe that the environment should be flexible and extensible, allowing researchers to easily customize it to fit their needs. To achieve this, we have abstracted the environment into five rule components, and implementing different environments is actually implementing different rules:
- Describer: This component provides a description of the environment at each turn for each agent. You can customize the describer to define the specific requirements of their environment, such as the agents with whom an agent can interact.
- Order: This component defines the order in which agents take actions within the environment. You can customize the order to reflect the desired interaction between agents. We provide several basic order options, including
random
,sequential
, andconcurrent
(in which all agents take an action in each turn). - Selector: This component selects the valid messages generated by agents. Sometimes agents may generate invalid responses, and the selector is used to filter out unexpected results.
- Updater: This component updates the memory of each agent. In certain cases, the response generated by one agent should not be seen by all agents (e.g., if agents are in different rooms). For each response, the updater updates only the agents who can see it.
- Visibility: This component maintains the list of agents that each agent can see throughout the environment's changes. For example, when an agent moves from one room to another, the list of visible agents of each agent should be updated by
visibility
.
By abstracting the environment into these five components, we have created a highly flexible and extensible framework that enables researchers to easily build and customize their own multi-agent environments.
Another fundamental component is the agent. Currently we provide two types of agents: ConversationAgent and ToolAgent. You can also customize your own agent by inheriting BaseAgent class (tutorial coming soon).
We have provided several examples in the agentverse/tasks
directory. To customize your environment, you should
- Create a task directory in
agentverse/tasks
- Write the configuration file
- Write the output parser that parses the response of your agents.
- Add your parser in
agentverse/tasks/__init__.py
We will use a simple example in agentverse/tasks/nlp_classroom_3players
to illustrate the procedure.
To illustrate how to customize your environment, we'll use a simple example of building a classroom environment where one agent is the professor, one is the student, and one is the teaching assistant.
First, we need to create a task directory and write our configuration file for the environment. In the agentverse/tasks
directory, create a new directory called nlp_classroom_3players
. Inside this directory, create a config.yaml
file and write the following configuration:
# config.yaml
environment:
env_type: basic # Use the basic environment provided in AgentVerse
max_turns: 10 # Specify the maximum number of dialogue turns
rule:
order:
type: sequential # Use the sequential order
visibility:
type: all # Each message can be seen by all agents
selector:
type: basic # Basic selector (do not select)
updater:
type: basic # Basic updater (update the message to all agents)
describer:
type: basic # Basic describer (no description)
This configuration specifies that we will use the basic environment provided in AgentVerse, with a maximum of 10 dialogue turns. We'll use the sequential order, with all messages visible to all agents. We won't be using any selectors, our updater will update the messages to all the agents and our describer will provide no description.
Next, we'll configure the agents. In the config.yaml
file, we'll add the configuration for each agent. Here's an example configuration for the professor:
# config.yaml
agents:
-
agent_type: conversation
name: Professor Micheal # Name of the agent
role_description: You are Prof. Micheal, ... # Description of the agent
memory:
memory_type: chat_history # Will store all the chat history
prompt_template: *professor_prompt
llm:
llm_type: text-davinci-003 # Will use OpenAICompletion LLM
model: text-davinci-003 # The arguments passed to the api call
temperature: 0.7
max_tokens: 250
In this example, we'll use the conversation
agent type. We've given the agent a name and a description, and we'll store the chat history in memory. We've also provided a prompt template with placeholders marked as ${placeholder}. These will be instantiated by the _fill_prompt_template
method of the agent.
The next step is to write a simple parser for your agent's response. Because you may have specified the output format in your prompt template, you need to provide a corresponding parser. In this example, we inform the model to output in the following format in our prompt template
Action: Speak
Action Input: (the content)
We'll write a parser to extract the content from the agent's response. Refer to the code for more details. We've decorated our parser function with @output_parser_registry.register('classroom_parser')
to register it with our framework. Finally, we import our parser in agentverse/tasks/__init__.py
.
With these steps, we've successfully built a simple classroom environment and customized it for our needs.
While we provide a basic framework for building environments with our five rule components, more complex environments may require further customization. A detailed documentation and tutorial is coming soon. Here we briefly introduce some steps you can take to customize your environment:
- Customize the five rule components. Each rule component has an interface, allowing you to customize its behavior to suit your specific needs. It's important to note that these components are not necessarily independent and can interact through the
rule_params
dictionary in the environment. You can create your own rule components and integrate them with the existing ones to build more complex interactions between agents. - Customize the environment itself. Our
basic
environment provides a default execution order for the five rule components that is suitable for most cases, but you can inherit theBaseEnvironment
class and write your ownrun
method to implement a more sophisticated execution order. - Customize the agent. Depending on your specific use case, you may also need to inherit the
BaseAgent
class. For example, you may want to use your local LLM as your agents or create agents with specialized knowledge or skills.
Currently, we offer some simple examples in the agentverse/tasks
directory, each demonstrating different possibilities of our framework. While the performance of these examples may not be optimal due to limited prompt engineering, they are intended to showcase the capabilities of our framework, such as allowing the use of tools.
Here's a brief overview of each example:
nlp_classroom_3players
: This example illustrates a simple case in which agents will speak in sequential order.nlp_classroom_9players
: This is an NLP class example. Here, students can raise their hand when they have a question, and the professor can call on the students to let them ask. Students are only allowed to speak after they are called on.nlp_classroom_9players_group
: This example showcases group discussions. The professor may initiate a group discussion when needed, and students can exclusively interact with fellow students within the same group during the discussion.nlp_classroom_3players_withtool
: Students in this classroom can use Bing search API when listening to the class.math_problem_2players_tools
: A simple example demonstrating how two agents can use the WolframAlpha API to play an arithmetic game.prisoner_dilema
: The Prisoner's Dilemma is a thought experiment involving two rational agents facing a choice between cooperating for mutual benefit or betraying their partner for individual gain.db_diag
: The Chief DBA monitors (agents) the database system for anomalies and alerts memory and CPU agents if any are detected. They (agents) analyze root causes and suggest optimization solutions. The Chief DBA (agent) provides a diagnosis summary to the user, who can give instructions or evaluate the proposed solutions' effectiveness.sde_team
: In the SDE team, code writer, code tester and code reviewer collaborate on the code generation problem.pokemon
: This example intimates Pokemon game.
If you find this repo helpful, feel free to cite us.
@misc{chen2023agentverse,
title={AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents},
author={Weize Chen and Yusheng Su and Jingwei Zuo and Cheng Yang and Chenfei Yuan and Chen Qian and Chi-Min Chan and Yujia Qin and Yaxi Lu and Ruobing Xie and Zhiyuan Liu and Maosong Sun and Jie Zhou},
year={2023},
eprint={2308.10848},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Weize Chen: [email protected]