
The BamTwoogle dataset accompanies the paper "ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent" (https://arxiv.org/abs/2312.10003). It was written as a complementary, slightly more challenging sequel to the Bamboogle dataset, and it addresses some of the shortcomings of Bamboogle that we discovered while performing human evals for the paper.


google-research-datasets/BamTwoogle


BamTwoogle dataset

This repository contains the BamTwoogle dataset for the paper ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent.

Overview

BamTwoogle is a small, handcrafted collection of 100 information-seeking questions. It was written as a complementary, slightly more challenging sequel to the Bamboogle dataset.

Dataset Description

The topics and question formats vary, but in general, BamTwoogle adheres to the following guidelines.

Questions

  • Most require two searches or reasoning steps (as in Bamboogle), but some need three or four.
  • Must be manually checked to ensure the answer doesn’t appear on the first page of Google search results.

Expected answers

  • Should not be ambiguous.
  • Should not be prone to change over time, either due to the phrasing of the question or to the nature of the answer.
  • Should account for multiple versions of proper names, etc., where appropriate.
  • Should prefer Wikipedia as the source of truth for facts, with preference given to topics/articles not flagged for incompleteness, lack of sources, etc.
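
The guideline about accepting multiple versions of proper names can be illustrated with a minimal Python sketch of lenient answer matching. This is not the evaluation code used in the paper. The helper names `normalize` and `matches_any`, the normalization rules, and the example names are all assumptions made for illustration, and nothing here reflects the dataset's actual file format.

```python
import re
import string
import unicodedata


def normalize(text: str) -> str:
    """Lowercase, strip accents and ASCII punctuation, collapse whitespace."""
    # Decompose accented characters so the accents can be dropped.
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return re.sub(r"\s+", " ", text).strip()


def matches_any(prediction: str, accepted_answers: list[str]) -> bool:
    """Hypothetical check: True if the prediction equals any accepted variant."""
    target = normalize(prediction)
    return any(target == normalize(answer) for answer in accepted_answers)


# Invented example, not taken from the dataset: two accepted name variants.
accepted = ["Mark Twain", "Samuel Clemens"]
print(matches_any("mark twain.", accepted))
print(matches_any("Twain", accepted))
```

Exact matching after normalization is deliberately strict: a partial name such as a surname alone is rejected unless it is listed explicitly as an accepted variant.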

Citation

@misc{aksitov2023restmeetsreactselfimprovement,
      title={ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent}, 
      author={Renat Aksitov and Sobhan Miryoosefi and Zonglin Li and Daliang Li and Sheila Babayan and Kavya Kopparapu and Zachary Fisher and Ruiqi Guo and Sushant Prakash and Pranesh Srinivasan and Manzil Zaheer and Felix Yu and Sanjiv Kumar},
      year={2023},
      eprint={2312.10003},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2312.10003}, 
}
