Data Interpreter: An LLM Agent For Data Science

Hong, Sirui; Lin, Yizhang; Liu, Bangbang; Wu, Binhao; Li, Danyang; Chen, Jiaqi; Zhang, Jiayi; Wang, Jinlin; Zhang, Lingyao; Zhuge, Mingchen; Guo, Taicheng; Zhou, Tuo; Tao, Wei; Wang, Wenyi; Tang, Xiangru; Lu, Xiangtao; Liang, Xinbing; Fei, Yaying; Cheng, Yuheng; Xu, Zongze; Wu, Chenglin; Zhang, Li; Yang, Min; Zheng, Xiawu

Computer Science > Artificial Intelligence

arXiv:2402.18679v1 (cs)

[Submitted on 28 Feb 2024 (this version), latest version 12 Mar 2024 (v3)]

Title:Data Interpreter: An LLM Agent For Data Science

Abstract:Large Language Model (LLM)-based agents have demonstrated remarkable effectiveness. However, their performance can be compromised in data science scenarios that require real-time data adjustment, expertise in optimization due to complex dependencies among various tasks, and the ability to identify logical errors for precise reasoning. In this study, we introduce the Data Interpreter, a solution designed to solve with code that emphasizes three pivotal techniques to augment problem-solving in data science: 1) dynamic planning with hierarchical graph structures for real-time data adaptability;2) tool integration dynamically to enhance code proficiency during execution, enriching the requisite expertise;3) logical inconsistency identification in feedback, and efficiency enhancement through experience recording. We evaluate the Data Interpreter on various data science and real-world tasks. Compared to open-source baselines, it demonstrated superior performance, exhibiting significant improvements in machine learning tasks, increasing from 0.86 to 0.95. Additionally, it showed a 26% increase in the MATH dataset and a remarkable 112% improvement in open-ended tasks. The solution will be released at this https URL.

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2402.18679 [cs.AI]
	(or arXiv:2402.18679v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2402.18679

Submission history

From: Sirui Hong [view email]
[v1] Wed, 28 Feb 2024 19:49:55 UTC (34,982 KB)
[v2] Mon, 4 Mar 2024 18:58:37 UTC (36,808 KB)
[v3] Tue, 12 Mar 2024 17:26:53 UTC (12,998 KB)

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Artificial Intelligence

Title:Data Interpreter: An LLM Agent For Data Science

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Artificial Intelligence

Title:Data Interpreter: An LLM Agent For Data Science

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators