DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Li, Chengpeng; Dong, Guanting; Xue, Mingfeng; Peng, Ru; Wang, Xiang; Liu, Dayiheng

Computer Science > Computation and Language

arXiv:2407.04078 (cs)

[Submitted on 4 Jul 2024 (v1), last revised 17 Jul 2024 (this version, v3)]

Title:DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Authors:Chengpeng Li, Guanting Dong, Mingfeng Xue, Ru Peng, Xiang Wang, Dayiheng Liu

View PDF

Abstract:Large language models (LLMs) have made impressive progress in handling simple math problems, yet they still struggle with more challenging and complex mathematical tasks. In this paper, we introduce a series of LLMs that employs the Decomposition of thought with code assistance and self-correction for mathematical reasoning, dubbed as DotaMath. DotaMath models tackle complex mathematical tasks by decomposing them into simpler logical subtasks, leveraging code to solve these subtasks, obtaining fine-grained feedback from the code interpreter, and engaging in self-reflection and correction. By annotating diverse interactive tool-use trajectories and employing query evolution on GSM8K and MATH datasets, we generate an instruction fine-tuning dataset called DotaMathQA with 574K query-response pairs. We train a series of base LLMs using imitation learning on DotaMathQA, resulting in DotaMath models that achieve remarkable performance compared to open-source LLMs across various in-domain and out-of-domain benchmarks. Notably, DotaMath-deepseek-7B showcases an outstanding performance of 64.8% on the competitive MATH dataset and 86.7% on GSM8K. Besides, DotaMath-deepseek-7B maintains strong competitiveness on a series of in-domain and out-of-domain benchmarks (Avg. 80.1%). Looking forward, we anticipate that the DotaMath paradigm will open new pathways for addressing intricate mathematical problems. Our code is publicly available at this https URL.

Comments:	Work in progress
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2407.04078 [cs.CL]
	(or arXiv:2407.04078v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.04078

Submission history

From: Guanting Dong [view email]
[v1] Thu, 4 Jul 2024 17:39:16 UTC (11,485 KB)
[v2] Tue, 9 Jul 2024 15:29:03 UTC (11,487 KB)
[v3] Wed, 17 Jul 2024 13:13:05 UTC (11,516 KB)

Computer Science > Computation and Language

Title:DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators