Sparks of Artificial General Intelligence: Early experiments with GPT-4

Bubeck, Sébastien; Chandrasekaran, Varun; Eldan, Ronen; Gehrke, Johannes; Horvitz, Eric; Kamar, Ece; Lee, Peter; Lee, Yin Tat; Li, Yuanzhi; Lundberg, Scott; Nori, Harsha; Palangi, Hamid; Ribeiro, Marco Tulio; Zhang, Yi

Computer Science > Computation and Language

arXiv:2303.12712 (cs)

[Submitted on 22 Mar 2023 (v1), last revised 13 Apr 2023 (this version, v5)]

Title:Sparks of Artificial General Intelligence: Early experiments with GPT-4

Authors:Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

View PDF

Abstract:Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT-4 is part of a new cohort of LLMs (along with ChatGPT and Google's PaLM for example) that exhibit more general intelligence than previous AI models. We discuss the rising capabilities and implications of these models. We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and more, without needing any special prompting. Moreover, in all of these tasks, GPT-4's performance is strikingly close to human-level performance, and often vastly surpasses prior models such as ChatGPT. Given the breadth and depth of GPT-4's capabilities, we believe that it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system. In our exploration of GPT-4, we put special emphasis on discovering its limitations, and we discuss the challenges ahead for advancing towards deeper and more comprehensive versions of AGI, including the possible need for pursuing a new paradigm that moves beyond next-word prediction. We conclude with reflections on societal influences of the recent technological leap and future research directions.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2303.12712 [cs.CL]
	(or arXiv:2303.12712v5 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2303.12712

Submission history

From: Sebastien Bubeck [view email]
[v1] Wed, 22 Mar 2023 16:51:28 UTC (13,667 KB)
[v2] Fri, 24 Mar 2023 17:07:43 UTC (6,453 KB)
[v3] Mon, 27 Mar 2023 22:36:40 UTC (6,470 KB)
[v4] Wed, 12 Apr 2023 17:00:10 UTC (12,943 KB)
[v5] Thu, 13 Apr 2023 20:41:31 UTC (6,476 KB)

Computer Science > Computation and Language

Title:Sparks of Artificial General Intelligence: Early experiments with GPT-4

Submission history

Access Paper:

References & Citations

10 blog links

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Sparks of Artificial General Intelligence: Early experiments with GPT-4

Submission history

Access Paper:

References & Citations

10 blog links

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators