Latent State Estimation Helps UI Agents to Reason

Bishop, William E; Li, Alice; Rawles, Christopher; Riva, Oriana

arXiv:2405.11120 (cs)

[Submitted on 17 May 2024]

Abstract: A common problem for agents operating in real-world environments is that the response of an environment to their actions may be non-deterministic and observed through noise. This renders environmental state and progress towards completing a task latent. Despite recent impressive demonstrations of LLMs' reasoning abilities on various benchmarks, whether LLMs can build estimates of latent state and leverage them for reasoning has not been explicitly studied. We investigate this problem in the real-world domain of autonomous UI agents. We establish that appropriately prompting LLMs in a zero-shot manner can be formally understood as forming point estimates of latent state in a textual space. In the context of autonomous UI agents, we then show that LLMs used in this manner are more than $76\%$ accurate at inferring various aspects of latent state, such as performed (vs. commanded) actions and task progression. Using both public and internal benchmarks and three reasoning methods (zero-shot, CoT-SC, and ReAct), we show that LLM-powered agents that explicitly estimate and reason about latent state are able to successfully complete up to 1.6x more tasks than those that do not.
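The two-stage idea in the abstract (first estimate latent state as text, then reason over those estimates) might be sketched as follows. This is an illustrative sketch only, not the authors' implementation: the question wording, the function names, the prompt layout, and the `fake_llm` stand-in are all assumptions, since the abstract does not give the actual prompts or model interface.

```python
from typing import Callable, Dict

# Hypothetical latent-state questions; the paper's real prompts are not
# given in the abstract, so these are placeholders for illustration.
LATENT_QUESTIONS: Dict[str, str] = {
    "performed_action": "Which action actually took effect on the screen?",
    "task_progress": "How far along is the task?",
}


def estimate_latent_state(
    llm: Callable[[str], str],
    goal: str,
    commanded_action: str,
    screen_before: str,
    screen_after: str,
) -> Dict[str, str]:
    """Ask the LLM one zero-shot question per latent variable.

    Each answer is treated as a textual point estimate of that variable.
    """
    context = (
        f"Goal: {goal}\n"
        f"Commanded action: {commanded_action}\n"
        f"Screen before: {screen_before}\n"
        f"Screen after: {screen_after}"
    )
    return {
        name: llm(f"{context}\nQuestion: {question}\nAnswer:").strip()
        for name, question in LATENT_QUESTIONS.items()
    }


def build_action_prompt(goal: str, estimates: Dict[str, str], screen_after: str) -> str:
    """Put the estimates into the prompt used to choose the next action."""
    lines = [f"Goal: {goal}", "Estimated latent state:"]
    lines += [f"- {name}: {value}" for name, value in estimates.items()]
    lines += [f"Current screen: {screen_after}", "Next action:"]
    return "\n".join(lines)


def fake_llm(prompt: str) -> str:
    """Canned stand-in for a real model call (an assumption for this sketch)."""
    if "Which action" in prompt:
        return " none (the tap did not register) "
    return "step 1 of 3 still pending"


estimates = estimate_latent_state(
    fake_llm,
    goal="Turn on Wi-Fi",
    commanded_action="tap(Wi-Fi toggle)",
    screen_before="Settings: Wi-Fi off",
    screen_after="Settings: Wi-Fi off",
)
prompt = build_action_prompt("Turn on Wi-Fi", estimates, "Settings: Wi-Fi off")
print(prompt)
```

In this toy run the commanded tap had no visible effect, so the estimate of the performed action differs from the commanded one. That gap between commanded and performed actions is the kind of latent state the abstract says agents benefit from reasoning about explicitly.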

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2405.11120 [cs.AI]
	(or arXiv:2405.11120v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2405.11120

Submission history

From: William Bishop
[v1] Fri, 17 May 2024 23:27:33 UTC (3,474 KB)
