Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Liu, Evan Zheran; Guu, Kelvin; Pasupat, Panupong; Shi, Tianlin; Liang, Percy

Computer Science > Artificial Intelligence

arXiv:1802.08802 (cs)

[Submitted on 24 Feb 2018]

Title:Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Authors:Evan Zheran Liu, Kelvin Guu, Panupong Pasupat, Tianlin Shi, Percy Liang

View PDF

Abstract:Reinforcement learning (RL) agents improve through trial-and-error, but when reward is sparse and the agent cannot discover successful action sequences, learning stagnates. This has been a notable problem in training deep RL agents to perform web-based tasks, such as booking flights or replying to emails, where a single mistake can ruin the entire sequence of actions. A common remedy is to "warm-start" the agent by pre-training it to mimic expert demonstrations, but this is prone to overfitting. Instead, we propose to constrain exploration using demonstrations. From each demonstration, we induce high-level "workflows" which constrain the allowable actions at each time step to be similar to those in the demonstration (e.g., "Step 1: click on a textbox; Step 2: enter some text"). Our exploration policy then learns to identify successful workflows and samples actions that satisfy these workflows. Workflows prune out bad exploration directions and accelerate the agent's ability to discover rewards. We use our approach to train a novel neural policy designed to handle the semi-structured nature of websites, and evaluate on a suite of web tasks, including the recent World of Bits benchmark. We achieve new state-of-the-art results, and show that workflow-guided exploration improves sample efficiency over behavioral cloning by more than 100x.

Comments:	International Conference on Learning Representations (ICLR), 2018
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1802.08802 [cs.AI]
	(or arXiv:1802.08802v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1802.08802

Submission history

From: Evan Liu [view email]
[v1] Sat, 24 Feb 2018 05:32:47 UTC (362 KB)

Computer Science > Artificial Intelligence

Title:Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators