GitHub - showlab/GUI-Narrator: Repository of GUI Action Narrator

GUI Action Narrator: Where and When Did That Action Take Place?

Qinchen Wu, Difei Gao, Kevin Qinghong Lin, Zhuoyu Wu, Xiangwu Guo, Peiran Li, Weichen Zhang, Hengxu Wang, Mike Zheng Shou

🤖: Introduction

We introduce GUI action dataset Act2Cap as well as an effective framework: GUI Narrator for GUI video captioning that utilizes the cursor as a visual prompt to enhance the interpretation of high-resolution screenshots.

📑: Events

We release our paper on Arxiv.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
assets		assets
static		static
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GUI Action Narrator: Where and When Did That Action Take Place?

🤖: Introduction

📑: Events

About

Releases

Packages

Languages

showlab/GUI-Narrator

Folders and files

Latest commit

History

Repository files navigation

GUI Action Narrator: Where and When Did That Action Take Place?

🤖: Introduction

📑: Events

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages