Evaluating Tool-Augmented Agents in Remote Sensing Platforms

Singh, Simranjit; Fore, Michael; Stamoulis, Dimitrios

arXiv:2405.00709 (cs)

[Submitted on 23 Apr 2024]

Abstract: Tool-augmented Large Language Models (LLMs) have shown impressive capabilities in remote sensing (RS) applications. However, existing benchmarks assume question-answering input templates over predefined image-text data pairs. These standalone instructions neglect the intricacies of realistic user-grounded tasks. Consider a geospatial analyst: they zoom in a map area, they draw a region over which to collect satellite imagery, and they succinctly ask "Detect all objects here". Where is `here`, if it is not explicitly hardcoded in the image-text template, but instead is implied by the system state, e.g., the live map positioning? To bridge this gap, we present GeoLLM-QA, a benchmark designed to capture long sequences of verbal, visual, and click-based actions on a real UI platform. Through in-depth evaluation of state-of-the-art LLMs over a diverse set of 1,000 tasks, we offer insights towards stronger agents for RS applications.

Comments:	ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2405.00709 [cs.CL]
	(or arXiv:2405.00709v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.00709

Submission history

From: Simranjit Singh
[v1] Tue, 23 Apr 2024 20:37:24 UTC (10,269 KB)
