Dealing with time #1

pbarker · 2024-05-11T14:26:22Z

In the current implementation, sometimes actions take time and the agent doesn't know how to reason about. The canonical example is clicking on the chrome icon and it taking time to boot. The agent then thinks it need to click on it again, but by the time it zooms chrome has loaded and it often clicks in the wrong location.

I have tried giving the agent the ability to bail out on each zoom step but this can cause unintended consequences as the model often will choose to bail out when it shouldn't.

Some options are:

Have a secondary manager agent which continues to watch and can readjust the worker agent. This may be pricey and have race conditions
Having a reflection step after every zoom, but this is costly and slow
Find a way to incorporate time into the prompts
Tune a model to be better at bailing out

pbarker added the bug Something isn't working label May 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dealing with time #1

Dealing with time #1

pbarker commented May 11, 2024

Dealing with time #1

Dealing with time #1

Comments

pbarker commented May 11, 2024