Dealing with time #1

Open
pbarker opened this issue May 11, 2024 · 0 comments
Labels
bug Something isn't working

pbarker commented May 11, 2024

In the current implementation, some actions take time to complete and the agent doesn't know how to reason about that. The canonical example is clicking the Chrome icon and Chrome taking a while to boot. The agent then thinks it needs to click the icon again, but by the time it has zoomed in, Chrome has loaded and it often clicks in the wrong location.

I have tried giving the agent the ability to bail out at each zoom step, but this has unintended consequences: the model often chooses to bail out when it shouldn't.

Some options are:

  • Have a secondary manager agent that keeps watching and can readjust the worker agent. This may be pricey and introduce race conditions.
  • Add a reflection step after every zoom, but this is costly and slow.
  • Find a way to incorporate time into the prompts.
  • Tune a model to be better at bailing out.
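
As a rough illustration of the third option, here is a minimal sketch of how elapsed time could be surfaced in the prompt. Everything in it is an assumption for discussion, not code from this project: `ActionRecord`, `format_time_context`, and the wording of the hint are hypothetical names and text, and the timestamps are assumed to come from something like `time.monotonic()`.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ActionRecord:
    """Hypothetical record of one action the agent took."""
    description: str
    timestamp: float  # seconds, e.g. a time.monotonic() reading


def format_time_context(
    history: List[ActionRecord], now: float, max_items: int = 3
) -> str:
    """Build a text snippet telling the model how long ago each recent action ran.

    The snippet is meant to be prepended to the prompt that accompanies a
    screenshot, so the model can tell "nothing happened" apart from
    "the app may still be loading".
    """
    if not history:
        return "No actions have been taken yet."

    lines = []
    # Only the most recent actions are listed to keep the prompt short.
    for record in history[-max_items:]:
        elapsed = now - record.timestamp
        lines.append(f"- {elapsed:.1f}s ago: {record.description}")

    last_elapsed = now - history[-1].timestamp
    lines.append(
        f"This screenshot was captured {last_elapsed:.1f}s after the last action. "
        "If the screen looks unchanged, the action may still be in progress."
    )
    return "\n".join(lines)


if __name__ == "__main__":
    history = [ActionRecord("clicked the Chrome icon", 10.0)]
    print(format_time_context(history, now=12.5))
```

Whether a hint like this actually stops the model from re-clicking is untested; it only gives the model the information it is currently missing.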
pbarker added the bug label May 11, 2024