Skip to content

Globot is an agent that controls your browser using playwright and GPT-4V.

License

Notifications You must be signed in to change notification settings

go-xoxo/globot

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

globot

With GPT-4V, we can finally complete the original vision of natbot.

Help solve general agents by contributing to this repo!

Ideas for Improvement

  • Scrolling (easy to add, but likely to cause divergence)
  • Better context management (learning from mistakes, more descriptive history)
  • Masking the image with node IDs
  • Better DOM parsing (please submit issues/PRs!)
  • More explicit planning
  • Data collection and fine-tuning

NOTE: Remember to use the latest release of the openai API for the vision model:

pip install --upgrade openai

About

Globot is an agent that controls your browser using playwright and GPT-4V.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%