SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Maintained by OSU-NLP-Group
Project Information
- GitHub Stars
- 737
- Language
- Python
- Last Updated
- April 2, 2025 at 05:00 PM
Topics
agent