SeeAct logo

SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Maintained by OSU-NLP-Group

Project Information

GitHub Stars
737
Language
Python
Last Updated
April 2, 2025 at 05:00 PM

Topics

agent

Explore More

Discover similar projects or browse the full catalog.