OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Maintained by xlang-ai
Project Information
- GitHub Stars
- 1,778
- Language
- Python
- Last Updated
- April 18, 2025 at 07:24 AM
Topics
agent
artificial-intelligence
benchmark
multimodal
reinforcement-learning
rpa
code-generation
language-model
cli
gui
natural-language-processing
large-action-model
llm
vlm