OSWorld logo

OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Maintained by xlang-ai

Project Information

GitHub Stars
1,778
Language
Python
Last Updated
April 18, 2025 at 07:24 AM

Topics

agent
artificial-intelligence
benchmark
multimodal
reinforcement-learning
rpa
code-generation
language-model
cli
gui
natural-language-processing
large-action-model
llm
vlm

Explore More

Discover similar projects or browse the full catalog.