WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments
…
The following papers were recommended by the Semantic Scholar API:

- OSExpert: Computer-Use Agents Learning Professional Skills via Exploration (2026)
- EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings (2026)
- OccuBench: Evaluating AI Agents on…

…