Paper page - SWE-WebDevBench: Evaluating Coding Agent Application Platforms as Virtual Software Agencies
…We release SWE-WebDev Bench as a community benchmark to enable such replication and help platform builders identify and address these gaps. Code and benchmark resources are available at: https://github.com…
