Paper page - AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation
Papers arxiv:2605.12925 AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation Published on May 13 Submitted by taesiri on May 14 Authors: , , , , , , Abstract Software engineering agents are evaluated using…