Paper page - MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI
Papers arxiv:2605.08678 MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI Published on May 9 Submitted by Bohan22 on May 11 Authors: Bohan Lyu , , , Jiaru Zhang , Qixin Xu , , , , , , , , , Junlin Yang , , , , , , , , Abstract Current AI agents struggle to invent gen… …