Paper page - Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs
Papers arxiv:2605.09063 Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Published on May 9 Submitted by GUIJIN SON on May 12 2 Paper of the day EleutherAI Authors: , Seungone Kim , Catherine Arnett , Hyunwoo Ko , , , , , , , , Sang Park , , , Seungy… …
