Search

Showing top 63 results for "real-world evaluation"

huggingface.co › papers › 2602.00095

Paper page - EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions

… Unpacking Multimodal Error Analysis in Handwritten Math 2026 CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing 2026 EduIllustrate: Towards Scalable Automated Generation Of Multimodal Educational Content 2026 Unveiling Fine-Grained Visual Traces: Evaluati… …

May 8, 2026