Search: data/control

Paper page - InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search

…below 50% overall accuracy, highlighting challenges in visual evidence seeking , search control , and multimodal evidence integration. We release the benchmark data and evaluation code at https://github.com/hbhalpha/InterLV-Search-Bench…

May 13, 2026

Paper page - LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training

…No dataset linking this paper Cite arxiv.org/abs/2605.29888 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2605…

May 29, 2026

Paper page - On the Limits of LLM Adaptability: Impact of Model-Internalized Priors on Annotation Task Performance

…After controlling for dataset-level confounds, DSF shows a positive association with model performance (partial r = +0.41), while three distinct memorization metrics ( ROUGE-L , BERTScore , and embedding cosine similarity ) all fail…

Jun 12, 2026

Paper page - PhoneWorld: Scaling Phone-Use Agent Environments

…No dataset linking this paper Cite arxiv.org/abs/2605.29486 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2605…

May 29, 2026

Paper page - Who Annotates in NLP? A Large-scale Assessment of Human Annotation Reporting between 2018 and 2025

…of much NLP research , from dataset construction to model evaluation, but papers often leave unclear who produced the annotations and how the annotation process was controlled. We provide the first large-scale…

Jun 2, 2026

Paper page - EMMA: Extracting Multiple physical parameters from Multimodal Data

…Our results establish EMMA as a general, scalable solution for physics-consistent model extraction from opportunistic multimodal data . Code and data are available at: https://github.com/ImpactLabASU/EMMA-CVPR2026 View arXiv…

Jun 9, 2026

Paper page - Breaking the Bubble: Asynchronous Pipeline Parallel Training with Bounded Weight Inconsistency

…No dataset linking this paper Cite arxiv.org/abs/2606.07881 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2606…

Jun 11, 2026

Paper page - MAIC-UI: Making Interactive Courseware with Generative UI

…A controlled lab study with 40 participants shows MAIC-UI reduces editing iterations (4.9 vs. 7.0) and significantly improves learnability and controllability compared to direct Text-to-HTML generation. A…

Apr 29, 2026

Paper page - Answer Presence Drives RAG Rewriting Gains

Papers arxiv:2606.05633 Answer Presence Drives RAG Rewriting Gains Published on Jun 4 Submitted by ShinerYang on Jun 9 Authors: , , Ke Yang , , , , , , , , Abstract Controlled interventions reveal that gold answer presence in…

Jun 9, 2026

Paper page - SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

…No dataset linking this paper Cite arxiv.org/abs/2605.30993 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2605…

Jun 1, 2026

Followed topics

Search

Paper page - InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search

Top stories

Paper page - MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

Paper page - Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

Paper page - EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management

Paper page - Measuring the Symmetry--Data Exchange Rate