Search: data recovery

Paper page - Recovering Hidden Reward in Diffusion-Based Policies

…curious how robust the reward recovery is when the max-entropy assumption is only approximately satisfied in real expert data. the core move—using the energy gradient as the denoising field and…

May 8, 2026

Paper page - PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

…Further analysis shows that agents are especially vulnerable when failures lack explicit error signals or when recovery requires longer alternative tool-use paths . These results establish PlanBench-XL as a testbed for…

Jun 23, 2026

Paper page - GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

…This privileged setup better conditions 4D recovery , allowing model-based object tracking, human motion estimation, and interaction-aware optimization to reconstruct metric 4D human-object interaction (HOI) trajectories with reduced depth ambiguity…

Jun 4, 2026

Paper page - EgoForce: Forearm-Guided Camera-Space 3D Hand Pose from a Monocular Egocentric Camera

…As a result, models typically require extensive training on device-specific datasets, which are costly and laborious to acquire. This paper addresses these challenges by introducing EgoForce, a monocular 3D hand reconstruction…

May 13, 2026

Paper page - MemReread: Enhancing Agentic Long-Context Reasoning via Memory-Guided Rereading

…No dataset linking this paper Cite arxiv.org/abs/2605.10268 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2605…

May 14, 2026

Paper page - Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation

…No dataset linking this paper Cite arxiv.org/abs/2606.16429 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2606…

Jun 19, 2026

Paper page - A Foundation Model for Zero-Shot Logical Rule Induction

…AI-generated summary Inductive Logic Programming (ILP) learns interpretable logical rules from data. Existing methods are transductive: their learned parameters are bound to specific predicates and require retraining for each new task…

May 8, 2026

Paper page - DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes

…models or heavily curated difficult datasets, limiting scalable capability improvement. In this paper, we introduce DenoiseRL, a reinforcement learning framework that substitutes external supervision with recovery-oriented optimization over failures from weak…

May 28, 2026

Paper page - SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment

…No dataset linking this paper Cite arxiv.org/abs/2605.04012 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2605…

May 7, 2026

Paper page - Dynamic Latent Routing

…In low-data fine-tuning settings, DLR matches or outperforms supervised fine-tuning across four datasets and six models, achieving a mean gain of +6.6 percentage points, while prior discrete-latent…

May 15, 2026

Followed topics