Search: Performance fixes

Paper page - Key-Value Means

…AI-generated summary We present Key-Value Means ("KVM"), a novel block-recurrence for attention that can accommodate either fixed-size or growing state . Equipping a strong transformer baseline with fixed-size…

May 12, 2026

Paper page - Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

…points along sampling trajectories rather than only at a few fixed anchors. Second, we propose a continuous-time alignment objective that performs active off-trajectory matching on latents extrapolated via the student…

May 8, 2026

Paper page - UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

…However, existing methods often train separate models for each problem setting, which fixes the input-output mapping and limits the modeling of correlations across modalities. We present UniVidX, a unified multimodal framework…

May 4, 2026

Paper page - Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

…One nuance is that γ is estimated across different depths, so fixing the horizon cannot directly test γ itself. But I agree that a tighter ablation would be useful: fixing the horizon…

May 8, 2026

Paper page - SimWorld Studio: Automatic Environment Generation with Evolving Coding Agent for Embodied Agent Learning

…reliability, generated environments substantially improve embodied agent performance that generalizes to unseen benchmarks, and co-evolution yields an 18-point success-rate gain over fixed-environment learning and a 40-point gain…

May 20, 2026

Paper page - Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation

…This yields an optimization mechanism that modulates the geometry of weight matrices while keeping their spectral norm fixed. We derive the Pion update rule, systematically examine its design choices, and analyze its…

May 13, 2026

Paper page - StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

…Xiangyuan Xue , Yifan Zhou , , , , , , Abstract Strategic Trajectory Abstraction framework enhances long-horizon decision making in large language models by introducing trajectory-level strategies that improve sample efficiency and performance across interactive environments…

May 8, 2026

Followed topics

Search

Paper page - Key-Value Means

Paper page - Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

Paper page - UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper page - Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper page - SimWorld Studio: Automatic Environment Generation with Evolving Coding Agent for Embodied Agent Learning

Paper page - Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation

Paper page - StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Paper page - AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper page - Rethinking Memory as Continuously Evolving Connectivity

Paper page - Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs