Search: Performance & optimization

Paper page - LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents

…Furthermore, applying Direct Multi-turn Preference Optimization ( DMPO ) on our RL environments yields additional performance gains. These results systematically demonstrate that fully synthetic, executable environments offer a scalable and verifiable supervision signal…

May 29, 2026

Paper page - Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital

…Agentic harnesses must be specifically built, evaluated and optimized for markets in order to perform reliably. The paper reports concrete failure modes, fabricated rules, fee paralysis, numeric anchoring, cadence trading, and tokenomics…

Apr 30, 2026

Paper page - SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

…gradual architecture transitions lead to better optimization trajectories. Putting it all together, we compress Qwen3-Next-80A3B to a 23A2B model that retains competitive performance. These results offer practical guidance for efficient…

May 12, 2026

Paper page - Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

…In this work, we introduce Continuous-Time Distribution Matching (CDM), migrating the DMD framework from discrete anchoring to continuous optimization for the first time. CDM achieves this through two continuous-time designs…

May 8, 2026

Gemma 3n fully available in the open-source ecosystem!

Sensational release! Are the MobileNet-v5 weights coming to https://huggingface.co/timm ? Are there any results on the performance of this model (the vision encoder only) to inspect? Thank you all…

Mar 6, 2026 · Aritra Roy Gosthipaty

Paper page - What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion

…Reaches competitive performance in just 80 epochs compared to 800+ epochs for previous methods. Diffusion-Friendly Manifold: Explicitly optimizes three key geometric properties: Spatial Structure Coherence, Local Manifold Continuity, and Global Semantic…

May 11, 2026

Paper page - Beyond Reasoning: Reinforcement Learning Unlocks Parametric Knowledge in LLMs

…To put it directly, RL fundamentally optimizes the recall of latent knowledge. 2️⃣ The unexpected contribution of 0/128 samples: Remarkably, ~83% of the performance jump is driven by training on the…

May 13, 2026

Paper page - Crosslingual On-Policy Self-Distillation for Multilingual Reasoning

…Yihong Liu , , , Abstract COPSD transfers high-resource language model reasoning behavior to low-resource languages using self-distillation with crosslingual context, improving mathematical reasoning performance. AI-generated summary Large language models (LLMs…

May 12, 2026

Paper page - GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

…These developments lead to strong performance in multimodal coding, visual tool use , and framework-based agentic tasks, while preserving competitive text-only coding capability. More importantly, our development process offers practical insights…