Search: Performance fixes

Paper page - UniPath: Adaptive Coordination of Understanding and Generation for Unified Multimodal Reasoning

… This suggests that exploiting such diversity is key to improving performance. …

May 13, 2026

Paper page - Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO

… Experiments with Qwen2.5-Math-7B and Qwen3-1.7B on DAPO-17k and Polaris, evaluated on six reasoning and coding benchmarks, show that BA consistently improves training stability and final performance over standard token and sequence aggregation . …

May 8, 2026

Paper page - BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

… The results are also amazing: over 98% performance retention with up to 85% MoE FLOPs reduction, 2.5× faster decoding, and 1.4× higher throughput. …

May 15, 2026

Paper page - MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

… We further propose Area Under the Rank Accuracy Curve AURAC , a metric that consistently evaluates the performance of hierarchical low-rank adapters. …

May 11, 2026

Paper page - Solve the Loop: Attractor Models for Language and Reasoning

Papers arxiv:2605.12466 Solve the Loop: Attractor Models for Language and Reasoning Published on May 12 Submitted by Paria Rashidinejad on May 13 University of Southern California Authors: , Paria Rashidinejad Abstract Attractor Models enable efficient iterative refinement through fixed-point solvi… …

May 13, 2026

Paper page - Learning, Fast and Slow: Towards LLMs That Adapt Continually

… Fast-Slow Training FST is up to 3x more sample-efficient than only slow learning RL across reasoning tasks, while consistently reaching a higher performance asymptote. …

May 13, 2026

Paper page - Mela: Test-Time Memory Consolidation based on Transformation Hypothesis

… Moreover, with the pretrained context length fixed at 4K, Mela maintains performance on significantly longer contexts, whereas Transformer baselines degrade rapidly beyond their training length. …

May 12, 2026

Paper page - TacoMAS: Test-Time Co-Evolution of Topology and Capability in LLM-based Multi-Agent Systems

Papers arxiv:2605.09539 TacoMAS: Test-Time Co-Evolution of Topology and Capability in LLM-based Multi-Agent Systems Published on May 10 Submitted by Xinyu Lin on May 13 Authors: , , , , , , Abstract Test-time co-evolution framework for multi-agent systems that jointly adapts agent capabilities and … …

May 12, 2026

Paper page - DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

… The following papers were recommended by the Semantic Scholar API Flow-OPD: On-Policy Distillation for Flow Matching Models 2026 $R \text{dm}$: Re-conceptualizing Distribution Matching as a Reward for Diffusion Distillation 2026 V-GRPO: Online Reinforcement Learning for Denoising Generative Models … …

May 15, 2026

Paper page - AgensFlow: A Coordination-Policy Substrate for Multi-Agent Systems

…LLMs) require many coordination choices that are difficult to fix a priori: which skill protocol to invoke, which agent role should perform a subtask, which model to bind to each role, how…

May 28, 2026

Followed topics

Paper page - UniPath: Adaptive Coordination of Understanding and Generation for Unified Multimodal Reasoning

Paper page - Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO

Paper page - BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

Paper page - MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Paper page - Solve the Loop: Attractor Models for Language and Reasoning

Paper page - Learning, Fast and Slow: Towards LLMs That Adapt Continually

Paper page - Mela: Test-Time Memory Consolidation based on Transformation Hypothesis

Paper page - TacoMAS: Test-Time Co-Evolution of Topology and Capability in LLM-based Multi-Agent Systems

Paper page - DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Paper page - AgensFlow: A Coordination-Policy Substrate for Multi-Agent Systems