Search: AI training and model updates

Paper page - Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

…Empirically, the flagship Darwin-27B-Opus achieves 86.9% on GPQA Diamond, ranking #6 among 1,252 evaluated models, and outperforming its fully trained foundation model without any gradient-based training. Across…

May 15, 2026

Paper page - MARBLE: Multi-Aspect Reward Balance for Diffusion RL

…AI-generated summary Reinforcement learning fine-tuning has become the dominant approach for aligning diffusion models with human preferences. However, assessing images is intrinsically a multi-dimensional task , and multiple evaluation criteria…

May 8, 2026

Paper page - HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

…Tingyang Chen , , , , , , , , , , , , , Abstract HarnessX enables adaptive and evolvable AI agent runtime interfaces through compositional primitives, trace-driven evolution, and feedback loops that improve both harness design and model training. Generated by Qwen…

Jun 16, 2026

Paper page - NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation

…training and evaluating next-generation autonomous driving policies. We additionally show preliminary results indicating that a world-action model (WAM) post-trained from OmniDreams achieves strong performance on the Physical AI Autonomous…

Jun 3, 2026

Paper page - MDN: Parallelizing Stepwise Momentum for Delta Linear Attention

…attention models face challenges with information decay and convergence, which are addressed through a momentum-based approach that improves training efficiency and performance over existing models like Mamba2 and GDN. AI-generated…

May 11, 2026

Paper page - MindZero: Learning Online Mental Reasoning With Zero Annotations

…After training, MindZero internalizes model-based reasoning into fast single-pass inference. We evaluate MindZero against baselines across challenging mental reasoning and AI assistance tasks in gridworld and household domains . We found…

Jun 2, 2026

Paper page - MinT: Managed Infrastructure for Training and Serving Millions of LLMs

…adaptation training and serving by keeping base models resident and moving lightweight adapter revisions, scaling across multiple dimensions including large model architectures, reduced storage requirements, and distributed policy management. AI-generated summary…

May 14, 2026

Followed topics

Search