Search

Showing top 129 results for "AI training data"

Paper page - SkillOS: Learning Skill Curation for Self-Evolving Agents

…No dataset linking this paper Cite arxiv.org/abs/2605.06614 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2605…

May 8, 2026

Paper page - Think, then Score: Decoupled Reasoning and Scoring for Video Reward Modeling

…AI-generated summary: Recent advances in generative video models are increasingly driven by post-training and test-time scaling, both of which critically depend on the quality of video reward models (RMs…

May 8, 2026

Paper page - Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

…AI-generated summary: We present Darwin Family, a framework for training-free evolutionary merging of large language models via gradient-free weight-space recombination. We ask whether frontier-level reasoning performance can…

May 15, 2026

Paper page - Fast Byte Latent Transformer

Paper page - Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models

Paper page - LASE: Language-Adversarial Speaker Encoding for Indic Cross-Script Identity Preservation

Paper page - Stream-T1: Test-Time Scaling for Streaming Video Generation

Paper page - Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria

Paper page - VLM3: Vision Language Models Are Native 3D Learners