Search: community feedback

Paper page - Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

…View arXiv page View PDF Project page Add to collection Community FINAL Bench introduces a new evaluation paradigm for LLMs: functional metacognitive reasoning — not just "can the model solve it," but "does…

May 15, 2026

Paper page - SkillOS: Learning Skill Curation for Self-Evolving Agents

…However, they still struggle to learn complex long-term curation policies from indirect and delayed feedback. To tackle this challenge, we propose SkillOS, an experience-driven RL training recipe for learning skill…

May 8, 2026

Paper page - Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

…View arXiv page View PDF GitHub 1 Add to collection Community We believe this work provides a step toward bridging ICL from pattern matching to in-context test time learning with two…

May 14, 2026

Paper page - Let ViT Speak: Generative Language-Image Pre-training

…View arXiv page View PDF Project page GitHub 116 Add to collection Community that gated attention trick to curb attention sink in a single, concatenated vision+text transformer is the most interesting…

May 4, 2026

Paper page - The First Token Knows: Single-Decode Confidence for Hallucination Detection

…View arXiv page View PDF Add to collection Community Sharing our paper "The First Token Knows: Single-Decode Confidence for Hallucination Detection" . A single greedy decode captures almost all the hallucination-detection…

May 7, 2026

Paper page - Stream-T1: Test-Time Scaling for Streaming Video Generation

…View arXiv page View PDF Project page GitHub 34 Add to collection Community While Test-Time Scaling (TTS) offers a promising direction to enhance video generation without the surging costs of training…

May 7, 2026

Paper page - PREPING: Building Agent Memory without Tasks

…A Proposer generates synthetic tasks conditioned on this state, a Solver executes them, and a Validator determines which trajectories are eligible for memory insertion while also providing feedback to guide future proposals…

May 15, 2026

Paper page - MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

…View arXiv page View PDF GitHub 889 Add to collection Community We're in the middle of a skill/agent explosion — everyone is packaging capabilities as reusable modules. But medical research skills…

May 7, 2026

Paper page - ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control

…to collection Community ExoActor provides a scalable approach to modeling interaction-rich humanoid behaviors, potentially opening a new avenue for generative models to advance general-purpose humanoid intelligence. Feedback is very welcome…

May 1, 2026

Paper page - RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments

…https://be2rlab.github.io/radio_vipe View arXiv page View PDF Project page GitHub 125 Add to collection Community We present RADIO-ViPE (Reduce All Domains Into One — Video Pose Engine), an…

Apr 30, 2026

Followed topics

Paper page - Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Paper page - SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper page - Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

Paper page - Let ViT Speak: Generative Language-Image Pre-training

Paper page - The First Token Knows: Single-Decode Confidence for Hallucination Detection

Paper page - Stream-T1: Test-Time Scaling for Streaming Video Generation

Paper page - PREPING: Building Agent Memory without Tasks

Paper page - MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

Paper page - ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control

Paper page - RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments