Search: Advanced automations

Paper page - T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

…Despite advances in stabilization techniques such as fine-grained credit assignment and trajectory filtering , instability remains pervasive and often leads to training collapse. We argue that this instability stems from inefficient exploration…

May 5, 2026

Paper page - EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management

…Generated by Qwen/Qwen2.5-Coder-32B-Instruct Recent progress in Large Language Model (LLM) agents has enabled promising advances in automated data science. However, existing approaches remain fundamentally limited by their…

Jun 5, 2026

Paper page - LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

…Tong Zheng , , , , , , Runpeng Dai , , , Tianyi Xiong , , , Abstract AutoTTS automates test-time scaling strategy discovery by formulating it as controller synthesis over reasoning trajectories and probe signals, achieving improved accuracy-cost tradeoffs with…

May 11, 2026

Paper page - ClawGym: A Scalable Framework for Building Effective Claw Agents

…This is an automated message from the Librarian Bot . I found the following papers similar to this paper. The following papers were recommended by the Semantic Scholar API SWE-Shepherd: Advancing PRMs…

Apr 30, 2026

Paper page - UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

…Generated by Qwen/Qwen2.5-Coder-32B-Instruct Recent advances in mobile GUI agents have shown strong potential for automating mobile tasks, but most effective systems still depend on large vision-language…

May 29, 2026

Paper page - Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

…https://researchpod.app/episode/611e7f79-5e5b-4659-bdb9-99d8d696c41e Generated automatically by ResearchPod — happy to take feedback from the authors. This is an automated message from the Librarian Bot . I found the…

May 28, 2026

Paper page - VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction

…AI-generated summary High-quality 3D scene reconstruction has recently advanced toward generalizable feed-forward architectures , enabling the generation of complex environments in a single forward pass. However, despite their strong performance…

May 15, 2026

Paper page - YoCausal: How Far is Video Generation from World Model? A Causality Perspective

…Generated by Qwen/Qwen2.5-Coder-32B-Instruct As video diffusion models (VDMs) advance toward world models , a key question arises: do they truly understand causality , or merely overfit to statistical temporal…

May 29, 2026

Paper page - Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

…Keming Wu , Zuhao Yang , Kaichen Zhang , Shizun Wang , , , Zhongyu Yang , , , , , , , , , , , , , , , Abstract Visual generation models need to advance beyond appearance synthesis to incorporate structural, dynamic, and causal understanding through a five-level taxonomy…

May 1, 2026

Paper page - DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes

…AI-generated summary Reinforcement learning has become a central paradigm for advancing reasoning in large language models , yet most existing methods still depend on stronger teacher models or heavily curated difficult datasets…