Search: agentic AI direction

Paper page - HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?

…Towards Trustworthy Evaluation of Autonomous Agents (2026); ClawArena: Benchmarking AI Agents in Evolving Information Environments (2026); AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation (2026); OccuBench: Evaluating AI Agents…

May 5, 2026

Paper page - SkillGrad: Optimizing Agent Skills Like Gradient Descent

…AI-generated summary: Agent skills provide a lightweight way to adapt LLM agents to specialized domains by storing reusable procedural knowledge in structured files. However, whether downloaded from third parties or self…

May 28, 2026

Paper page - ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both

…AI-generated summary: Visual reasoning, often interleaved with intermediate visual states, has emerged as a promising direction in the field. A straightforward approach is to directly generate images via unified models during…

May 15, 2026

Paper page - Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital

arxiv:2604.26091. Published on Apr 28. Submitted by Poof on Apr 30. DXRG AI Inc. Abstract: Autonomous language…

Apr 30, 2026

Paper page - Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas

…an outer-loop AI agent autonomously redesigns the inner-loop pipeline of an LLM policy-synthesis system for multi-agent Sequential Social Dilemmas (SSDs). A researcher agent R (run as a coding…

May 29, 2026

Paper page - EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

…Get this paper in your agent: hf papers read 2606.03108…

Jun 11, 2026

Paper page - SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks

…AI-generated summary: Industrial LLM agent systems often separate planning from execution, yet LLM planners frequently produce structurally invalid or unnecessarily long workflows, leading to brittle failures and avoidable tool and API…

May 15, 2026

Paper page - SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

…Position-Aware Undetectable Skill Injection on LLM Agents (2026); Plant, Persist, Trigger: Sleeper Attack on Large Language Model Agents (2026); AgentCanary: A Security Evaluation Framework for Autonomous AI Agents in Real Executable…

Jun 10, 2026

Paper page - GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

…AI Agents (2026)…