Search

Showing top 106 results for "agentic improvements"

Paper page - T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

…We are excited to share T²PO, an uncertainty-guided exploration control method for stable multi-turn agentic reinforcement learning. T²PO improves…

May 5, 2026

Paper page - FineVerify: Scaling Test-Time Compute with Fine-Grained Self-Verification for Agentic Search

…Zhao on Jun 2. Authors: …, Hui Chen, … Abstract: FineVerify is a self-verification framework for agentic search that improves accuracy through decomposed sub-question checking and trajectory selection. Generated by Qwen/Qwen2…

Jun 2, 2026

Copilot cloud agent supports auto model selection - GitHub Changelog

…copilot · Jun. 11 · Release: Agentic workflows no longer need a personal access token; copilot · Jun. 10 · Release: Copilot Chat now sees your agent sessions; copilot · Jun. 10 · Improvement: Dedicated security review command…

May 14, 2026 · Allison

Xcode 26.5 adds two features that make agentic coding more useful - 9to5Mac

…Two of these improvements make agentic coding workflows even smarter. Here are the details. Xcode 26.5 adds two useful Coding Intelligence features Apple released Xcode 26.5 yesterday, with two features…

May 12, 2026 · Marcus Mendes

Discussions and forums

Hacker News · u/gmays · 2w ago

Bill Gates AI on AI (one month later)

The Agentic Tidal Wave. To: Executive Staff and Direct Reports. From: Bill Gates. Date: April 26, 2026. Our vision for the last 20 years can be summarized in a succinct way. We saw that exponential improvements in clou…

Paper page - MemTrain: Self-Supervised Context Memory Training

…Ziheng Li, … Abstract: A self-supervised training framework called MemTrain enhances long-horizon language model agents' memory capabilities through proxy tasks optimized via GRPO, improving downstream reasoning performance. Generated by Qwen/Qwen2…

Jun 4, 2026

Top stories

HPE Expands AI Factory Portfolio for Agentic AI Deployments

Paper page - Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

Paper page - Skill-3D: Evolving Scene-Aware Skills for Agentic 3D Spatial Reasoning

Discussions and forums

Building self-improving tax agents with Codex

Reading of OpenAI's Self-Improving Tax Agents

The engineering practices Claude Code and Codex use to improve AI agents

Building Hybrid Multi-Agent Systems from Client to Cloud

When Code Writes Back: Coding Agents and the Future of Open Source Software Development

Build Your Openclaw Agent with Multi-Modal Models
