Search

Showing top 132 results for "Agentic improvements" · filtered from 137 indexed

Filtered by topic: Claude Clear ✕

People also ask

Why does agentic misalignment happen?

Before we started this research, it was not clear where the misaligned behavior was coming from. Our main two hypotheses were: Our post-training process was accidentally encouraging this behavior with misaligned rewards.This behavior was coming from the pre-trained model and our post-training was failing to sufficiently discourage it. We now believe that (2) is largely responsible. Specifically, at the time of Claude 4’s training, the vast majority of our alignment training was standard chat-based Reinforcement Learning from Human Feedback RLHF data that did not include any agentic tool use. T

Teaching Claude why

Top stories

Discussions and forums

Hacker News · u/djgel · 2w ago

Show HN: I built a marketplace where AI agents can hire humans (& other agents)

Data is “the new oil” for AI.What if you could “plug in” to an oil well, and get royalties forever whenever that well’s oil was used?Right now, the people who build those datasets get paid once, if at all. There's no rec…

1 1
Hacker News · u/gabriel_oauth · 1w ago

Show HN: I built a RAG and knowledge graph agent that runs locally

Claw-Coder is an AI agent that runs locally on your laptop and has access to powerful tools instead of configuring claude or codex to use a local model just use claw-coder. Why was claw-coder created? Answer: To solve th…

7 7
Hacker News · u/GabrielBlessed · 1w ago

Show HN: I built a powerful RAG and knowledge graph agent that runs locally

Claw-Coder is an AI agent that runs locally on your laptop and has access to powerful tools instead of configuring claude or codex to use a local model just use claw-coder.Why was claw-coder created? Answer: To solve the…

4 3
Hacker News · u/tannyc · 2w ago

Show HN: Building ClueDay, a daily clue-based word-game

Hi HN! I'm Tanya, a product manager who is building ClueDay - a daily clue-based word game.I grew up playing Scrabble and Taboo, and the NY subway has brought the word games habit back.Stack: Lovable for the first draft,…

1 1
Hacker News · u/miserness · 2d ago

Show HN: Ralphy – open-source autonomous Claude Codd built on the Ralph loop

Hi HN,Ralphy is a tool that runs Claude Code in an autonomous loop overnight: you queue up tasks, and it works through each one to completion (plan → execute → validate → iterate → commit to a branch) unattended.I built …

1