Search

Showing top 14 results for "hack automation"

Paper page - LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

…This rubric reward is applied only to responses with correct final answers (positive-only strategy), distinguishing the reasoning quality among correct responses and preventing reward hacking . Experiments on three reasoning LLMs (4B…

Jun 1, 2026

Paper page - SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

…By cascading these adaptive efficiency metrics with a strict correctness gate, our approach effectively avoids the brevity bias associated with absolute penalties and mitigates reward hacking. Extensive experiments on long-horizon benchmarks…

Jun 9, 2026

Paper page - LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

…real-world leakage propensity, not just worst-case hacks, to give us a true, comprehensive picture of this phenomenon. This is an automated message from the Librarian Bot . I found the following…

Jun 5, 2026

Paper page - Flow-OPD: On-Policy Distillation for Flow Matching Models

…This is an automated message from the Librarian Bot . I found the following papers similar to this paper. The following papers were recommended by the Semantic Scholar API $R_\text{dm}$: Re…

May 11, 2026

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics

Paper page - LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Paper page - SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

Paper page - LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

Paper page - Flow-OPD: On-Policy Distillation for Flow Matching Models