StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
…StraTA samples a compact strategy from the initial task state, conditions subsequent actions on that strategy, and trains strategy generation and action execution jointly with a hierarchical GRPO-style rollout design, further…
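The two-level structure described in the excerpt can be illustrated with a minimal sketch of group-relative advantages. The excerpt does not give StraTA's actual objective, so everything here is an assumption for illustration: the helper names (`group_normalize`, `hierarchical_advantages`), the choice to score a strategy by the mean return of its rollouts, and the standard GRPO-style mean/std normalization applied at both levels are all hypothetical.

```python
import statistics
from typing import List, Tuple


def group_normalize(values: List[float], eps: float = 1e-8) -> List[float]:
    """GRPO-style normalization: subtract the group mean, divide by the group std."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / (std + eps) for v in values]


def hierarchical_advantages(
    returns: List[List[float]],
) -> Tuple[List[float], List[List[float]]]:
    """Compute advantages at two levels (hypothetical sketch, not the paper's method).

    returns[k][m] is the episode return of rollout m executed under strategy k,
    where all K strategies were sampled from the same initial task state.
    """
    # Assumption: a strategy's score is the mean return of the rollouts it conditioned.
    strategy_scores = [statistics.fmean(group) for group in returns]
    # Strategy level: compare strategies against each other.
    strategy_adv = group_normalize(strategy_scores)
    # Action level: compare rollouts that share the same strategy.
    action_adv = [group_normalize(group) for group in returns]
    return strategy_adv, action_adv


if __name__ == "__main__":
    # Two strategies, two rollouts each (toy returns).
    toy_returns = [[1.0, 0.0], [0.0, 0.0]]
    s_adv, a_adv = hierarchical_advantages(toy_returns)
    print("strategy advantages:", s_adv)
    print("action advantages:", a_adv)
```

Under these assumptions, the strategy-level advantages would weight the log-probability of the strategy tokens, and the action-level advantages would weight the action tokens within each strategy's group. Whether StraTA assigns credit in exactly this way is not stated in the excerpt.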