F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking
…Rohan Surana, …

Abstract

A unified framework combines candidate generation and ranking in a single autoregressive model, using factorized group-relative policy optimization to address credit assignment challenges in end-to-end retrieval…