Paper page - Rubric-based On-policy Distillation
… The following papers were recommended by the Semantic Scholar API A Survey of On-Policy Distillation for Large Language Models 2026 DP-OPD: Differentially Private On-Policy Distillation for Language Models 2026 MAD-OPD: Breaking the Ceiling in On-Policy Distillation via Multi-Agent Debate 2026 Uni-… …