Paper page - Flow-OPD: On-Policy Distillation for Flow Matching Models
…Unified Post-Training for Large Vision-Language Models (2026) OP-GRPO: Efficient Off-Policy GRPO for Flow-Matching Models (2026) SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting…