Overall framework
This paper proposes Multi-View GRPO for preference alignment in text-to-image flow models. Instead of evaluating a group of generated samples against a single condition, the method augments the condition space with semantically adjacent captions and re-estimates advantages from multiple views, improving optimization signals without regenerating samples.