From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Overall framework

Abstract

This paper proposes Multi-View GRPO for preference alignment in text-to-image flow models. Rather than scoring a group of generated samples against a single condition, the method augments the condition space with semantically adjacent captions and re-estimates advantages from these multiple views, turning a sparse optimization signal into a denser one without regenerating samples.
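
The idea of re-estimating advantages from multiple views can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes the standard group-relative normalization (center and scale rewards within a group), and `multi_view_advantages`, `toy_reward`, the tag-set samples, and the example captions are all hypothetical stand-ins. How the per-view advantages are combined during optimization is not stated in the abstract, so the sketch stops at one row of advantages per caption.

```python
from statistics import mean, pstdev
from typing import Callable, List, Sequence, Set


def group_advantages(rewards: Sequence[float], eps: float = 1e-8) -> List[float]:
    """Group-relative advantage: center and scale rewards within one group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def multi_view_advantages(
    samples: Sequence[Set[str]],
    captions: Sequence[str],
    reward_fn: Callable[[Set[str], str], float],
) -> List[List[float]]:
    """Return one row of advantages per caption (view).

    The same samples are rescored under every caption, so no new
    samples are generated (assumption: rescoring is much cheaper
    than sampling).
    """
    return [
        group_advantages([reward_fn(s, c) for s in samples])
        for c in captions
    ]


def toy_reward(sample_tags: Set[str], caption: str) -> float:
    """Hypothetical reward: number of caption words present in the sample's tags."""
    return float(sum(word in sample_tags for word in caption.split()))


# Three "generated samples", represented here only by tag sets (toy stand-in).
samples = [{"red", "car"}, {"blue", "car"}, {"red", "bike"}]
# The original caption plus two semantically adjacent captions (hypothetical).
captions = ["red car", "red sports car", "crimson car"]

views = multi_view_advantages(samples, captions, toy_reward)
for caption, row in zip(captions, views):
    print(caption, [round(a, 3) for a in row])
```

Each row is normalized within its own view, so a sample that ranks differently under an adjacent caption receives a different advantage there, which is the extra signal a single-condition evaluation would not provide.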

Publication
In European Conference on Computer Vision (ECCV)
Tong WU 吴桐
Assistant Professor @ Fudan

My research interests include 3D vision, long-tailed recognition, and robustness.
