Overall framework
Generated Reality studies interactive human-centric world simulation for extended reality. The system conditions video generation on tracked head pose and joint-level hand poses, then distills a bidirectional video diffusion teacher into a causal interactive model for egocentric virtual environments.
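As a rough illustration of the two ideas in this overview, the sketch below builds a per-frame conditioning vector from a head pose and per-joint hand poses, and contrasts the temporal attention pattern of a bidirectional teacher with that of a causal student. It is not the system's actual implementation. The function names, the 7-value head pose layout (position plus quaternion), and the 3-value joint layout are assumptions made only for this example.

```python
from typing import List


def pack_conditioning(head_pose: List[float],
                      hand_joints: List[List[float]]) -> List[float]:
    """Flatten one frame's head pose and per-joint hand poses into one vector.

    Hypothetical layout: head_pose holds 7 values (xyz position + quaternion),
    and each entry of hand_joints holds one joint's 3 values.
    """
    flat = list(head_pose)
    for joint in hand_joints:
        flat.extend(joint)
    return flat


def bidirectional_mask(num_frames: int) -> List[List[bool]]:
    """Teacher-style mask: every frame may attend to every frame, past or future."""
    return [[True] * num_frames for _ in range(num_frames)]


def causal_mask(num_frames: int) -> List[List[bool]]:
    """Student-style mask: frame i may attend only to frames 0..i."""
    return [[j <= i for j in range(num_frames)] for i in range(num_frames)]


if __name__ == "__main__":
    # One frame of conditioning: identity head pose and two example joints.
    cond = pack_conditioning([0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0],
                             [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]])
    print(len(cond))
    for row in causal_mask(3):
        print(row)
```

The causal mask is what makes interactive use possible: a frame never depends on future frames, so each new frame can be generated as soon as the latest tracked head and hand poses arrive.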