Tong Wu's Homepage
Tong Wu's Homepage
Home
Experience
Publications
Publications
Type
Conference paper
Journal article
Report
Date
2026
2025
2024
2023
2022
2021
2020
2019
Effective Multi-sensor Conditioning for Street-view Novel-view Synthesis
PDF
Cite
Project
Infinite Gaze Generation for Videos with Autoregressive Diffusion
PDF
Cite
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space
PDF
Cite
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control
PDF
Cite
Project
UniHand: A Unified Model for Diverse Controlled 4D Hand Motion Modeling
PDF
Cite
Image2Garment: Simulation-ready Garment Generation from a Single Image
PDF
Cite
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties
PDF
Cite
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
PDF
Cite
Project
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
PDF
Cite
Project
Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity
PDF
Cite
Project
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
PDF
Cite
Video World Models with Long-term Spatial Memory
PDF
Cite
Project
Towards Vision-Language-Garment Models for Web Knowledge Garment Understanding and Generation
PDF
Cite
SS4D: Native 4D Generative Model via Structured Spacetime Latents
PDF
Cite
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
PDF
Cite
Code
Project
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
PDF
Cite
Code
Project
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
PDF
Cite
Code
Project
RelightVid: Temporal-Consistent Diffusion Model for Video Relighting
PDF
Cite
Code
Project
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
PDF
Cite
Imagine360: Immersive 360 Video Generation from Perspective Anchor
Cite
Code
Project
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Cite
Code
Project
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
PDF
Cite
Code
Project
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
PDF
Cite
Code
Project
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
PDF
Cite
Code
Project
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
PDF
Cite
Code
Project
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
PDF
Cite
Code
Project
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data
PDF
Cite
Code
Project
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
PDF
Cite
Code
Project
Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials
PDF
Cite
Code
Project
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
PDF
Cite
Code
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
PDF
Cite
Project
3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
PDF
Cite
Code
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
PDF
Cite
Code
Project
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
PDF
Cite
Code
Project
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
PDF
Cite
Code
Project
Large-vocabulary 3d diffusion model with transformer
PDF
Cite
Code
Project
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image
Cite
Project
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
PDF
Cite
Code
Project
V3Det: Vast Vocabulary Visual Detection Dataset
PDF
Cite
Code
Project
Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction
PDF
Cite
Code
Few-Shot Object Detection via Association and DIscrimination
PDF
Cite
Code
Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion
PDF
Cite
Code
Adversarial Robustness under Long-Tailed Distribution
PDF
Cite
Code
Towards Evaluating and Training Verifiably Robust Neural Networks
PDF
Cite
Code
Shaping Deep Feature Space towards Gaussian Mixture for Visual Classification
PDF
Cite
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets
PDF
Cite
Code
Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation
PDF
Cite
Project
Physical Adversarial Attack on Vehicle Detector in the Carla Simulator
PDF
Cite
Visual-friendly Aesthetic QR Code Generation using Image Style Transfer
PDF
Cite
Cite
×