Tong Wu's Homepage
Tong Wu's Homepage
Home
Experience
Publications
Light
Dark
Automatic
Publications
Type
Conference paper
Journal article
Report
Date
2025
2024
2023
2022
2021
2020
2019
Video World Models with Long-term Spatial Memory
PDF
Cite
Project
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
PDF
Cite
Code
Project
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
PDF
Cite
Code
Project
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
PDF
Cite
Code
Project
RelightVid: Temporal-Consistent Diffusion Model for Video Relighting
PDF
Cite
Code
Project
Imagine360: Immersive 360 Video Generation from Perspective Anchor
Cite
Code
Project
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Cite
Code
Project
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
PDF
Cite
Code
Project
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
PDF
Cite
Code
Project
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
PDF
Cite
Code
Project
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
PDF
Cite
Code
Project
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
PDF
Cite
Code
Project
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data
PDF
Cite
Code
Project
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
PDF
Cite
Code
Project
Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials
PDF
Cite
Code
Project
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
PDF
Cite
Code
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
PDF
Cite
Project
3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
PDF
Cite
Code
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
PDF
Cite
Code
Project
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
PDF
Cite
Code
Project
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
PDF
Cite
Code
Project
Large-vocabulary 3d diffusion model with transformer
PDF
Cite
Code
Project
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image
Cite
Project
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
PDF
Cite
Code
Project
V3Det: Vast Vocabulary Visual Detection Dataset
PDF
Cite
Code
Project
Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction
PDF
Cite
Code
Few-Shot Object Detection via Association and DIscrimination
PDF
Cite
Code
Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion
PDF
Cite
Code
Adversarial Robustness under Long-Tailed Distribution
PDF
Cite
Code
Towards Evaluating and Training Verifiably Robust Neural Networks
PDF
Cite
Code
Shaping Deep Feature Space towards Gaussian Mixture for Visual Classification
PDF
Cite
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets
PDF
Cite
Code
Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation
PDF
Cite
Project
Physical Adversarial Attack on Vehicle Detector in the Carla Simulator
PDF
Cite
Visual-friendly Aesthetic QR Code Generation using Image Style Transfer
PDF
Cite
Cite
×