Tong Wu's Homepage
Tong Wu's Homepage
Home
Experience
Publications
Light
Dark
Automatic
Publications
Type
Conference paper
Journal article
Report
Date
2024
2023
2022
2021
2020
2019
Imagine360: Immersive 360 Video Generation from Perspective Anchor
Cite
Code
Project
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Cite
Code
Project
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
PDF
Cite
Code
Project
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
PDF
Cite
Code
Project
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
PDF
Cite
Code
Project
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
PDF
Cite
Code
Project
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
PDF
Cite
Code
Project
Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data
PDF
Cite
Code
Project
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
PDF
Cite
Code
Project
Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials
PDF
Cite
Code
Project
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
PDF
Cite
Code
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
PDF
Cite
Project
3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors
PDF
Cite
Code
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
PDF
Cite
Code
Project
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
PDF
Cite
Code
Project
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
PDF
Cite
Code
Project
Large-vocabulary 3d diffusion model with transformer
PDF
Cite
Code
Project
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image
Cite
Project
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
PDF
Cite
Code
Project
V3Det: Vast Vocabulary Visual Detection Dataset
PDF
Cite
Code
Project
Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction
PDF
Cite
Code
Few-Shot Object Detection via Association and DIscrimination
PDF
Cite
Code
Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion
PDF
Cite
Code
Adversarial Robustness under Long-Tailed Distribution
PDF
Cite
Code
Towards Evaluating and Training Verifiably Robust Neural Networks
PDF
Cite
Code
Shaping Deep Feature Space towards Gaussian Mixture for Visual Classification
PDF
Cite
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets
PDF
Cite
Code
Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation
PDF
Cite
Project
Physical Adversarial Attack on Vehicle Detector in the Carla Simulator
PDF
Cite
Visual-friendly Aesthetic QR Code Generation using Image Style Transfer
PDF
Cite
Cite
×