"au:"Sicheng Xu"" — arXiv2 Search

Showing 1–18 of 18 results

Apr 24, 2020  Deep 3D Portrait from a Single Image
Jul 26, 2025  Neural Estimation of the Information Bottleneck Based on a Mapping Approach
Jul 21, 2025  Estimating Rate-Distortion Functions Using the Energy-Based Model
Sep 5, 2023  AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections
Apr 15, 2026  Beyond Voxel 3D Editing: Learning from 3D Masks and Self-Constructed Data
Mar 20, 2019  Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set
Jul 3, 2025  MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details
Oct 24, 2025  Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Feb 28, 2023  RemoteTouch: Enhancing Immersive 3D Video Communication with Hand Touch
Oct 24, 2024  MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Dec 2, 2024  Structured 3D Latents for Scalable and Versatile 3D Generation
Dec 16, 2025  VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image
Oct 22, 2025  Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic Scenes
Mar 26, 2026  HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models
Apr 16, 2024  VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Dec 16, 2025  Native and Compact Structured Latents for 3D Generation
Nov 29, 2024  CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
Jul 31, 2025  Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis