Showing 1–18 of 18 results
/ Date/ Name
Apr 24, 2020Deep 3D Portrait from a Single ImageJul 26, 2025Neural Estimation of the Information Bottleneck Based on a Mapping ApproachJul 21, 2025Estimating Rate-Distortion Functions Using the Energy-Based ModelSep 5, 2023AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image CollectionsApr 15, 2026Beyond Voxel 3D Editing: Learning from 3D Masks and Self-Constructed DataMar 20, 2019Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image SetJul 3, 2025MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp DetailsOct 24, 2025Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity VideosFeb 28, 2023RemoteTouch: Enhancing Immersive 3D Video Communication with Hand TouchOct 24, 2024MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training SupervisionDec 2, 2024Structured 3D Latents for Scalable and Versatile 3D GenerationDec 16, 2025VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single ImageOct 22, 2025Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic ScenesMar 26, 2026HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language ModelsApr 16, 2024VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real TimeDec 16, 2025Native and Compact Structured Latents for 3D GenerationNov 29, 2024CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic ManipulationJul 31, 2025Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis