Showing 1–20 of 30 results
/ Date/ Name
Apr 24, 2026Agentic World Modeling: Foundations, Capabilities, Laws, and BeyondMar 17, 2026Demystifing Video ReasoningMar 16, 2026HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene InteractionsFeb 23, 2026A Very Big Video Reasoning SuiteJan 29, 2026DynamicVLA: A Vision-Language-Action Model for Dynamic Object ManipulationDec 11, 2025WorldLens: Full-Spectrum Evaluations of Driving World Models in Real WorldOct 26, 2025IGGT: Instance-Grounded Geometry Transformer for Semantic 3D ReconstructionAug 18, 20254DNeX: Feed-Forward 4D Generative Modeling Made EasyAug 7, 2025Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical ValidityJul 28, 2025Reconstructing 4D Spatial Intelligence: A SurveyJul 2, 2025FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion ModelMar 25, 2025AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion TransformersJan 14, 2025Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion ModelsDec 4, 2024Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular VideosOct 9, 2024AvatarGO: Zero-shot 4D Human-Object Interaction Generation and AnimationOct 7, 2024GS-VTON: Controllable 3D Virtual Try-on with Gaussian SplattingAug 24, 2024Transmissive RIS Enabled Transceiver Systems:Architecture, Design Issues and OpportunitiesJul 10, 2024VEnhancer: Generative Space-Time Enhancement for Video GenerationJul 8, 2024CrowdMoGen: Zero-Shot Text-Driven Collective Motion GenerationJun 10, 2024Generative Gaussian Splatting for Unbounded 3D City Generation