Showing 1–20 of 27 results
/ Date/ Name
Sep 28, 2024Multi-sensor Learning Enables Information Transfer across Different Sensory Data and Augments Multi-modality ImagingFeb 12, 2026AssetFormer: Modular 3D Assets Generation with Autoregressive TransformerAug 27, 2023Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few ExemplarsJan 21, 2024EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian SplattingMar 16, 2023Taming Diffusion Models for Audio-Driven Co-Speech Gesture GenerationMar 24, 2025MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-ProcessingSep 26, 2025Large Material Gaussian Model for Relightable 3D GenerationJul 19, 2023Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI SynthesisFeb 13, 2025Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian SplattingMar 19, 2024Generative Enhancement for 3D Medical ImagesJun 10, 2024Generalizable Human Gaussians from Single-View ImageNov 24, 2025LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination ContextJan 18, 2024CustomVideo: Customizing Text-to-Video Generation with Multiple SubjectsFeb 28, 2026Cross-Scale Pansharpening via ScaleFormer and the PanScale BenchmarkMay 28, 2024HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene ReconstructionFeb 12, 2025AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse GuidanceDec 2, 2025ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement LearningMar 13, 2025Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image GenerationNov 20, 2025V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation ModelsMay 26, 2025StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation