Date / Name
Dec 11, 2025 / VL-JEPA: Joint Embedding Predictive Architecture for Vision-language
Dec 11, 2025 / Mull-Tokens: Modality-Agnostic Latent Thinking
Dec 11, 2025 / Topology-Agnostic Animal Motion Generation from Text Prompt
Dec 9, 2025 / MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance
Dec 9, 2025 / Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps
Dec 7, 2025 / Spatial Retrieval Augmented Autonomous Driving
Dec 5, 2025 / Edit-aware RAW Reconstruction
Dec 4, 2025 / Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification
Dec 3, 2025 / CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding
Dec 2, 2025 / SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control
Dec 1, 2025 / EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly
Nov 29, 2025 / FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal
Nov 28, 2025 / Mammo-FM: Breast-specific foundational model for Integrated Mammographic Diagnosis, Prognosis, and Reporting
Nov 28, 2025 / GeoWorld: Unlocking the Potential of Geometry Models to Facilitate High-Fidelity 3D Scene Generation
Nov 28, 2025 / DiskChunGS: Large-Scale 3D Gaussian SLAM Through Chunk-Based Memory Management
Nov 27, 2025 / Wukong's 72 Transformations: High-fidelity Textured 3D Morphing via Flow Models
Nov 26, 2025 / Monet: Reasoning in Latent Visual Space Beyond Images and Language
Nov 25, 2025 / DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion
Nov 25, 2025 / Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning
Nov 24, 2025 / UMCL: Unimodal-generated Multimodal Contrastive Learning for Cross-compression-rate Deepfake Detection