Date          Name
Apr 22, 2026  Amodal SAM: A Unified Amodal Segmentation Framework with Generalization
Apr 22, 2026  Lifecycle-Aware Federated Continual Learning in Mobile Autonomous Systems
Apr 22, 2026  Render-in-the-Loop: Vector Graphics Generation via Visual Self-Feedback
Apr 22, 2026  GeoRelight: Learning Joint Geometrical Relighting and Reconstruction with Flexible Multi-Modal Diffusion Transformers
Apr 22, 2026  SSL-R1: Self-Supervised Visual Reinforcement Post-Training for Multimodal Large Language Models
Apr 22, 2026  R-CoV: Region-Aware Chain-of-Verification for Alleviating Object Hallucinations in LVLMs
Apr 22, 2026  The Expense of Seeing: Attaining Trustworthy Multimodal Reasoning Within the Monolithic Paradigm
Apr 22, 2026  MAPRPose: Mask-Aware Proposal and Amodal Refinement for Multi-Object 6D Pose Estimation
Apr 22, 2026  RSRCC: A Remote Sensing Regional Change Comprehension Benchmark Constructed via Retrieval-Augmented Best-of-N Ranking
Apr 22, 2026  Beyond ZOH: Advanced Discretization Strategies for Vision Mamba
Apr 22, 2026  Physics-Informed Conditional Diffusion for Motion-Robust Retinal Temporal Laser Speckle Contrast Imaging
Apr 22, 2026  Structure-Augmented Standard Plane Detection with Temporal Aggregation in Blind-Sweep Fetal Ultrasound
Apr 22, 2026  On the Impact of Face Segmentation-Based Background Removal on Recognition and Morphing Attack Detection
Apr 22, 2026  Where are they looking in the operating room?
Apr 22, 2026  Exploring Spatial Intelligence from a Generative Perspective
Apr 22, 2026  Evian: Towards Explainable Visual Instruction-tuning Data Auditing
Apr 22, 2026  RefAerial: A Benchmark and Approach for Referring Detection in Aerial Images
Apr 22, 2026  AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe
Apr 22, 2026  From Image to Music Language: A Two-Stage Structure Decoding Approach for Complex Polyphonic OMR
Apr 22, 2026  CHASM: Unveiling Covert Advertisements on Chinese Social Media