Date          Name
Apr 23, 2026  Pre-process for segmentation task with nonlinear diffusion filters
Apr 23, 2026  S1-VL: Scientific Multimodal Reasoning Model with Thinking-with-Images
Apr 23, 2026  You Only Gaussian Once: Controllable 3D Gaussian Splatting for Ultra-Densely Sampled Scenes
Apr 23, 2026  VG-CoT: Towards Trustworthy Visual Reasoning via Grounded Chain-of-Thought
Apr 23, 2026  Supervised Learning Has a Necessary Geometric Blind Spot: Theory, Consequences, and Minimal Repair
Apr 23, 2026  EdgeFormer: local patch-based edge detection transformer on point clouds
Apr 23, 2026  KD-CVG: A Knowledge-Driven Approach for Creative Video Generation
Apr 23, 2026  Prototype-Based Test-Time Adaptation of Vision-Language Models
Apr 23, 2026  SparseGF: A Height-Aware Sparse Segmentation Framework with Context Compression for Robust Ground Filtering Across Urban to Natural Scenes
Apr 23, 2026  Trust-SSL: Additive-Residual Selective Invariance for Robust Aerial Self-Supervised Learning
Apr 23, 2026  Symbolic Grounding Reveals Representational Bottlenecks in Abstract Visual Reasoning
Apr 23, 2026  Beyond Single Plots: A Benchmark for Question Answering on Multi-Charts
Apr 23, 2026  Latent Denoising Improves Visual Alignment in Large Multimodal Models
Apr 23, 2026  Teacher-Guided Routing for Sparse Vision Mixture-of-Experts
Apr 23, 2026  MiMIC: Mitigating Visual Modality Collapse in Universal Multimodal Retrieval While Avoiding Semantic Misalignment
Apr 23, 2026  Temporal Prototyping and Hierarchical Alignment for Unsupervised Video-based Visible-Infrared Person Re-Identification
Apr 23, 2026  FryNet: Dual-Stream Adversarial Fusion for Non-Destructive Frying Oil Oxidation Assessment
Apr 23, 2026  PLAS-Net: Pixel-Level Area Segmentation for UAV-Based Beach Litter Monitoring
Apr 23, 2026  The First Challenge on Remote Sensing Infrared Image Super-Resolution at NTIRE 2026: Benchmark Results and Method Overview
Apr 23, 2026  an interpretable vision transformer framework for automated brain tumor classification