Date          Name
Mar 19, 2026  TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation
Mar 18, 2026  VISTA: Validation-Guided Integration of Spatial and Temporal Foundation Models with Anatomical Decoding for Rare-Pathology VCE Event Detection
Mar 18, 2026  AHOY! Animatable Humans under Occlusion from YouTube Videos with Gaussian Splatting and Video Diffusion Priors
Mar 18, 2026  TAPESTRY: From Geometry to Appearance via Consistent Turntable Videos
Mar 17, 2026  LLM-Powered Flood Depth Estimation from Social Media Imagery: A Vision-Language Model Framework with Mechanistic Interpretability for Transportation Resilience
Mar 17, 2026  Demystifying Video Reasoning
Mar 17, 2026  Unified Removal of Raindrops and Reflections: A New Benchmark and A Novel Pipeline
Mar 16, 2026  HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions
Mar 15, 2026  RL-ScanIQA: Reinforcement-Learned Scanpaths for Blind 360° Image Quality Assessment
Mar 14, 2026  Towards Generalizable Deepfake Detection via Real Distribution Bias Correction
Mar 13, 2026  Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models
Mar 11, 2026  Neural Field Thermal Tomography: A Differentiable Physics Framework for Non-Destructive Evaluation
Mar 11, 2026  AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory
Mar 10, 2026  InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
Mar 10, 2026  Geometry-Aware Semantic Reasoning for Training Free Video Anomaly Detection
Mar 9, 2026   TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery
Mar 9, 2026   Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational Modeling
Mar 8, 2026   Generalization in Online Reinforcement Learning for Mobile Agents
Mar 7, 2026   StructSAM: Structure- and Spectrum-Preserving Token Merging for Segment Anything Models
Mar 6, 2026   Word-Anchored Temporal Forgery Localization