/ Date/ Name

Computer Vision

cs.CV

/ Date/ Name

/ Date/ Name

Showing 641–660 of 2,609 results

/ Date/ Name

Jun 5, 2025Degradation-Aware Image Enhancement via Vision-Language Classification Jun 4, 2025WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning Jun 4, 2025Object-centric 3D Motion Field for Robot Learning from Human Videos Jun 4, 2025Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset Jun 3, 2025SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios Jun 2, 2025EarthMind: Leveraging Cross-Sensor Data for Advanced Earth Observation Interpretation with a Unified Multimodal LLM Jun 2, 2025Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models Jun 1, 2025Towards Predicting Any Human Trajectory In Context May 30, 2025Applying Vision Transformers on Spectral Analysis of Astronomical Objects May 30, 2025Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces May 30, 2025Reading Recognition in the Wild May 30, 2025DisTime: Distribution-based Time Representation for Video Large Language Models May 29, 2025ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding May 29, 2025Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought May 29, 2025OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation May 29, 2025Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information May 29, 2025Quality assessment of 3D human animation: Subjective and objective evaluation May 29, 2025iHDR: Iterative HDR Imaging with Arbitrary Number of Exposures May 28, 2025Test-time augmentation improves efficiency in conformal prediction May 28, 2025VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models

← Previous Next →