Computer Vision

cs.CV

Showing 501–520 of 2,609 results

Sep 29, 2025 Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm
Sep 28, 2025 SIE3D: Single-Image Expressive 3D Avatar Generation via Semantic Embedding and Perceptual Expression Loss
Sep 28, 2025 Revisit the Imbalance Optimization in Multi-task Learning: An Experimental Analysis
Sep 28, 2025 Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution
Sep 28, 2025 HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation
Sep 28, 2025 EfficientMIL: Efficient Linear-Complexity MIL Method for WSI Classification
Sep 26, 2025 MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Sep 26, 2025 PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data
Sep 26, 2025 PANICL: Mitigating Over-Reliance on Single Prompt in Visual In-Context Learning
Sep 26, 2025 Multimodal Neural Operators for Real-Time Biomechanical Modelling of Traumatic Brain Injury
Sep 25, 2025 Nuclear Diffusion Models for Low-Rank Background Suppression in Videos
Sep 25, 2025 QuadGPT: Native Quadrilateral Mesh Generation with Autoregressive Models
Sep 24, 2025 Seedream 4.0: Toward Next-generation Multimodal Image Generation
Sep 24, 2025 OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving
Sep 22, 2025 TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs
Sep 22, 2025 Qwen3-Omni Technical Report
Sep 22, 2025 COLA: Context-aware Language-driven Test-time Adaptation
Sep 21, 2025 LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object Detection
Sep 20, 2025 ADVEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents
Sep 18, 2025 A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making
