/ Date/ Name

Computer Vision

cs.CV

Showing 381–400 of 2,609 results

Dec 31, 2025 TeleWorld: Towards Dynamic Multimodal Synthesis with a 4D World Model
Dec 29, 2025 Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion
Dec 27, 2025 Towards Robust Optical-SAR Object Detection under Missing Modalities: A Dynamic Quality-Aware Fusion Framework
Dec 23, 2025 CHAMMI-75: Pre-training multi-channel models with heterogeneous microscopy images
Dec 23, 2025 Few-Shot-Based Modular Image-to-Video Adapter for Diffusion Models
Dec 22, 2025 Widget2Code: From Visual Widgets to UI Code via Multimodal LLMs
Dec 22, 2025 BabyFlow: 3D modeling of realistic and expressive infant faces
Dec 21, 2025 A Study of Finetuning Video Transformers for Multi-view Geometry Tasks
Dec 18, 2025 EasyV2V: A High-quality Instruction-based Video Editing Framework
Dec 18, 2025 Smile on the Face, Sadness in the Eyes: Bridging the Emotion Gap with a Multimodal Dataset of Eye and Facial Behaviors
Dec 16, 2025 Artificial Intelligence for the Assessment of Peritoneal Carcinosis during Diagnostic Laparoscopy for Advanced Ovarian Cancer
Dec 15, 2025 Adapting MLLMs for Nuanced Video Retrieval
Dec 15, 2025 Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model
Dec 15, 2025 Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
Dec 15, 2025 Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance
Dec 15, 2025 Comprehensive Deployment-Oriented Assessment for Cross-Environment Generalization in Deep Learning-Based mmWave Radar Sensing
Dec 12, 2025 MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator
Dec 12, 2025 The N-Body Problem: Parallel Execution from Single-Person Egocentric Video
Dec 11, 2025 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Dec 11, 2025 Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization
