Showing 581–600 of 2,609 results
/ Date/ Name
Jul 27, 2025T$^\text{3}$SVFND: Towards an Evolving Fake News Detector for Emergencies with Test-time Training on Short Video PlatformsJul 27, 2025Detection of Medial Epicondyle Avulsion in Elbow Ultrasound Images via Bone Structure ReconstructionJul 25, 2025Object-centric Video Question Answering with Visual Grounding and ReferringJul 25, 2025Back to the Features: DINO as a Foundation for Video World ModelsJul 25, 2025RealisVSR: Detail-enhanced Diffusion for Real-World 4K Video Super-ResolutionJul 24, 2025LMM-Det: Make Large Multimodal Models Excel in Object DetectionJul 23, 2025MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-trainingJul 22, 2025Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical ReportJul 22, 2025DenseSR: Image Shadow Removal as Dense PredictionJul 19, 2025Docopilot: Improving Multimodal Models for Document-Level UnderstandingJul 18, 2025GOSPA and T-GOSPA quasi-metrics for evaluation of multi-object tracking algorithmsJul 16, 2025Intra-view and Inter-view Correlation Guided Multi-view Novel Class DiscoveryJul 13, 2025WordCraft: Interactive Artistic Typography with Attention Awareness and Noise BlendingJul 12, 2025ProactiveVideoQA: A Comprehensive Benchmark Evaluating Proactive Interactions in Video Large Language ModelsJul 11, 2025MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal FusionJul 10, 2025Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and MethodologyJul 9, 2025Learning from Sparse Point Labels for Dense Carcinosis Localization in Advanced Ovarian Cancer AssessmentJul 9, 2025EXAONE Path 2.0: Pathology Foundation Model with End-to-End SupervisionJul 8, 2025Mamba Goes HoME: Hierarchical Soft Mixture-of-Experts for 3D Medical Image SegmentationJul 8, 2025GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing