Showing 501–520 of 2,609 results
/ Date/ Name
Sep 29, 2025Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and AlgorithmSep 28, 2025SIE3D: Single-Image Expressive 3D Avatar Generation via Semantic Embedding and Perceptual Expression LossSep 28, 2025Revisit the Imbalance Optimization in Multi-task Learning: An Experimental AnalysisSep 28, 2025Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-ResolutionSep 28, 2025HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and GenerationSep 28, 2025EfficientMIL: Efficient Linear-Complexity MIL Method for WSI ClassificationSep 26, 2025MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document ParsingSep 26, 2025PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D DataSep 26, 2025PANICL: Mitigating Over-Reliance on Single Prompt in Visual In-Context LearningSep 26, 2025Multimodal Neural Operators for Real-Time Biomechanical Modelling of Traumatic Brain InjurySep 25, 2025Nuclear Diffusion Models for Low-Rank Background Suppression in VideosSep 25, 2025QuadGPT: Native Quadrilateral Mesh Generation with Autoregressive ModelsSep 24, 2025Seedream 4.0: Toward Next-generation Multimodal Image GenerationSep 24, 2025OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous DrivingSep 22, 2025TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMsSep 22, 2025Qwen3-Omni Technical ReportSep 22, 2025COLA: Context-aware Language-driven Test-time AdaptationSep 21, 2025LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object DetectionSep 20, 2025ADVEDM:Fine-grained Adversarial Attack against VLM-based Embodied AgentsSep 18, 2025A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making