Date | Name
Mar 15, 2023 | Bi-directional Distribution Alignment for Transductive Zero-Shot Learning
Mar 28, 2024 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
Jul 15, 2022 | Parameterization of Cross-Token Relations with Relative Positional Encoding for Vision MLP
Jul 16, 2024 | Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
Mar 11, 2021 | Flatband-Induced Itinerant Ferromagnetism in RbCo$_2$Se$_2$
Jul 3, 2024 | BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
Feb 27, 2026 | Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation
Mar 13, 2023 | Backdoor Defense via Deconfounded Representation Learning
Feb 5, 2024 | DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation
Oct 31, 2023 | Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion
Oct 19, 2025 | Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input
Nov 13, 2025 | Causal-HalBench: Uncovering LVLMs Object Hallucinations Through Causal Intervention
Apr 22, 2026 | Mitigating Hallucinations in Large Vision-Language Models without Performance Degradation
Jul 4, 2025 | Dynamic Multimodal Prototype Learning in Vision-Language Models
Aug 11, 2025 | UniSVG: A Unified Dataset for Vector Graphic Understanding and Generation with Multimodal Large Language Models
Jul 3, 2024 | PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition