"au:"Zhicai Wang"" — arXiv2 Search

Showing 1–16 of 16 results

/ Date/ Name

Mar 15, 2023 Bi-directional Distribution Alignment for Transductive Zero-Shot Learning

Mar 28, 2024 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model

Jul 15, 2022 Parameterization of Cross-Token Relations with Relative Positional Encoding for Vision MLP

Jul 16, 2024 Model Inversion Attacks Through Target-Specific Conditional Diffusion Models

Mar 11, 2021 Flatband-Induced Itinerant Ferromagnetism in RbCo$_2$Se$_2$

Jul 3, 2024 BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs

Feb 27, 2026 Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation

Mar 13, 2023 Backdoor Defense via Deconfounded Representation Learning

Feb 5, 2024 DiffsFormer: A Diffusion Transformer on Stock Factor Augmentation

Oct 31, 2023 Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion

Oct 19, 2025 Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input

Nov 13, 2025 Causal-HalBench: Uncovering LVLMs Object Hallucinations Through Causal Intervention

Apr 22, 2026 Mitigating Hallucinations in Large Vision-Language Models without Performance Degradation

Jul 4, 2025 Dynamic Multimodal Prototype Learning in Vision-Language Models

Aug 11, 2025 UniSVG: A Unified Dataset for Vector Graphic Understanding and Generation with Multimodal Large Language Models

Jul 3, 2024 PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition