Showing 1–12 of 12 results
Date / Name
Dec 18, 2025 / Smile on the Face, Sadness in the Eyes: Bridging the Emotion Gap with a Multimodal Dataset of Eye and Facial Behaviors
Aug 25, 2025 / InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Jul 19, 2025 / Docopilot: Improving Multimodal Models for Document-Level Understanding
Jul 7, 2025 / Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
May 30, 2025 / Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
Jan 17, 2025 / FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis
Nov 8, 2024 / Smile upon the Face but Sadness in the Eyes: Emotion Recognition based on Facial Expressions and Eye Behaviors
Jun 12, 2024 / OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Apr 26, 2024 / Open-Set Video-based Facial Expression Recognition with Human Expression-sensitive Prompting
May 18, 2023 / VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
May 4, 2023 / Noise-Resistant Multimodal Transformer for Emotion Recognition
Jun 23, 2022 / CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose