arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Enzhi Wang"" — arXiv2 Search
Showing 1–5 of 5 results
/ Date
/ Name
Sep 18, 2025
Cross-Modal Knowledge Distillation for Speech Large Language Models
Aug 6, 2025
RealTalk-CN: A Realistic Chinese Speech-Text Dialogue Benchmark With Cross-Modal Interaction Analysis
Jul 24, 2025
DIFFA: Large Language Diffusion Models Can Listen and Understand
Sep 21, 2025
Interpretable Audio Editing Evaluation via Chain-of-Thought Difference-Commonality Reasoning with Multimodal LLMs
Aug 15, 2023
Better Zero-Shot Reasoning with Role-Play Prompting