arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Junkai Wu"" — arXiv2 Search
Showing 1–8 of 8 results
/ Date
/ Name
Sep 7, 2024
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue
Jan 25, 2026
AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking
May 15, 2022
Learning Representations for New Sound Classes With Continual Self-Supervised Learning
Dec 17, 2023
Meta-AF Echo Cancellation for Improved Keyword Spotting
Sep 19, 2025
SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models
May 11, 2025
Bridging Ears and Eyes: Analyzing Audio and Visual Large Language Models to Humans in Visible Sound Recognition and Reducing Their Sensory Gap via Cross-Modal Distillation
Sep 20, 2022
Meta-Learning for Adaptive Filters with Higher-Order Frequency Dependencies
May 3, 2023
Unsupervised Improvement of Audio-Text Cross-Modal Representations