"au:"Shiyin Kang"" — arXiv2 Search
Showing 1–2 of 2 results
Oct 15, 2025
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue
Jan 6, 2020
Audio-visual Recognition of Overlapped speech for the LRS2 dataset