arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Akshaj Gupta"" — arXiv2 Search
Showing 1–6 of 6 results
/ Date
/ Name
Oct 2, 2025
TART: A Comprehensive Tool for Technique-Aware Audio-to-Tab Guitar Transcription
Oct 8, 2025
AV-EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Omni-modal LLMS with Audio-visual Cues
Feb 11, 2026
Conversational Behavior Modeling Foundation Model With Multi-Level Perception
May 11, 2026
MolSight: Molecular Property Prediction with Images
Dec 25, 2025
Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech
Aug 5, 2025
LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness