arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Jun Song"" — arXiv2 Search
Showing 1–4 of 4 results
/ Date
/ Name
Feb 24, 2026
How Foundational Skills Influence VLM-based Embodied Agents:A Native Perspective
Nov 13, 2025
Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS
Sep 9, 2025
VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions
Aug 25, 2025
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation