Date / Name
Apr 24, 2026 / UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions
Mar 26, 2026 / Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
Aug 28, 2025 / NPG-Muse: Scaling Long Chain-of-Thought Reasoning with NP-Hard Graph Problems
Aug 1, 2025 / AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation
Jul 17, 2025 / Apple Intelligence Foundation Language Models: Tech Report 2025
Jun 24, 2025 / Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Oct 23, 2023 / An Aluminum-coated sCMOS sensor for X-Ray Astronomy
May 14, 2023 / REMAST: Real-time Emotion-based Music Arrangement with Soft Transition
Mar 15, 2023 / Investigating the image lag of a scientific CMOS sensor in X-ray detection
Nov 1, 2022 / SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation
Sep 30, 2022 / X-ray performance of a customized large-format scientific CMOS detector
Mar 25, 2022 / Automatic Song Translation for Tonal Languages
Sep 20, 2021 / TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Sep 16, 2021 / PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Jun 14, 2020 / UWSpeech: Speech to Speech Translation for Unwritten Languages