Date / Name
Apr 24, 2026 / Uni-Encoder Meets Multi-Encoders: Representation Before Fusion for Brain Tumor Segmentation with Missing Modalities
Apr 2, 2026 / Director: Instance-aware Gaussian Splatting for Dynamic Scene Modeling and Understanding
Dec 7, 2025 / Multi-Accent Mandarin Dry-Vocal Singing Dataset: Benchmark for Singing Accent Recognition
Dec 7, 2025 / Singing Timbre Popularity Assessment Based on Multimodal Large Foundation Model
Dec 2, 2025 / DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
Nov 12, 2025 / Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation
Sep 17, 2025 / Assessing Data Replication in Symbolic Music via Adapted Structural Similarity Index Measure
May 22, 2025 / Losing is for Cherishing: Data Valuation Based on Machine Unlearning and Shapley Value
May 11, 2025 / Seed1.5-VL Technical Report
Apr 1, 2025 / A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
Jan 24, 2025 / Humanity's Last Exam
Jul 3, 2024 / MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation
Feb 15, 2024 / MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music
Sep 19, 2023 / MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
May 14, 2023 / REMAST: Real-time Emotion-based Music Arrangement with Soft Transition
Feb 3, 2023 / Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents