Date / Name
Apr 24, 2026 / TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis
Apr 24, 2026 / UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions
Apr 23, 2026 / Dilated CNNs for Periodic Signal Processing: A Low-Complexity Approach
Apr 22, 2026 / ONOTE: Benchmarking Omnimodal Notation Processing for Expert-level Music Intelligence
Apr 21, 2026 / Tonnetz Theory, Classical Harmony, and the Combinatorial Geometry of Abstract Musical Resources
Apr 21, 2026 / Self-Noise Reduction for Capacitive Sensors via Photoelectric DC Servo: Application to Condenser Microphones
Apr 20, 2026 / Incremental learning for audio classification with Hebbian Deep Neural Networks
Apr 19, 2026 / HCFD: A Benchmark for Audio Deepfake Detection in Healthcare
Apr 13, 2026 / Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
Apr 6, 2026 / Joint Fullband-Subband Modeling for High-Resolution SingFake Detection
Jan 29, 2026 / Qwen3-ASR Technical Report
Jan 26, 2026 / VIBEVOICE-ASR Technical Report
Jan 22, 2026 / Qwen3-TTS Technical Report
Jan 12, 2026 / Elastic overtones: an equal temperament 12 tone music system with "perfect" fifths
Dec 29, 2025 / MiMo-Audio: Audio Language Models are Few-Shot Learners
Dec 5, 2025 / Noise Suppression for Time Difference of Arrival: Performance Evaluation of a Generalized Cross-Correlation Method Using Mean Signal and Inverse Filter
Nov 13, 2025 / Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS
Nov 12, 2025 / Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation
Sep 22, 2025 / Qwen3-Omni Technical Report
Sep 17, 2025 / Assessing Data Replication in Symbolic Music via Adapted Structural Similarity Index Measure