Date / Name
Jul 4, 2024 / High Fidelity Text-Guided Music Editing via Single-Stage Flow Matching
Sep 15, 2023 / Stack-and-Delay: a new codebook pattern for music generation
Sep 15, 2023 / Enhance audio generation controllability through representation similarity regularization
Aug 13, 2020 / Automatic Quality Assessment for Audio-Visual Verification Systems. The LOVe submission to NIST SRE Challenge 2019
Jan 18, 2026 / SLAP: Scalable Language-Audio Pretraining with Variable-Duration Audio and Multi-Objective Training
Sep 19, 2023 / Exploring Speech Enhancement for Low-resource Speech Synthesis
Jan 9, 2024 / Masked Audio Generation using a Single Non-Autoregressive Transformer
Oct 8, 2021 / On the invertibility of a voice privacy system using embedding alignement
Aug 14, 2024 / Supervised and Unsupervised Alignments for Spoofing Behavioral Biometrics
Feb 3, 2026 / Conditional Flow Matching for Visually-Guided Acoustic Highlighting
Apr 14, 2026 / RPRA: Predicting an LLM-Judge for Efficient but Performant Inference
Apr 6, 2026 / Free-Range Gaussians: Non-Grid-Aligned Generative 3D Gaussian Reconstruction
Sep 19, 2023 / FoleyGen: Visually-Guided Audio Generation
Nov 1, 2023 / In-Context Prompt Editing For Conditional Audio Generation
Feb 5, 2026 / EgoAVU: Egocentric Audio-Visual Understanding
Mar 14, 2022 / Mobile Behavioral Biometrics for Passive Authentication
Dec 3, 2024 / SyncFlow: Toward Temporally Aligned Joint Audio-Video Generation from Text
Apr 23, 2025 / Scalable and Performant Data Loading
Apr 7, 2026 / Neural Computers
Apr 26, 2026 / Exploring Audio Hallucination in Egocentric Video Understanding