Showing 1–20 of 27 results
Date          Name
Oct 26, 2020  Contrastive Unsupervised Learning for Audio Fingerprinting
Sep 10, 2018  AAG-Stega: Automatic Audio Generation-based Steganography
Feb 24, 2024  ByteComposer: a Human-like Melody Composition Method based on Language Model Agent
Mar 21, 2023  ByteCover3: Accurate Cover Song Identification on Short Queries
May 2, 2018   End-to-End Residual CNN with L-GM Loss Speaker Verification System
Oct 16, 2023  Joint Music and Language Attention Models for Zero-shot Music Tagging
Oct 27, 2020  ByteCover: Cover Song Identification via Multi-Loss Training
Apr 18, 2019  RepGN: Object Detection with Relational Proposal Graph Network
Jan 2, 2019   End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Dec 15, 2021  Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data
May 11, 2023  Universal Source Separation with Weakly Labelled Data
Apr 8, 2024   Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
Aug 26, 2024  Foundation Models for Music: A Survey
Feb 2, 2022   HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Feb 19, 2021  CatNet: music source separation system with mix-audio augmentation
Feb 19, 2021  Speech enhancement with weakly labelled data from AudioSet
Feb 25, 2025  NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
Mar 18, 2025  RWKV-7 "Goose" with Expressive Dynamic State Evolution
Jun 21, 2021  Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams
Feb 6, 2025   Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis