Showing 1–20 of 52 results
Date / Name
Oct 25, 2022 / Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using $β$-VAE
Jan 30, 2026 / Universal Adversarial Attacks against Closed-Source MLLMs via Target-View Routed Meta Optimization
Aug 29, 2023 / Global structure of the spectrum of periodic Non-hermitian Jacobi operators
Nov 26, 2025 / When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models
Dec 2, 2022 / Private Multiparty Perception for Navigation
Jul 7, 2021 / VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis
Mar 18, 2024 / TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions
Dec 11, 2023 / Compensation Sampling for Improved Convergence in Diffusion Models
Jul 18, 2024 / Autonomous self-evolving research on biomedical data: the DREAM paradigm
Jun 27, 2024 / Snakes and Ladders: Two Steps Up for VideoMamba
Mar 24, 2024 / Enhancing Video Transformers for Action Understanding with VLM-aided Training
Nov 22, 2025 / MambaTAD: When State-Space Models Meet Long-Range Temporal Action Detection
Mar 22, 2023 / CH-Go: Online Go System Based on Chunk Data Storage
Jul 19, 2021 / Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks
Mar 2, 2022 / A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS
Oct 18, 2015 / On Termination of Polynomial Programs with Equality Conditions
Jul 3, 2024 / Investigating Decoder-only Large Language Models for Speech-to-text Translation
Mar 22, 2025 / Architectural and System Implications of CXL-enabled Tiered Memory
Feb 23, 2025 / VidLBEval: Benchmarking and Mitigating Language Bias in Video-Involved LVLMs
Jun 16, 2025 / FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception