Showing 1–20 of 45 results
/ Date/ Name
Mar 8, 2022SpeechFormer: A Hierarchical Efficient Framework Incorporating the Characteristics of SpeechJun 9, 2017Manifold Regularized Slow Feature Analysis for Dynamic Texture RecognitionSep 9, 2024AS-Speech: Adaptive Style For Speech SynthesisApr 23, 2025SoCov: Semi-Orthogonal Parametric Pooling of Covariance Matrix for Speaker RecognitionSep 19, 2025MNV-17: A High-Quality Performative Mandarin Dataset for Nonverbal Vocalization Recognition in SpeechOct 15, 2025MimicParts: Part-aware Style Injection for Speech-Driven 3D Motion GenerationJul 11, 2025Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in ConversationFeb 24, 2026MERRY: Semantically Decoupled Evaluation of Multimodal Emotional and Role Consistencies of Role-Playing AgentsMay 14, 2019Listwise View Ranking for Image CroppingMar 4, 2024PointCore: Efficient Unsupervised Point Cloud Anomaly Detector Using Local-Global FeaturesOct 24, 2023BianQue: Balancing the Questioning and Suggestion Ability of Health LLMs with Multi-turn Health Conversations Polished by ChatGPTJul 3, 2024Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking DatasetSep 2, 2025Think2Sing: Orchestrating Structured Motion Subtitles for Singing-Driven 3D Head AnimationOct 13, 2025BridgeCode: A Dual Speech Representation Paradigm for Autoregressive Zero-Shot Text-to-Speech SynthesisSep 23, 2025HD-PPT: Hierarchical Decoding of Content- and Prompt-Preference Tokens for Instruction-based TTSNov 1, 2023SoulChat: Improving LLMs' Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy ConversationsMar 5, 2024FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing ModelMar 12, 2024Towards Zero-shot Human-Object Interaction Detection via Vision-Language IntegrationFeb 27, 2023DST: Deformable Speech Transformer for Emotion RecognitionApr 23, 2019BIT: Biologically Inspired Tracker