Showing 1–20 of 30 results
Date          Name
Mar 7, 2026   Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning
Jan 25, 2026  dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition
Jul 14, 2025  DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis
Aug 8, 2025   Llasa+: Free Lunch for Accelerated and Streaming Llama-Based Speech Synthesis
Feb 25, 2026  EmoOmni: Bridging Emotional Understanding and Expression in Omni-Modal LLMs
Oct 25, 2012  Some spacetimes containing non-rotating extremal isolated horizons
Feb 19, 2015  Lovelock-Brans-Dicke gravity
Sep 15, 2014  Friedmann equations from nonequilibrium thermodynamics of the Universe: A unified formulation for modified gravity
Apr 30, 2014  Lessons from $f(R,R_c^2,R_m^2, L_m)$ gravity: Smooth Gauss-Bonnet limit, energy-momentum conservation and nonminimal coupling
Dec 30, 2015  Thermal relics as hot, warm and cold dark matter in power-law $f(R)$ gravity
Nov 10, 2015  Big Bang nucleosynthesis and baryogenesis in power-law $f(R)$ gravity: Revised constraints from the semianalytical approach
Aug 10, 2015  Traversable wormholes and energy conditions in Lovelock-Brans-Dicke gravity
Jul 27, 2015  Local energy-momentum conservation in scalar-tensor-like gravity with generic curvature invariants
Nov 24, 2014  Apparent horizon and gravitational thermodynamics of the Universe: Solutions to the temperature and entropy confusions, and extensions to modified gravity
Dec 12, 2024  YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls
Jan 28, 2025  CosyAudio: Improving Audio Generation with Confidence Scores and Synthetic Captions
Oct 9, 2025   DialoSpeech: Dual-Speaker Dialogue Generation with LLM and Flow Matching
Feb 25, 2025  Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought
Oct 1, 2025   PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation
Dec 22, 2024  KALL-E: Autoregressive Speech Synthesis with Next-Distribution Prediction