"au:"Dong Yu"" — arXiv2 Search

Showing 1–19 of 19 results

Apr 23, 2026  Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding
Oct 16, 2025  Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents
Oct 10, 2025  Don't Throw Away Your Pretrained Model
May 28, 2025  VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models
May 26, 2025  WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback
May 22, 2025  UNCLE: Benchmarking Uncertainty Expressions in Long-Form Generation
May 20, 2025  Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training
May 6, 2025  Recall with Reasoning: Chain-of-Thought Distillation for Mamba's Long-Context Memory and Extrapolation
Apr 23, 2025  WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model
Apr 16, 2025  WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms
Dec 23, 2024  A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression
Nov 26, 2024  Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Sep 12, 2024  DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?
Jul 20, 2022  Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Mar 29, 2022  Integrating Lattice-Free MMI into End-to-End Speech Recognition
Jan 6, 2022  Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Nov 16, 2020  Audio-visual Multi-channel Integration and Recognition of Overlapped Speech
May 18, 2020  Audio-visual Multi-channel Recognition of Overlapped Speech
Jan 6, 2020  Audio-visual Recognition of Overlapped speech for the LRS2 dataset