Showing 1–19 of 19 results
Date          Name
Apr 23, 2026  Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding
Oct 16, 2025  Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents
Oct 10, 2025  Don't Throw Away Your Pretrained Model
May 28, 2025  VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models
May 26, 2025  WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback
May 22, 2025  UNCLE: Benchmarking Uncertainty Expressions in Long-Form Generation
May 20, 2025  Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training
May 6, 2025   Recall with Reasoning: Chain-of-Thought Distillation for Mamba's Long-Context Memory and Extrapolation
Apr 23, 2025  WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model
Apr 16, 2025  WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms
Dec 23, 2024  A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression
Nov 26, 2024  Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Sep 12, 2024  DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?
Jul 20, 2022  Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Mar 29, 2022  Integrating Lattice-Free MMI into End-to-End Speech Recognition
Jan 6, 2022   Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Nov 16, 2020  Audio-visual Multi-channel Integration and Recognition of Overlapped Speech
May 18, 2020  Audio-visual Multi-channel Recognition of Overlapped Speech
Jan 6, 2020   Audio-visual Recognition of Overlapped speech for the LRS2 dataset