Date / Name
Sep 25, 2025 / Rethinking Explainable Disease Prediction: Synergizing Accuracy and Reliability via Reflective Cognitive Architecture
Jun 28, 2024 / ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
May 16, 2025 / DRAGON: Domain-specific Robust Automatic Data Generation for RAG Optimization
Feb 12, 2026 / MEME: Modeling the Evolutionary Modes of Financial Markets
Feb 12, 2026 / AlphaPROBE: Alpha Mining via Principled Retrieval and On-graph biased evolution
May 15, 2025 / MASS: Multi-agent simulation scaling for portfolio construction
Jan 13, 2026 / M3-BENCH: Process-Aware Evaluation of LLM Agents' Social Behaviors in Mixed-Motive Games
Jan 10, 2026 / BabyVision: Visual Reasoning Beyond Language
Jan 5, 2026 / MDAgent2: Large Language Model for Code Generation and Knowledge Q&A in Molecular Dynamics
Oct 28, 2025 / Tongyi DeepResearch Technical Report
Oct 17, 2025 / Accelerating Mobile Language Model via Speculative Decoding and NPU-Coordinated Execution
Jul 20, 2025 / WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization
Oct 28, 2025 / ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking
Oct 28, 2025 / WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking
Apr 23, 2025 / PixelWeb: The First Web GUI Dataset with Pixel-Wise Labels