Showing 1–20 of 38 results
/ Date/ Name
Mar 9, 2026HiAR: Efficient Autoregressive Long Video Generation via Hierarchical DenoisingMar 24, 2022GX-Plug: a Middleware for Plugging Accelerators to Distributed Graph ProcessingMar 18, 2026EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and UnderstandingOct 15, 2025Uni-MMMU: A Massive Multi-discipline Multimodal Unified BenchmarkOct 8, 2024Enhancing SPARQL Generation by Triplet-order-sensitive Pre-trainingAug 8, 2025BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research AgentMar 27, 2025VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic FaithfulnessFeb 10, 2026Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment LocalizationOct 20, 2023Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence TasksJun 26, 2024MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math DataJan 11, 2025Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMsSep 1, 2025VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool UseOct 19, 2025Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert SignaturesNov 25, 2024Cavity-Quantum Electrodynamics with Moiré Flatband Photonic CrystalsApr 9, 2026ClawBench: Can AI Agents Complete Everyday Online Tasks?Mar 21, 2026SWE-Next: Scalable Real-World Software Engineering Tasks for AgentsDec 5, 2023Inherent limitations of LLMs regarding spatial informationDec 12, 2020Fractal superconducting nanowires detect infrared single photons with 84% system detection efficiency, 1.02 polarization sensitivity, and 20.8 ps timing resolutionMar 22, 2020A platform for high performance photon correlation measurementsMay 7, 2024Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework