Showing 1–20 of 28 results
/ Date/ Name
Oct 14, 2024Agent-as-a-Judge: Evaluate Agents with AgentsMay 26, 2023Mindstorms in Natural Language-Based Societies of MindFeb 2, 2023QR-CLIP: Introducing Explicit Open-World Knowledge for Location and Time ReasoningApr 7, 2026Neural ComputersMar 30, 2021Kaleido-BERT: Vision-Language Pre-training on Fashion DomainJul 17, 2024Goldfish: Vision-Language Understanding of Arbitrarily Long VideosFeb 28, 2024Data Interpreter: An LLM Agent For Data ScienceAug 17, 2025You Don't Know Until You Click:Automated GUI Testing for Production-Ready Software EvaluationJan 8, 2026VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering TwiceApr 9, 2026Small Vision-Language Models are Smart Compressors for Long Video UnderstandingAug 1, 2023MetaGPT: Meta Programming for A Multi-Agent Collaborative FrameworkMar 11, 2025Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language ModelsMar 31, 2025Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe SystemsOct 14, 2024AFlow: Automating Agentic Workflow GenerationNov 5, 2021Fast Camouflaged Object Detection via Edge-based Reversible Re-calibration NetworkFeb 26, 2024Language Agents as Optimizable GraphsOct 24, 2025Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving MachineMar 19, 2026dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language ModelsJan 19, 2021Salient Object Detection via Integrity LearningMar 8, 2022Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs