Showing 21–40 of 59 results
Date | Name
Feb 26, 2018 | VR-SGD: A Simple Stochastic Variance Reduction Method for Machine Learning
Jan 27, 2025 | FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Oct 30, 2025 | SIRAJ: Diverse and Efficient Red-Teaming for LLM Agents via Distilled Structured Reasoning
Sep 10, 2025 | MESH -- Understanding Videos Like Human: Measuring Hallucinations in Large Video Models
Nov 23, 2025 | $A^2Flow:$ Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators
Oct 22, 2018 | Norm-Range Partition: A Universal Catalyst for LSH based Maximum Inner Product Search (MIPS)
Dec 16, 2021 | On the Finite-Time Complexity and Practical Computation of Approximate Stationarity Concepts of Lipschitz Functions
Jan 29, 2024 | Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA
Feb 5, 2024 | Enhancing Neural Subset Selection: Integrating Background Information into Set Representations
Jul 4, 2025 | Less is More: Empowering GUI Agent with Context-Aware Simplification
Oct 19, 2024 | SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
May 22, 2025 | GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent
May 21, 2025 | GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents
Oct 7, 2025 | Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations
Oct 13, 2025 | More than A Point: Capturing Uncertainty with Adaptive Affordance Heatmaps for Spatial Grounding in Robotic Tasks
Apr 3, 2026 | A Unified Perspective on Adversarial Membership Manipulation in Vision Models
Dec 1, 2025 | Syndrome-Flow Consistency Model Achieves One-step Denoising Error Correction Codes
Feb 24, 2025 | Generative Models in Decision Making: A Survey
Oct 29, 2023 | Does Invariant Graph Learning via Environment Augmentation Learn Invariance?
Nov 27, 2022 | Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning