"au:"Ran Xu"" — arXiv2 SearchShowing 1–9 of 9 results
/ Date/ Name
Apr 23, 2026VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI AutomationAug 16, 2024xGen-MM (BLIP-3): A Family of Open Large Multimodal ModelsAug 11, 2023BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous AgentsAug 4, 2023Retroformer: Retrospective Large Language Agents with Policy Gradient OptimizationJul 18, 2023REX: Rapid Exploration and eXploitation for AI AgentsMar 16, 2023HIVE: Harnessing Human Feedback for Instructional Visual EditingDec 19, 2022LayoutDETR: Detection Transformer Is a Good Multimodal Layout DesignerSep 29, 2021MetaHistoSeg: A Python Framework for Meta Learning in Histopathology Image SegmentationAug 28, 2019ApproxNet: Content and Contention-Aware Video Analytics System for Embedded Clients