"au:"Manling Li"" — arXiv2 Search

Showing 1–20 of 75 results

Date / Name

Jan 13, 2022  CLIP-Event: Connecting Text and Images with Event Structures
Apr 13, 2021  The Future is not One-dimensional: Complex Event Schema Induction by Graph Modeling for Event Prediction
May 8, 2025  Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging
May 22, 2022  Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Oct 19, 2025  VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
Nov 27, 2023  InfoPattern: Unveiling Information Propagation Patterns in Social Media
Oct 9, 2024  MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders
Aug 25, 2022  Multimedia Generative Script Learning for Task Planning
Oct 2, 2025  AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Jan 11, 2026  Artificial Entanglement in the Fine-Tuning of Large Language Models
Jul 30, 2025  FairReason: Balancing Reasoning and Social Bias in MLLMs
Mar 3, 2025  Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
Feb 24, 2026  Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs
Dec 18, 2025  Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills
Nov 26, 2025  ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction
Jul 1, 2020  COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation
Jun 1, 2025  Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
Jan 28, 2026  Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents
May 27, 2023  Non-Sequential Graph Script Induction via Multimedia Grounding
Jun 5, 2022  Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval