Date / Name
Jan 13, 2022 / CLIP-Event: Connecting Text and Images with Event Structures
Apr 13, 2021 / The Future is not One-dimensional: Complex Event Schema Induction by Graph Modeling for Event Prediction
May 8, 2025 / Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging
May 22, 2022 / Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Oct 19, 2025 / VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
Nov 27, 2023 / InfoPattern: Unveiling Information Propagation Patterns in Social Media
Oct 9, 2024 / MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders
Aug 25, 2022 / Multimedia Generative Script Learning for Task Planning
Oct 2, 2025 / AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Jan 11, 2026 / Artificial Entanglement in the Fine-Tuning of Large Language Models
Jul 30, 2025 / FairReason: Balancing Reasoning and Social Bias in MLLMs
Mar 3, 2025 / Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
Feb 24, 2026 / Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs
Dec 18, 2025 / Adaptation of Agentic AI: A Survey of Post-Training, Memory, and Skills
Nov 26, 2025 / ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction
Jul 1, 2020 / COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation
Jun 1, 2025 / Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
Jan 28, 2026 / Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents
May 27, 2023 / Non-Sequential Graph Script Induction via Multimedia Grounding
Jun 5, 2022 / Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval