Date          Name
Oct 23, 2025  Empower Words: DualGround for Structured Phrase and Sentence-Level Temporal Grounding
Mar 23, 2026  Revisiting Weakly-Supervised Video Scene Graph Generation via Pair Affinity Learning
Sep 3, 2025   Effect of Magnetic Anisotropy on Magnetoelastic Waves in Ni/LiNbO3 Hybrid Device
Nov 4, 2025   LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation
Nov 23, 2025  SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes
Mar 6, 2020   Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding
Jul 25, 2025  Can You Share Your Story? Modeling Clients' Metacognition and Openness for LLM Therapist Evaluation
Mar 30, 2026  LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models
Apr 16, 2026  Seen-to-Scene: Keep the Seen, Generate the Unseen for Video Outpainting
Jul 3, 2024   Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory
May 21, 2025  Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
Sep 22, 2025  PRINCIPLES: Synthetic Strategy Memory for Proactive Dialogue Agents
Apr 16, 2026  CMTM: Cross-Modal Token Modulation for Unsupervised Video Object Segmentation
Apr 21, 2025  GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection