Showing 1–15 of 15 results
/ Date/ Name
Jan 8, 2025H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous DrivingJul 3, 2025VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement LearningFeb 29, 2024Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video RecognitionDec 8, 2025M-STAR: Multi-Scale Spatiotemporal Autoregression for Human Mobility ModelingMar 13, 2025LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM AgentsAug 7, 2025G-UBS: Towards Robust Understanding of Implicit Feedback via Group-Aware User Behavior SimulationJan 9, 2025A study on the 1-$Γ$ inverse of tensors via the M-ProductDec 19, 2023M-BEV: Masked BEV Perception for Robust Autonomous DrivingSep 6, 2022Realizing convex codes with axis-parallel boxesNov 20, 2023Bounding Lifts of Markoff Triples mod $p$Dec 5, 2023MagicStick: Controllable Video Editing via Control Handle TransformationsJun 9, 2025Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video UnderstandingApr 3, 2023Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free VideosDec 8, 2024On the extensions of the GD inverse of tensors via the M-ProductNov 24, 2025When Top-ranked Recommendations Fail: Modeling Multi-Granular Negative Feedback for Explainable and Robust Video Recommendation