"au:"Rongrong Ji"" — arXiv2 Search

Showing 1–9 of 9 results

Apr 23, 2026 - Prototype-Based Test-Time Adaptation of Vision-Language Models
Feb 11, 2026 - Flow caching for autoregressive video generation
Nov 4, 2025 - LTD-Bench: Evaluating Large Language Models by Letting Them Draw
Oct 17, 2025 - FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification
May 30, 2025 - Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
Feb 7, 2025 - Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
Sep 12, 2020 - Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning
Jan 15, 2020 - Filter Grafting for Deep Neural Networks
Nov 17, 2017 - Action-Attending Graphic Neural Network