"au:"Yu Qiao"" — arXiv2 Search
Showing 21–29 of 29 results
Jun 15, 2023
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
May 18, 2023
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Apr 28, 2023
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
Dec 6, 2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Aug 6, 2022
Frozen CLIP Models are Efficient Video Learners
May 8, 2022
ConvMAE: Masked Convolution Meets Masked Autoencoders
Mar 15, 2022
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy
Nov 24, 2021
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
Nov 16, 2021
INTERN: A New Learning Paradigm Towards General Vision