Showing 1–12 of 12 results
/ Date/ Name
Apr 23, 2025Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-based Multi-Agent DebateMay 23, 2024TerDiT: Ternary Diffusion Models with TransformersFeb 8, 2024SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsNov 13, 2023SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language ModelsJun 15, 2023Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language ModelsApr 28, 2023LLaMA-Adapter V2: Parameter-Efficient Visual Instruction ModelMar 9, 2023Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature MimickingAug 6, 2022Frozen CLIP Models are Efficient Video LearnersJun 27, 2022ST-Adapter: Parameter-Efficient Image-to-Video Transfer LearningMay 8, 2022ConvMAE: Masked Convolution Meets Masked AutoencodersJun 16, 20201st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020May 10, 2018Avatar-Net: Multi-scale Zero-shot Style Transfer by Feature Decoration