Date          Name
Oct 14, 2025  Migration and spreading of a droplet driven by a chemical step
Jun 16, 2025  MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Feb 13, 2025  MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
Jan 14, 2025  Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Jul 10, 2024  VEnhancer: Generative Space-Time Enhancement for Video Generation
May 23, 2024  TerDiT: Ternary Diffusion Models with Transformers
Feb 8, 2024   SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Nov 13, 2023  SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
Apr 28, 2023  LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
Mar 9, 2023   Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking
Aug 6, 2022   Frozen CLIP Models are Efficient Video Learners
May 8, 2022   ConvMAE: Masked Convolution Meets Masked Autoencoders
Nov 18, 2020  End-to-End Object Detection with Adaptive Clustering Transformer
Apr 26, 2020  Transmission electron microscopy of organic-inorganic hybrid perovskites: myths and truths
Nov 13, 2019  Learning Where to Focus for Efficient Video Object Detection