Showing 21–40 of 63 results
/ Date/ Name
Oct 7, 2024MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant GeneralizationApr 19, 2016Streaming Label Learning for Modeling Labels on the FlyFeb 27, 2025Learning Mask Invariant Mutual Information for Masked Image ModelingMay 15, 2025VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized LogitsNov 17, 2025ActVAR: Activating Mixtures of Weights and Tokens for Efficient Visual Autoregressive GenerationJan 7, 2026LEGATO: Good Identity Unlearning Is ContinuousSep 18, 2025DeCoP: Enhancing Self-Supervised Time Series Representation with Dependency Controlled Pre-trainingMay 23, 2025Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical PerspectiveMay 15, 2025ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector AttentionJan 27, 2021Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture SearchJul 24, 2020On the learnability of quantum neural networksMay 21, 2022Knowledge Distillation from A Stronger TeacherMay 29, 2022Masked Distillation with Receptive TokensMay 25, 2023Knowledge Diffusion for DistillationNov 7, 2023Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation ModelsMar 11, 2024Active Generation for Image ClassificationNov 19, 2025Learning to Expand Images for Efficient Visual Autoregressive ModelingMay 4, 2026Seeing Realism from Simulation: Efficient Video Transfer for Vision-Language-Action Data AugmentationMar 14, 2024LocalMamba: Visual State Space Model with Windowed Selective ScanAug 21, 2023CoNe: Contrast Your Neighbours for Supervised Image Classification