Showing 1–20 of 27 results
/ Date/ Name
Sep 21, 2022Contrastive Learning for Time Series on Dynamic GraphsMar 26, 2023Frame Flexible NetworkNov 18, 2022Look More but Care Less in Video RecognitionMar 11, 2025REGEN: Learning Compact Video Embedding with (Re-)Generative DecoderDec 6, 2024Slicing Vision Transformer for Flexible InferenceJul 15, 2024Accessing Vision Foundation Models via ImageNet-1KNov 7, 2023Multi-resolution Time-Series Transformer for Long-term ForecastingMar 14, 2024Don't Judge by the Look: Towards Motion Coherent Video RepresentationJun 17, 2025SKOLR: Structured Koopman Operator Linear RNN for Time-Series ForecastingOct 30, 2024SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image FusionApr 8, 2025Fusing Global and Local: Transformer-CNN Synergy for Next-Gen Current EstimationSep 19, 2025MTS-DMAE: Dual-Masked Autoencoder for Unsupervised Multivariate Time Series Representation LearningFeb 27, 2026Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression TasksMar 28, 2025GmNet: Revisiting Gating Mechanisms From A Frequency ViewApr 14, 2026Distorted or Fabricated? A Survey on Hallucination in Video LLMsOct 13, 2022Parameter-Efficient Masking NetworksApr 21, 2024CKGConv: General Graph Convolution with Continuous KernelsJan 9, 2025Progressive Growing of Video Tokenizers for Temporally Compact Latent SpacesFeb 10, 2026Fine-T2I: An Open, Large-Scale, and Diverse Dataset for High-Quality T2I Fine-TuningMar 27, 2025Boosting Large Language Models with Mask Fine-Tuning