Showing 1–20 of 21 results
/ Date/ Name
Oct 21, 2019Batch Face Alignment using a Low-rank GANJul 23, 2021Cross-Sentence Temporal and Semantic Relations in Video Activity LocalisationAug 20, 2025Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation ModelsApr 25, 2019Unsupervised Deep Learning by Neighbourhood DiscoveryJun 8, 2020Unsupervised Transfer Learning with Self-Supervised RemedyJun 26, 2022Video Activity Localisation with Uncertainties in Temporal BoundarySep 4, 2023Code Representation Pre-training with Complements from Program ExecutionsMar 17, 2026Empirical Recipes for Efficient and Compact Vision-Language ModelsMar 3, 2021Deep Clustering by Semantic Contrastive LearningFeb 4, 2024UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program TestingJun 25, 2024MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment RetrievalOct 15, 2024InvSeg: Test-Time Prompt Inversion for Semantic SegmentationAug 27, 2025UNIFORM: Unifying Knowledge from Large-scale and Diverse Pre-trained ModelsOct 17, 2025Neuro-Symbolic Spatial Reasoning in SegmentationSep 1, 2023Zero-Shot Video Moment Retrieval from Frozen Vision-Language ModelsMar 11, 2026UniCompress: Token Compression for Unified Vision-Language Understanding and GenerationMay 23, 2022Feature-Distribution Perturbation and Calibration for Generalized Person ReIDJun 3, 2024Hybrid-Learning Video Moment Retrieval across Multi-Domain LabelsJan 24, 2024Generative Video Diffusion for Unseen Novel Semantic Video Moment RetrievalFeb 28, 2023Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training