Date / Name
Aug 19, 2024 / RealCustom++: Representing Images as Real Textual Word for Real-Time Customization
Mar 1, 2024 / RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Oct 12, 2023 / Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval
Oct 8, 2023 / Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Jul 6, 2023 / MomentDiff: Generative Video Moment Retrieval from Random to Real
May 9, 2023 / Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition
Oct 12, 2022 / Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets
Sep 1, 2022 / REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer
Aug 22, 2021 / From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network
Jun 13, 2021 / Cross-Modal Attention Consistency for Video-Audio Unsupervised Learning
Apr 1, 2020 / Graph Structured Network for Image-Text Matching
Mar 30, 2020 / Multi-Objective Matrix Normalization for Fine-grained Visual Recognition
Aug 23, 2019 / ACE-Net: Biomedical Image Segmentation with Augmented Contracting and Expansive Paths