"au:"Yongdong Zhang"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Yongdong Zhang"" — arXiv2 Search

Showing 1–13 of 13 results

/ Date/ Name

Aug 19, 2024RealCustom++: Representing Images as Real Textual Word for Real-Time Customization Mar 1, 2024RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization Oct 12, 2023Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval Oct 8, 2023Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition Jul 6, 2023MomentDiff: Generative Video Moment Retrieval from Random to Real May 9, 2023Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition Oct 12, 2022Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets Sep 1, 2022REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer Aug 22, 2021From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network Jun 13, 2021Cross-Modal Attention Consistency for Video-Audio Unsupervised Learning Apr 1, 2020Graph Structured Network for Image-Text Matching Mar 30, 2020Multi-Objective Matrix Normalization for Fine-grained Visual Recognition Aug 23, 2019ACE-Net: Biomedical Image Segmentation with Augmented Contracting and Expansive Paths