Showing 1–20 of 33 results
/ Date/ Name
Aug 22, 2019Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesFeb 21, 2022Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale FusionJul 13, 2019SynthText3D: Synthesizing Scene Text Images from 3D Virtual WorldsNov 21, 2016TextBoxes: A Fast Text Detector with a Single Deep Neural NetworkApr 14, 2024TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language ModelsSep 18, 2018Scene Text Recognition from Two-Dimensional PerspectiveApr 2, 2021MOST: A Multi-Oriented Scene Text Detector with Localization RefinementJul 18, 2020Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text SpottingNov 20, 2019Real-time Scene Text Detection with Differentiable BinarizationJul 6, 2018Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesJan 9, 2018TextBoxes++: A Single-Shot Oriented Scene Text DetectorMar 14, 2018Rotation-Sensitive Regression for Oriented Scene Text DetectionAug 31, 2017ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17)Oct 8, 2024PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse SamplingMar 23, 2022Comprehensive Benchmark Datasets for Amharic Scene Text Detection and RecognitionMar 5, 2024Android in the Zoo: Chain-of-Action-Thought for GUI AgentsNov 15, 2024Partial Scene Text RetrievalSep 15, 2025MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUsAug 17, 2023Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective ApproachFeb 21, 2024Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition