This repository contains a comprehensive collection of resources related to OCR (Optical Character Recognition) and Document AI, such as papers, datasets, and APIs.
2025.01.05Include papers that have been published in 2023 and 2024.
TODO
- HCIILAB Scene-Text-Detection. https://github.com/HCIILAB/Scene-Text-Detection
- HCIILAB Scene-Text-Recognition. https://github.com/HCIILAB/Scene-Text-Recognition
- HCIILAB Scene-Text-End2end. https://github.com/HCIILAB/Scene-Text-End2end
- A general list of resources to image text localization and recognition. https://github.com/whitelok/image-text-localization-recognition
- A curated list of resources dedicated to scene text localization and recognition. https://github.com/chongyangtao/Awesome-Scene-Text-Recognition
- A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods. https://github.com/hwalsuklee/awesome-deep-text-detection-recognition
- Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized. https://github.com/Jyouhou/SceneTextPapers
- Links to awesome OCR projects. https://github.com/kba/awesome-ocr
- A curated list of promising OCR resources. https://github.com/wanghaisheng/awesome-ocr
- Z. Chen, W. Wang, et al. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling. In ArXiv, 2024.