Categories
- All Posts 549
- Practical Open Source Projects 478
- Tutorial Articles 22
- Online Utilities 13
- AI news 7
- Tiny Startups Showcase 7
- Claude Code Skills 6
- Prompt Templates 5
- Hugging Face Spaces 3
- OpenClaw Use Cases 3
- LLM Learning Resources 1
- Online AI Image Tools 1
- OpenClaw Master Skills Collection 1
- Rust Training Resources 1
- AI Short Drama Tools 1
- My Favorites 0
Posts tagged with: OCR
Content related to OCR
EasyOCR: A Fast, Multilingual OCR Library for Python
EasyOCR brings 80+ language support right into your Python projects. With a quick pip install, lightweight model downloads, and an intuitive API, you can extract text from images in seconds. This guide covers everything from basic usage and custom language sets to Docker deployment and Hugging Face Space integration. Whether you’re building a photo‑management tool or a data‑entry pipeline, EasyOCR gives you the speed and accuracy you need.
DeepSeek-OCR: Advanced Vision-Language Model for OCR
Discover DeepSeek-OCR, a cutting-edge open-source project by DeepSeek AI designed for robust Optical Character Recognition and visual-text compression. This project provides a powerful AI model that investigates the role of vision encoders from an LLM-centric viewpoint, offering impressive capabilities for converting documents to markdown, parsing figures, and general image description. Explore its various resolution modes, from Tiny to Gundam, and learn how to implement it using vLLM or Transformers for high-performance inference. DeepSeek-OCR aims to push the boundaries of visual-text understanding, making advanced OCR accessible for developers and researchers.
Dango-Translator: Real-Time OCR & Comic Translation Software
Dive into Dango-Translator, an open-source OCR-based tool designed to break language barriers in real-time. Whether you're playing foreign games, browsing untranslated websites, or reading raw comics, this powerful Windows software instantly captures and translates text from your screen. Featuring advanced image processing for comics (including text recognition, erasure, and re-embedding), support for 15 diverse translation sources, and cloud-saved settings, Dango-Translator offers a seamless and efficient solution for handling 'raw' content. Discover how this practical project can transform your digital experience, making inaccessible content instantly understandable and enhancing your engagement with multilingual media.