An open API service providing repository metadata for many open source software ecosystems.

Topic: "ocr"

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ - Size: 51.2 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 71,586 - Forks: 10,442

PaddlePaddle/PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Language: Python - Size: 1.59 GB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 66,775 - Forks: 9,539

opendatalab/MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Language: Python - Size: 139 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 51,415 - Forks: 4,271

siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

Language: TypeScript - Size: 455 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 40,166 - Forks: 2,491

hiroi-sora/Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

Language: Python - Size: 268 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 39,748 - Forks: 3,936

naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

Language: JavaScript - Size: 104 MB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 37,600 - Forks: 2,351

paperless-ngx/paperless-ngx

A community-supported supercharged document management system: scan, index and archive all your documents

Language: Python - Size: 169 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 35,254 - Forks: 2,221

ShareX/ShareX

ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.

Language: C# - Size: 60.7 MB - Last synced at: 15 days ago - Pushed at: 18 days ago - Stars: 34,884 - Forks: 3,548

ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Language: Python - Size: 64.3 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 32,099 - Forks: 2,249

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language: Python - Size: 154 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 28,632 - Forks: 3,518

lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language: Python - Size: 9.06 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 15,959 - Forks: 1,270

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

Language: JavaScript - Size: 77.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15,818 - Forks: 746

Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

Language: HTML - Size: 194 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 13,532 - Forks: 1,117

sml2h3/ddddocr

带带弟弟 通用验证码识别OCR pypi版

Language: Python - Size: 71.4 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 12,937 - Forks: 2,129

DayBreak-u/chineseocr_lite

超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

Language: C++ - Size: 457 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 12,209 - Forks: 2,295

getomni-ai/zerox

OCR & Document Extraction using vision models

Language: TypeScript - Size: 164 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 11,855 - Forks: 808

tisfeng/Easydict

一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.

Language: Swift - Size: 141 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 11,305 - Forks: 561

dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

Language: TypeScript - Size: 85.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 10,834 - Forks: 1,767

HIllya51/LunaTranslator

视觉小说翻译器 / Visual Novel Translator

Language: C++ - Size: 18.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 10,056 - Forks: 998

ripperhe/Bob

Bob 是一款 macOS 平台的翻译和 OCR 软件。

Size: 96.7 KB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 9,502 - Forks: 525

zyddnys/manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

Language: Python - Size: 85.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8,969 - Forks: 877

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Language: Python - Size: 342 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 8,759 - Forks: 675

bytedance/Dolphin

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Language: Python - Size: 22 MB - Last synced at: 15 days ago - Pushed at: 18 days ago - Stars: 8,054 - Forks: 675

the-paperless-project/paperless 📦

Scan, index, and archive all of your paper documents

Language: Python - Size: 6.94 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 7,921 - Forks: 501

microsoft/ailab

Experience, Learn and Code the latest breakthrough innovations with Microsoft AI

Language: C# - Size: 98.8 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 7,837 - Forks: 1,397

YaoFANGUK/video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language: Python - Size: 1.19 GB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7,748 - Forks: 810

CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language: Python - Size: 143 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 7,616 - Forks: 842

tesseract-ocr/tessdata

Trained models with fast variant of the "best" LSTM models + legacy models

Size: 3.1 GB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 7,066 - Forks: 2,368

adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language: Python - Size: 782 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 6,685 - Forks: 531

clovaai/donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python - Size: 61.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 6,565 - Forks: 539

xushengfeng/eSearch

截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator 支持Windows Linux macOS

Language: TypeScript - Size: 73.4 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 6,175 - Forks: 458

chineseocr/chineseocr

yolo3+ocr

Language: Python - Size: 34.9 MB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 6,060 - Forks: 1,730

Swift-AI/Swift-AI

The Swift machine learning library.

Language: Swift - Size: 28.7 MB - Last synced at: 4 days ago - Pushed at: over 8 years ago - Stars: 6,049 - Forks: 552

axa-group/Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

Language: JavaScript - Size: 52.6 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 6,001 - Forks: 318

PaddlePaddle/PaddleX

All-in-One Development Tool based on PaddlePaddle

Language: Python - Size: 854 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 5,964 - Forks: 1,131

yusufkaraaslan/Skill_Seekers

Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection

Language: Python - Size: 1.07 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 5,839 - Forks: 606

mindee/doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Language: Python - Size: 98.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5,646 - Forks: 600

RapidAI/RapidOCR

📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Language: Python - Size: 20.2 MB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 5,536 - Forks: 539

Layout-Parser/layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Language: Python - Size: 58.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 5,511 - Forks: 512

jonaswinkler/paperless-ng 📦

A supercharged version of paperless: scan, index and archive all your physical documents

Language: Python - Size: 18.3 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 5,406 - Forks: 353

STranslate/STranslate

A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具

Language: C# - Size: 146 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 4,933 - Forks: 277

NMAC427/SwiftOCR

Fast and simple OCR library written in Swift

Language: Swift - Size: 11.1 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 4,638 - Forks: 479

open-mmlab/mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Language: Python - Size: 15.8 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 4,628 - Forks: 772

Tencent/TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.

Language: C++ - Size: 56 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 4,599 - Forks: 771

dmMaze/BallonsTranslator

深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga translation tool powered by deeplearning

Language: Python - Size: 57.2 MB - Last synced at: 15 days ago - Pushed at: 19 days ago - Stars: 4,376 - Forks: 285

TheJoeFin/Text-Grab

Use OCR in Windows quickly and easily with Text Grab. With optional background process and notifications.

Language: C# - Size: 46.4 MB - Last synced at: 14 days ago - Pushed at: 16 days ago - Stars: 4,341 - Forks: 279

oomol-lab/pdf-craft

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

Language: Python - Size: 22 MB - Last synced at: 1 day ago - Pushed at: 4 days ago - Stars: 4,319 - Forks: 274

ramjke/Translumo

Advanced real-time screen translator for games, hardcoded subtitles in videos, static text and etc.

Language: C# - Size: 100 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4,284 - Forks: 235

clovaai/deep-text-recognition-benchmark

Text recognition (optical character recognition) with deep learning methods, ICCV 2019

Language: Jupyter Notebook - Size: 3 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 3,844 - Forks: 1,123

UB-Mannheim/tesseract Fork of tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ - Size: 118 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 3,803 - Forks: 493

cyanfish/naps2

Scan documents to PDF and more, as simply as possible.

Language: C# - Size: 109 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 3,748 - Forks: 378

breezedeus/CnOCR Fork of diaomin/crnn-mxnet-chinese-text-recognition

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】

Language: Python - Size: 17.5 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 3,704 - Forks: 534

Belval/TextRecognitionDataGenerator

A synthetic data generator for text recognition

Language: Python - Size: 149 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 3,612 - Forks: 1,017

aim-uofa/AdelaiDet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Language: Python - Size: 663 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 3,451 - Forks: 653

eragonruan/text-detection-ctpn

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

Language: Python - Size: 330 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 3,443 - Forks: 1,332

clovaai/CRAFT-pytorch

Official implementation of Character Region Awareness for Text Detection (CRAFT)

Language: Python - Size: 1.65 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 3,247 - Forks: 924

kerlomz/captcha_trainer

[验证码识别-训练] This project is based on CNN/ResNet/DenseNet+GRU/LSTM+CTC/CrossEntropy to realize verification code identification. This project is only for training the model.

Language: Python - Size: 16.5 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 3,165 - Forks: 831

deepdoctection/deepdoctection

A Repo For Document AI

Language: Python - Size: 30.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 3,111 - Forks: 185

argman/EAST

A tensorflow implementation of EAST text detector

Language: C++ - Size: 1.95 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 3,043 - Forks: 1,047

AnyListen/tools-ocr

树洞 OCR 文字识别(一款跨平台的 OCR 小工具)

Language: Java - Size: 21.1 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 3,020 - Forks: 482

thiagoalessio/tesseract-ocr-for-php

A wrapper to work with Tesseract OCR inside PHP.

Language: PHP - Size: 1.09 MB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 2,970 - Forks: 552

xiaofengShi/CHINESE-OCR

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Language: Python - Size: 121 MB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 2,937 - Forks: 968

CatchTheTornado/text-extract-api

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

Language: Python - Size: 5.06 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 2,916 - Forks: 247

otiai10/gosseract

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

Language: Go - Size: 1.08 MB - Last synced at: 8 months ago - Pushed at: 10 months ago - Stars: 2,869 - Forks: 295

InkTimeRecord/TTime

🚀 Screenshots, word marking, OCR, AI, translation software || 截图、划词、文字识别、AI、翻译软件

Language: TypeScript - Size: 42.7 MB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 2,832 - Forks: 159

datalab-to/chandra

OCR model that handles complex tables, forms, handwriting with full layout.

Language: Python - Size: 13.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2,828 - Forks: 316

ciur/papermerge

Open Source Document Management System for Digital Archives (Scanned Documents)

Language: Python - Size: 25.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,824 - Forks: 298

Dicklesworthstone/llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Language: Python - Size: 1.41 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 2,790 - Forks: 197

alisen39/TrWebOCR

开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~

Language: Python - Size: 314 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 2,788 - Forks: 623

ypwhs/captcha_break

验证码识别

Language: Jupyter Notebook - Size: 6.6 MB - Last synced at: 7 months ago - Pushed at: almost 4 years ago - Stars: 2,767 - Forks: 679

zhoubear/open-paperless

Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)

Language: Python - Size: 23.6 MB - Last synced at: 8 months ago - Pushed at: about 7 years ago - Stars: 2,557 - Forks: 222

breezedeus/Pix2Text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2,552 - Forks: 234

hwalsuklee/awesome-deep-text-detection-recognition

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

Size: 1.03 MB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 2,534 - Forks: 507

openrecall/openrecall

OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your memory and productivity without compromising your privacy.

Language: Python - Size: 4.05 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 2,482 - Forks: 151

openpaperwork/paperwork 📦

Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab

Language: Python - Size: 18.2 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 2,433 - Forks: 147

dynobo/normcap

OCR powered screen-capture tool to capture information instead of images

Language: Python - Size: 146 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,405 - Forks: 106

kha-white/manga-ocr

Optical character recognition for Japanese text, with the main focus being Japanese manga

Language: Python - Size: 1.07 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 2,352 - Forks: 119

ballerine-io/ballerine

Open-source infrastructure and data orchestration platform for risk decisioning

Language: TypeScript - Size: 132 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 2,317 - Forks: 264

ogkalu2/comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.

Language: Python - Size: 17.5 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 2,304 - Forks: 246

zcaceres/markdownify-mcp

A Model Context Protocol server for converting almost anything to Markdown

Language: TypeScript - Size: 1.51 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2,282 - Forks: 193

WZBSocialScienceCenter/pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Language: Python - Size: 138 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 2,248 - Forks: 372

rmtheis/android-ocr 📦

Experimental optical character recognition app

Language: Java - Size: 16.3 MB - Last synced at: 15 days ago - Pushed at: over 7 years ago - Stars: 2,239 - Forks: 887

umlx5h/LLPlayer

The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!

Language: C# - Size: 116 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,149 - Forks: 111

sismics/docs

Lightweight document management system packed with all the features you can expect from big expensive solutions

Language: JavaScript - Size: 15.3 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 2,148 - Forks: 526

sirfz/tesserocr

A Python wrapper for the tesseract-ocr API

Language: Python - Size: 615 KB - Last synced at: 15 days ago - Pushed at: 17 days ago - Stars: 2,139 - Forks: 261

bgshih/crnn

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

Language: Lua - Size: 73.2 KB - Last synced at: 8 months ago - Pushed at: almost 7 years ago - Stars: 2,081 - Forks: 550

githubharald/SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Language: Python - Size: 77.1 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 2,076 - Forks: 908

mg-chao/snow-shot

超好用的截图工具

Language: TypeScript - Size: 57 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,038 - Forks: 104

card-io/card.io-Android-SDK 📦

card.io provides fast, easy credit card scanning in mobile apps

Language: Java - Size: 125 MB - Last synced at: 3 months ago - Pushed at: almost 9 years ago - Stars: 1,996 - Forks: 532

TimmyOVO/deepseek-ocr.rs

Rust multi‑backend OCR/VLM engine (DeepSeek‑OCR, PaddleOCR‑VL, DotsOCR) with DSQ quantization and an OpenAI‑compatible server & CLI – run locally without Python.

Language: Rust - Size: 1.24 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,976 - Forks: 150

eikek/docspell

Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.

Language: Elm - Size: 136 MB - Last synced at: 10 days ago - Pushed at: 12 days ago - Stars: 1,968 - Forks: 154

Achno/gowall

A tool to convert a Wallpaper's color scheme / palette, OCR with VLM's Traditional & Hybrid, Image Compression ,color palette extraction, image upsacling with Adversarial Networks and more image processing features.

Language: Go - Size: 8.13 MB - Last synced at: 14 days ago - Pushed at: 18 days ago - Stars: 1,959 - Forks: 31

RD17/ambar 📦

:mag: Ambar: Document Search Engine

Language: JavaScript - Size: 49.8 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1,946 - Forks: 376

crow-translate/crow-translate 📦

A simple and lightweight translator that allows you to translate and speak text using Google, Yandex Bing, LibreTranslate and Lingva.

Language: C++ - Size: 27.1 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1,895 - Forks: 172

manisandro/gImageReader

A Gtk/Qt front-end to tesseract-ocr.

Language: C++ - Size: 11.7 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 1,892 - Forks: 212

heshengtao/comfyui_LLM_party

LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as o1,ollama, gemini, grok, qwen, GLM, deepseek, kimi,doubao. Adapted to local llms, vlm, gguf such as llama-3.3 Janus-Pro, Linkage graphRAG

Language: Python - Size: 136 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1,892 - Forks: 158

Sierkinhane/CRNN_Chinese_Characters_Rec

(CRNN) Chinese Characters Recognition.

Language: Python - Size: 181 MB - Last synced at: 8 months ago - Pushed at: about 3 years ago - Stars: 1,857 - Forks: 536

ianzhao/textshot

Python tool for grabbing text via screenshot

Language: Python - Size: 64.5 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1,774 - Forks: 257

NanoNets/docext

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Language: Python - Size: 6.77 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1,755 - Forks: 132

scambier/obsidian-omnisearch

A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.

Language: TypeScript - Size: 6.75 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 1,743 - Forks: 81