Topic: "ocr"
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language: C++ - Size: 51.2 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 71,586 - Forks: 10,442
PaddlePaddle/PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Language: Python - Size: 1.59 GB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 66,775 - Forks: 9,539
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Language: Python - Size: 139 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 51,415 - Forks: 4,271
siyuan-note/siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.
Language: TypeScript - Size: 455 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 40,166 - Forks: 2,491
hiroi-sora/Umi-OCR
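Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multi-language libraries.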
Language: Python - Size: 268 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 39,748 - Forks: 3,936
naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
Language: JavaScript - Size: 104 MB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 37,600 - Forks: 2,351
paperless-ngx/paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
Language: Python - Size: 169 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 35,254 - Forks: 2,221
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.
Language: C# - Size: 60.7 MB - Last synced at: 15 days ago - Pushed at: 18 days ago - Stars: 34,884 - Forks: 3,548
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Language: Python - Size: 64.3 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 32,099 - Forks: 2,249
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, and more.
Language: Python - Size: 154 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 28,632 - Forks: 3,518
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language: Python - Size: 9.06 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 15,959 - Forks: 1,270
pot-app/pot-desktop
🌈 A cross-platform software for text-selection translation and OCR.
Language: JavaScript - Size: 77.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15,818 - Forks: 746
Unstructured-IO/unstructured
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.
Language: HTML - Size: 194 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 13,532 - Forks: 1,117
sml2h3/ddddocr
带带弟弟 (ddddocr): general-purpose CAPTCHA recognition OCR, PyPI version
Language: Python - Size: 71.4 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 12,937 - Forks: 2,129
DayBreak-u/chineseocr_lite
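Ultra-lightweight Chinese OCR with support for vertical text recognition and for ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M.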
Language: C++ - Size: 457 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 12,209 - Forks: 2,295
getomni-ai/zerox
OCR & Document Extraction using vision models
Language: TypeScript - Size: 164 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 11,855 - Forks: 808
tisfeng/Easydict
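A concise and elegant Dictionary and Translator macOS App for looking up words and translating text. Works out of the box, with offline OCR and support for Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, NiuTrans, Caiyun, and Volcano translation.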
Language: Swift - Size: 141 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 11,305 - Forks: 561
dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language: TypeScript - Size: 85.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 10,834 - Forks: 1,767
HIllya51/LunaTranslator
Visual Novel Translator
Language: C++ - Size: 18.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 10,056 - Forks: 998
ripperhe/Bob
Bob is a translation and OCR application for macOS.
Size: 96.7 KB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 9,502 - Forks: 525
zyddnys/manga-image-translator
Translate manga/images: one-click translation of the text in all kinds of images. https://cotrans.touhou.ai/ (no longer working)
Language: Python - Size: 85.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8,969 - Forks: 877
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Language: Python - Size: 342 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 8,759 - Forks: 675
bytedance/Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Language: Python - Size: 22 MB - Last synced at: 15 days ago - Pushed at: 18 days ago - Stars: 8,054 - Forks: 675
the-paperless-project/paperless 📦
Scan, index, and archive all of your paper documents
Language: Python - Size: 6.94 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 7,921 - Forks: 501
microsoft/ailab
Experience, Learn and Code the latest breakthrough innovations with Microsoft AI
Language: C# - Size: 98.8 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 7,837 - Forks: 1,397
YaoFANGUK/video-subtitle-extractor
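A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that covers subtitle region detection and subtitle content extraction.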
Language: Python - Size: 1.19 GB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7,748 - Forks: 810
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Language: Python - Size: 143 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 7,616 - Forks: 842
tesseract-ocr/tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
Size: 3.1 GB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 7,066 - Forks: 2,368
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Language: Python - Size: 782 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 6,685 - Forks: 531
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language: Python - Size: 61.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 6,565 - Forks: 539
xushengfeng/eSearch
Screenshot, offline OCR, search and translate, search by image, pin pictures on the screen, screen recorder, omnidirectional scrolling screenshot, screen translator. Supports Windows, Linux, and macOS.
Language: TypeScript - Size: 73.4 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 6,175 - Forks: 458
chineseocr/chineseocr
yolo3+ocr
Language: Python - Size: 34.9 MB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 6,060 - Forks: 1,730
Swift-AI/Swift-AI
The Swift machine learning library.
Language: Swift - Size: 28.7 MB - Last synced at: 4 days ago - Pushed at: over 8 years ago - Stars: 6,049 - Forks: 552
axa-group/Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
Language: JavaScript - Size: 52.6 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 6,001 - Forks: 318
PaddlePaddle/PaddleX
All-in-One Development Tool based on PaddlePaddle
Language: Python - Size: 854 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 5,964 - Forks: 1,131
yusufkaraaslan/Skill_Seekers
Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection
Language: Python - Size: 1.07 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 5,839 - Forks: 606
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language: Python - Size: 98.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5,646 - Forks: 600
RapidAI/RapidOCR
📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
Language: Python - Size: 20.2 MB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 5,536 - Forks: 539
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Language: Python - Size: 58.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 5,511 - Forks: 512
jonaswinkler/paperless-ng 📦
A supercharged version of paperless: scan, index and archive all your physical documents
Language: Python - Size: 18.3 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 5,406 - Forks: 353
STranslate/STranslate
A ready-to-go translation and OCR tool developed with WPF.
Language: C# - Size: 146 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 4,933 - Forks: 277
NMAC427/SwiftOCR
Fast and simple OCR library written in Swift
Language: Swift - Size: 11.1 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 4,638 - Forks: 479
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Language: Python - Size: 15.8 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 4,628 - Forks: 772
Tencent/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existing open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work collaboratively with us and make TNN a better framework.
Language: C++ - Size: 56 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 4,599 - Forks: 771
dmMaze/BallonsTranslator
Yet another computer-aided comic/manga translation tool powered by deep learning, with one-click machine translation and simple image/text editing.
Language: Python - Size: 57.2 MB - Last synced at: 15 days ago - Pushed at: 19 days ago - Stars: 4,376 - Forks: 285
TheJoeFin/Text-Grab
Use OCR in Windows quickly and easily with Text Grab. With optional background process and notifications.
Language: C# - Size: 46.4 MB - Last synced at: 14 days ago - Pushed at: 16 days ago - Stars: 4,341 - Forks: 279
oomol-lab/pdf-craft
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
Language: Python - Size: 22 MB - Last synced at: 1 day ago - Pushed at: 4 days ago - Stars: 4,319 - Forks: 274
ramjke/Translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text, and more.
Language: C# - Size: 100 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4,284 - Forks: 235
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language: Jupyter Notebook - Size: 3 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 3,844 - Forks: 1,123
UB-Mannheim/tesseract Fork of tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language: C++ - Size: 118 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 3,803 - Forks: 493
cyanfish/naps2
Scan documents to PDF and more, as simply as possible.
Language: C# - Size: 109 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 3,748 - Forks: 378
breezedeus/CnOCR Fork of diaomin/crnn-mxnet-chinese-text-recognition
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. (A Chinese/English OCR Python package based on PyTorch/MXNet.)
Language: Python - Size: 17.5 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 3,704 - Forks: 534
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
Language: Python - Size: 149 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 3,612 - Forks: 1,017
aim-uofa/AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Language: Python - Size: 663 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 3,451 - Forks: 653
eragonruan/text-detection-ctpn
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network
Language: Python - Size: 330 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 3,443 - Forks: 1,332
clovaai/CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Language: Python - Size: 1.65 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 3,247 - Forks: 924
kerlomz/captcha_trainer
[CAPTCHA recognition: training] This project is based on CNN/ResNet/DenseNet+GRU/LSTM+CTC/CrossEntropy to implement CAPTCHA recognition. This project is only for training the model.
Language: Python - Size: 16.5 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 3,165 - Forks: 831
deepdoctection/deepdoctection
A Repo For Document AI
Language: Python - Size: 30.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 3,111 - Forks: 185
argman/EAST
A tensorflow implementation of EAST text detector
Language: C++ - Size: 1.95 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 3,043 - Forks: 1,047
AnyListen/tools-ocr
树洞 OCR (Tree Hole OCR) text recognition: a small cross-platform OCR tool
Language: Java - Size: 21.1 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 3,020 - Forks: 482
thiagoalessio/tesseract-ocr-for-php
A wrapper to work with Tesseract OCR inside PHP.
Language: PHP - Size: 1.09 MB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 2,970 - Forks: 552
xiaofengShi/CHINESE-OCR
[python3.6] Natural-scene text detection implemented with tf, and variable-length scene text OCR implemented with ctpn+crnn+ctc in keras/pytorch
Language: Python - Size: 121 MB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 2,937 - Forks: 968
CatchTheTornado/text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Language: Python - Size: 5.06 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 2,916 - Forks: 247
otiai10/gosseract
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
Language: Go - Size: 1.08 MB - Last synced at: 8 months ago - Pushed at: 10 months ago - Stars: 2,869 - Forks: 295
InkTimeRecord/TTime
🚀 Screenshot, text-selection, OCR, AI, and translation software
Language: TypeScript - Size: 42.7 MB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 2,832 - Forks: 159
datalab-to/chandra
OCR model that handles complex tables, forms, handwriting with full layout.
Language: Python - Size: 13.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2,828 - Forks: 316
ciur/papermerge
Open Source Document Management System for Digital Archives (Scanned Documents)
Language: Python - Size: 25.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,824 - Forks: 298
Dicklesworthstone/llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
Language: Python - Size: 1.41 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 2,790 - Forks: 197
alisen39/TrWebOCR
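An open-source, easy-to-use offline Chinese OCR with recognition accuracy comparable to that of major vendors. It also provides an easy-to-use web page and web API, convenient for everyday human use or for calls from other programs.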
Language: Python - Size: 314 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 2,788 - Forks: 623
ypwhs/captcha_break
CAPTCHA recognition
Language: Jupyter Notebook - Size: 6.6 MB - Last synced at: 7 months ago - Pushed at: almost 4 years ago - Stars: 2,767 - Forks: 679
zhoubear/open-paperless
Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)
Language: Python - Size: 23.6 MB - Last synced at: 8 months ago - Pushed at: about 7 years ago - Stars: 2,557 - Forks: 222
breezedeus/Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2,552 - Forks: 234
hwalsuklee/awesome-deep-text-detection-recognition
A curated list of resources for text detection/recognition (optical character recognition) with deep learning methods.
Size: 1.03 MB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 2,534 - Forks: 507
openrecall/openrecall
OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your memory and productivity without compromising your privacy.
Language: Python - Size: 4.05 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 2,482 - Forks: 151
openpaperwork/paperwork 📦
Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab
Language: Python - Size: 18.2 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 2,433 - Forks: 147
dynobo/normcap
OCR powered screen-capture tool to capture information instead of images
Language: Python - Size: 146 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,405 - Forks: 106
kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
Language: Python - Size: 1.07 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 2,352 - Forks: 119
ballerine-io/ballerine
Open-source infrastructure and data orchestration platform for risk decisioning
Language: TypeScript - Size: 132 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 2,317 - Forks: 264
ogkalu2/comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
Language: Python - Size: 17.5 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 2,304 - Forks: 246
zcaceres/markdownify-mcp
A Model Context Protocol server for converting almost anything to Markdown
Language: TypeScript - Size: 1.51 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 2,282 - Forks: 193
WZBSocialScienceCenter/pdftabextract
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Language: Python - Size: 138 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 2,248 - Forks: 372
rmtheis/android-ocr 📦
Experimental optical character recognition app
Language: Java - Size: 16.3 MB - Last synced at: 15 days ago - Pushed at: over 7 years ago - Stars: 2,239 - Forks: 887
umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
Language: C# - Size: 116 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,149 - Forks: 111
sismics/docs
Lightweight document management system packed with all the features you can expect from big expensive solutions
Language: JavaScript - Size: 15.3 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 2,148 - Forks: 526
sirfz/tesserocr
A Python wrapper for the tesseract-ocr API
Language: Python - Size: 615 KB - Last synced at: 15 days ago - Pushed at: 17 days ago - Stars: 2,139 - Forks: 261
bgshih/crnn
Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.
Language: Lua - Size: 73.2 KB - Last synced at: 8 months ago - Pushed at: almost 7 years ago - Stars: 2,081 - Forks: 550
githubharald/SimpleHTR
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
Language: Python - Size: 77.1 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 2,076 - Forks: 908
mg-chao/snow-shot
A super easy-to-use screenshot tool
Language: TypeScript - Size: 57 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,038 - Forks: 104
card-io/card.io-Android-SDK 📦
card.io provides fast, easy credit card scanning in mobile apps
Language: Java - Size: 125 MB - Last synced at: 3 months ago - Pushed at: almost 9 years ago - Stars: 1,996 - Forks: 532
TimmyOVO/deepseek-ocr.rs
Rust multi‑backend OCR/VLM engine (DeepSeek‑OCR, PaddleOCR‑VL, DotsOCR) with DSQ quantization and an OpenAI‑compatible server & CLI – run locally without Python.
Language: Rust - Size: 1.24 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,976 - Forks: 150
eikek/docspell
Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with minimal effort.
Language: Elm - Size: 136 MB - Last synced at: 10 days ago - Pushed at: 12 days ago - Stars: 1,968 - Forks: 154
Achno/gowall
A tool to convert a Wallpaper's color scheme / palette, OCR with VLM's Traditional & Hybrid, Image Compression, color palette extraction, image upscaling with Adversarial Networks and more image processing features.
Language: Go - Size: 8.13 MB - Last synced at: 14 days ago - Pushed at: 18 days ago - Stars: 1,959 - Forks: 31
RD17/ambar 📦
🔍 Ambar: Document Search Engine
Language: JavaScript - Size: 49.8 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1,946 - Forks: 376
crow-translate/crow-translate 📦
A simple and lightweight translator that allows you to translate and speak text using Google, Yandex, Bing, LibreTranslate and Lingva.
Language: C++ - Size: 27.1 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1,895 - Forks: 172
manisandro/gImageReader
A Gtk/Qt front-end to tesseract-ocr.
Language: C++ - Size: 11.7 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 1,892 - Forks: 212
heshengtao/comfyui_LLM_party
LLM Agent Framework in ComfyUI includes MCP server, Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, access to Feishu and discord, and adapts to all llms with similar openai / aisuite interfaces, such as o1, ollama, gemini, grok, qwen, GLM, deepseek, kimi, doubao. Adapted to local llms, vlm, gguf such as llama-3.3 Janus-Pro, Linkage graphRAG
Language: Python - Size: 136 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1,892 - Forks: 158
Sierkinhane/CRNN_Chinese_Characters_Rec
(CRNN) Chinese Characters Recognition.
Language: Python - Size: 181 MB - Last synced at: 8 months ago - Pushed at: about 3 years ago - Stars: 1,857 - Forks: 536
ianzhao/textshot
Python tool for grabbing text via screenshot
Language: Python - Size: 64.5 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1,774 - Forks: 257
NanoNets/docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
Language: Python - Size: 6.77 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1,755 - Forks: 132
scambier/obsidian-omnisearch
A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.
Language: TypeScript - Size: 6.75 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 1,743 - Forks: 81