GitHub topics: image2text
TAO71-AI/I4.0
TAO71 I4.0 is an AI created by TAO71 in Python.
Language: Python - Size: 2.98 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 0

lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language: Python - Size: 9.06 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 14,245 - Forks: 1,131

OleehyO/TexTeller
TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and stronger generalization ability, enabling it to cover most usage scenarios.
Language: Python - Size: 79.8 MB - Last synced at: 25 days ago - Pushed at: 26 days ago - Stars: 521 - Forks: 57

prabhakar267/image2text
Python wrapper to grab text from images and save it as text files using the Tesseract engine
Language: Python - Size: 5.42 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 406 - Forks: 140

TheLime1/CheatoMate
A collection of scripts to "help" you with your programming exams and assignments.
Language: Python - Size: 214 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 1

morzellen/renamer-classifier-images-using-ai
Automated system for renaming and classifying images based on their content using deep neural networks
Language: Python - Size: 146 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

wangleihitcs/Papers
Some computer vision papers I have read: image-to-text generation, weakly supervised segmentation, etc.
Size: 201 MB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 126 - Forks: 20

chunix64/sd-bootstrap
Jupyter notebook (ipynb) for running Stable Diffusion Web UI on Google Colab
Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ekiim/vim-mathpix
Vim commands to use mathpix from your screen
Language: Shell - Size: 9.9 MB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 41 - Forks: 2

thefcraft/civitai-stable-diffusion-337k
Civitai Stable Diffusion 337k dataset: a dataset of AI-generated images
Language: Python - Size: 19.5 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 10 - Forks: 0

Hangover3832/ComfyUI-Hangover-Nodes
Various nodes for ComfyUI
Language: Python - Size: 9.45 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 40 - Forks: 9

yuanxiaosc/Image-Captioning
CNN encoder and RNN decoder (Bahdanau attention) for image captioning, i.e. image to text, on the MS-COCO dataset.
Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 35 - Forks: 13

amrkld/Image_to_text_OCR_project
This project is a Python application that uses Optical Character Recognition (OCR) to extract text from images. It leverages the following libraries:
Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

BinhQuocLy/Pdf2Quiz Fork of thejungwon/Pdf2Quiz
A Pdf2Quiz NLP model.
Language: Python - Size: 4.16 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

MurageKabui/AutoIT-OCRSpace-UDF
An AutoIT 3 wrapper around the OCRSpace API to convert images and PDFs to text.
Language: AutoIt - Size: 479 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 1

Subhashis360/ScreenQA
An exclusive tool that uses Google Text Extract and OpenAI ChatGPT together. 10x your productivity and explore new possibilities with ScreenQA today!
Language: Python - Size: 9.57 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ppraneeth270/img2text
Language: Python - Size: 58.6 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

iohanngrig/gptassistant
AI based apps
Language: Python - Size: 2.61 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

michelecafagna26/HL-dataset
[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.
Size: 5.67 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

RasmusML/XRayReport
X-ray images to text reports
Language: Jupyter Notebook - Size: 48.2 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Allenpandas/BLIP-ImageCaptioning Fork of salesforce/BLIP
Fork of BLIP image captioning from Salesforce
Language: Python - Size: 8.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sssingh/pic-to-story
A Large Language Model (LLM) Based App to Generate Stories from Pictures
Language: Python - Size: 30.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

VityaVitalich/IMAD
IMAD: IMage Augmented multi-modal Dialogue
Language: Python - Size: 1.11 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Jerey/image-to-pdf-and-txt
Python tool that takes 1..n images, tries to rotate them based on the text, extracts the text, and stores the 1..n images in a PDF.
Language: Python - Size: 11.3 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

dmdin/SceneDescriptor
🎞 Video editor with description generation for MTS TrueTech Hack
Language: Jupyter Notebook - Size: 179 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 3

Emsley1d/Project03-NutriCO2
A CRUD application; my third project for GA Software Engineering Immersive.
Language: Python - Size: 24.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

etosworld/etos-deepcut
Deep Extreme Cut (http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr): a tool for automatic object segmentation from extreme points.
Language: Python - Size: 1.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 24 - Forks: 4

kanocence/text-img
Language: Vue - Size: 2.65 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

davidserra9/cross-modal-retrieval-with-triplet-network Fork of cesc47/cross-modal-retrieval-with-triplet-network
Text-to-image and image-to-text retrieval model
Language: Python - Size: 4.71 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

eddieir/Image_to_Text
Language: Python - Size: 1.17 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

yhwang/im2txt-inference
Run a trained im2txt model in inference mode
Language: Python - Size: 129 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2
