An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: image2text

TAO71-AI/I4.0

TAO71 I4.0 is an AI created by TAO71 in Python.

Language: Python - Size: 2.98 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 0

lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language: Python - Size: 9.06 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 14,245 - Forks: 1,131

OleehyO/TexTeller

TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.

Language: Python - Size: 79.8 MB - Last synced at: 25 days ago - Pushed at: 26 days ago - Stars: 521 - Forks: 57

prabhakar267/image2text

:clipboard: Python wrapper to grab text from images and save as text files using Tesseract Engine

Language: Python - Size: 5.42 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 406 - Forks: 140

TheLime1/CheatoMate

A collection of scripts to "help" you with your programming exams and assignments.

Language: Python - Size: 214 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 1

morzellen/renamer-classifier-images-using-ai

Автоматизированная система переименования и классификации изображений на основе их содержания с использованием глубоких нейронных сетей

Language: Python - Size: 146 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

wangleihitcs/Papers

读过的CV方向的一些论文,图像生成文字、弱监督分割等

Size: 201 MB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 126 - Forks: 20

chunix64/sd-bootstrap

Ipynb for Stable Diffusion Web UI run on Google Colab

Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ekiim/vim-mathpix

Vim commands to use mathpix from your screen

Language: Shell - Size: 9.9 MB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 41 - Forks: 2

thefcraft/civitai-stable-diffusion-337k

Civitai Stable Diffusion 337k Dataset; dataset of ai generated image

Language: Python - Size: 19.5 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 10 - Forks: 0

Hangover3832/ComfyUI-Hangover-Nodes

Various nodes for ComfyUI

Language: Python - Size: 9.45 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 40 - Forks: 9

yuanxiaosc/Image-Captioning

CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述

Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 35 - Forks: 13

amrkld/Image_to_text_OCR_project

This project is a Python application that uses Optical Character Recognition (OCR) to extract text from images. It leverages the following libraries:

Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

BinhQuocLy/Pdf2Quiz Fork of thejungwon/Pdf2Quiz

A Pdf2Quiz NLP model.

Language: Python - Size: 4.16 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

MurageKabui/AutoIT-OCRSpace-UDF

An AutoIT 3 wrapper around the OCRSpace API to convert images and PDFs to text.

Language: AutoIt - Size: 479 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 1

Subhashis360/ScreenQA

This is a Exclusive Tool that use Google Text Extract and Openai Chatgpt Together And 10X Your your productivity and explore new possibilities with ScreenQA today!

Language: Python - Size: 9.57 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ppraneeth270/img2text

Language: Python - Size: 58.6 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

iohanngrig/gptassistant

AI based apps

Language: Python - Size: 2.61 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

michelecafagna26/HL-dataset

[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.

Size: 5.67 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

RasmusML/XRayReport

X-ray images to text reports

Language: Jupyter Notebook - Size: 48.2 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Allenpandas/BLIP-ImageCaptioning Fork of salesforce/BLIP

Folk BLIP ImageCaptioning from salesforce

Language: Python - Size: 8.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sssingh/pic-to-story

A Large Language Model (LLM) Based App to Generate Stories from Pictures

Language: Python - Size: 30.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

VityaVitalich/IMAD

IMAD: IMage Augmented multi-modal Dialogue

Language: Python - Size: 1.11 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Jerey/image-to-pdf-and-txt

Python tool, which takes 1..n images, tries to rotate them based on the text, extract the text and store 1..n images to a pdf.

Language: Python - Size: 11.3 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

dmdin/SceneDescriptor

🎞 Video editor with description generation for MTS TrueTech Hack

Language: Jupyter Notebook - Size: 179 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 3

Emsley1d/Project03-NutriCO2

A CRUD application; my third project for GA Software Engineering Immersive.

Language: Python - Size: 24.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

etosworld/etos-deepcut

Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr . a tool to do automatically object segmentation from extreme points.

Language: Python - Size: 1.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 24 - Forks: 4

kanocence/text-img

Language: Vue - Size: 2.65 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

davidserra9/cross-modal-retrieval-with-triplet-network Fork of cesc47/cross-modal-retrieval-with-triplet-network

Text-to-Image and Image-to-Text model retrieval

Language: Python - Size: 4.71 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

eddieir/Image_to_Text

Language: Python - Size: 1.17 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

yhwang/im2txt-inference

Run im2txt trained model in inference mode

Language: Python - Size: 129 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 2