An open API service providing repository metadata for many open source software ecosystems.

Topic: "image-to-text"

thiagoalessio/tesseract-ocr-for-php

A wrapper to work with Tesseract OCR inside PHP.

Language: PHP - Size: 1.09 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 2,961 - Forks: 552

lucidrains/CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Language: Python - Size: 564 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 1,127 - Forks: 88

killkimno/MORT

MORT 번역기 프로젝트 - Real-time game translator with OCR

Language: C# - Size: 172 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 842 - Forks: 55

PaddlePaddle/PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

Language: Python - Size: 177 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 628 - Forks: 210

Flame-Code-VLM/Flame-Code-VLM

Flame is an open-source multimodal AI system designed to translate UI design mockups into high-quality React code. It leverages vision-language modeling, automated data synthesis, and structured training workflows to bridge the gap between design and front-end development.

Language: Python - Size: 7.24 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 470 - Forks: 28

zapolnoch/node-tesseract-ocr

A Node.js wrapper for the Tesseract OCR API

Language: JavaScript - Size: 516 KB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 311 - Forks: 38

google/imageinwords

Data release for the ImageInWords (IIW) paper.

Language: JavaScript - Size: 21.4 MB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 209 - Forks: 9

Yushi-Hu/tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language: Python - Size: 6.08 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 159 - Forks: 9

yardstick17/image_text_reader

The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.

Language: Python - Size: 6.31 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 147 - Forks: 43

nateshmbhat/card-scanner-flutter

A flutter package for Fast, Accurate and Secure Credit card & Debit card scanning

Language: Swift - Size: 32.6 MB - Last synced at: 7 months ago - Pushed at: 9 months ago - Stars: 108 - Forks: 104

MIMICLab/L-Verse

L-Verse: Bidirectional Generation Between Image and Text

Language: Python - Size: 1.83 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 108 - Forks: 6

shoryasethia/markdrop

A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functionalities. Markdrop is available on PyPI.

Language: Python - Size: 158 KB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 101 - Forks: 5

mshdabiola/NotePad

Notepad is multi module Jetpack compose note taking app with sketch pad, voice recorder, image capturing app

Language: Kotlin - Size: 8.78 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 99 - Forks: 9

MuhametSmaili/note-it

OCR functionality in a feature-rich note-taking extension.

Language: TypeScript - Size: 6.42 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 97 - Forks: 4

BEPb/image_to_ascii

Everything is very simple: you either download a picture file or specify its link when running a python script, and output you get a text file, and you can immediately view on the command line how it will look the result of your conversion.

Language: Python - Size: 1.68 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 89 - Forks: 11

NormXU/nougat-latex-ocr

Codebase for fine-tuning / evaluating nougat-based image2latex generation models

Language: Python - Size: 126 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 85 - Forks: 11

untrix/im2latex

Solution to im2latex request for research of openai

Language: Jupyter Notebook - Size: 269 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 83 - Forks: 20

farhanchoudhary/PAN_Card_OCR_Project

To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format

Language: Python - Size: 650 KB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 79 - Forks: 66

glami/glami-1m

The largest multilingual image-text classification dataset. It contains fashion products.

Language: Jupyter Notebook - Size: 5.43 MB - Last synced at: 5 days ago - Pushed at: almost 2 years ago - Stars: 72 - Forks: 7

Carleslc/ImageToText

OCR with Google's AI technology (Cloud Vision API)

Language: Python - Size: 18.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 70 - Forks: 16

fny/swiftocr

macOS OCR command-line tool for almost any image format

Language: Python - Size: 202 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 58 - Forks: 3

aimagelab/safe-clip

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024

Language: Python - Size: 17.5 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 58 - Forks: 0

amit-y11/the_ocr_bot

Telegram bot to convert image to text using python

Language: Python - Size: 94.7 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 54 - Forks: 45

geoffsmith82/Symposium2023

Demonstrates Voice Recognition, Text to Speech, Language Translation, OAuth2, Image Generation, Face Detection and Voice Chatbot.

Language: Pascal - Size: 8.16 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 53 - Forks: 12

pharmapsychotic/comfy-cliption

Image to text with CLIP ViT-L/14 in ComfyUI

Language: Python - Size: 1.22 MB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 49 - Forks: 2

DS2BRAIN/ds2

Easiest way to use AI models without coding (Web UI & API support)

Language: Python - Size: 243 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 48 - Forks: 32

bensonruan/Tesseract-OCR

Tesseract.js OCR

Language: HTML - Size: 2.29 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 45 - Forks: 27

torresflo/Tag-Machine

A little Python application to auto tag your photos with the power of machine learning.

Language: Python - Size: 3.1 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 42 - Forks: 6

zhangming8/Dango-ocr

DangoOCR: screenshot OCR recognize 文字识别,支持多种语言,识别后翻译,播放声音

Language: Python - Size: 613 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 40 - Forks: 6

Akascape/TEXTEMAGE

A simple image to text converter with GUI!

Language: Python - Size: 398 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 6

affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition

Inverse DALL-E for Optical Character Recognition

Language: Python - Size: 6.9 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 6

pollinations-ai/pollinations.ai

Work with the best generative AI from Pollinations using this Python SDK. 🐝

Language: Python - Size: 11.2 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 37 - Forks: 5

visinf/lnfmm

Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)

Language: Python - Size: 1000 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 33 - Forks: 12

zsdonghao/im2txt2im

I2T2I: Text-to-Image Synthesis with textual data augmentation

Language: Python - Size: 1.66 MB - Last synced at: 12 days ago - Pushed at: about 6 years ago - Stars: 30 - Forks: 3

Zebbeni/ansizalizer

A TUI to convert Images to ANSI strings using bubbletea

Language: Go - Size: 51.2 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 29 - Forks: 5

N0iire/Image-to-text-Translate

Image to text translator using Open AI API & Tesseract

Language: Python - Size: 1.72 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 5

NanoNets/ocr-python

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

Language: Jupyter Notebook - Size: 5.52 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 4

zeeshanali-k/Classy

Text to image generation and Image Captioning Android, iOS, Desktop and Web app using Compose Multiplatform with Clean Architecture

Language: Kotlin - Size: 21.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 1

ITE-5th/image-captioning-gan

Language: Python - Size: 24.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 6

FeiElysia/awesome-zero-shot-captioning

A curated list of zero-shot captioning papers

Size: 15.6 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 1

AliShazly/ascii-py

Convert images or videos to ASCII in the terminal

Language: Python - Size: 13.6 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 22 - Forks: 5

zer0int/CLIP-XAI-GUI

CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models

Language: Python - Size: 3.46 MB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 20 - Forks: 1

sujjeee/imagealt

Create alt text for any image in a few clicks with this free and open-source tool. Improve the accessibility and SEO of your content with this simple and effective tool!

Language: JavaScript - Size: 1.03 MB - Last synced at: about 10 hours ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 1

Viresh-R/ml-CCA

Implementation of Fast ml-CCA from the ICCV-2015 work "Multi-Label Cross-Modal Retrieval"

Language: Matlab - Size: 1.95 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 19 - Forks: 3

Hermann-web/python-OCR

Converting invoice pdf to image, image to text and then get, from the text, invoice informations like invoice number or vendor name

Language: Jupyter Notebook - Size: 177 MB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 2

Spidy20/Optical_Character_Reccognition

In this system we need to enter an image(like government document) ,it can convert image data into string

Language: Python - Size: 200 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 12

zabir-nabil/autoocr

Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)

Language: Python - Size: 1.13 MB - Last synced at: about 24 hours ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 4

ahmedgulabkhan/TEI2S

TEI2S is a project which is really helpful for the visually impaired, in a sense that it takes an image containing text embedding as the input, extracts the text from the image, and converts this text to speech, i.e; the output is an audio file containing the text which is embedded in the provided input image.

Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 4

zer0int/CLIP-text-image-interpretability

Get CLIP ViT text tokens about an image, visualize attention as a heatmap.

Language: Python - Size: 34.2 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 13 - Forks: 1

muchdogesec/file2txt

Turn a supported list of filetypes (e.g. .docx) into a markdown structured text file. Also optionally defangs indicators and extract texts from images. Built for threat intel use-cases.

Language: Python - Size: 39.3 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 12 - Forks: 2

fahmiaziz98/receipt_parsing

receipt parsing using donut model, next we will add using LLM + OCR or VLM

Language: Jupyter Notebook - Size: 6.51 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 12 - Forks: 4

FlyingFathead/OCR-CopyPastePad

A simple Python + Tkinter + Tesseract-based GUI image-to-text copypaste pad application

Language: Python - Size: 494 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

sergiss/image-ascii

💾 Image to ASCII converter. Upload your image and enjoy ;)

Language: JavaScript - Size: 253 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 1

theevann/Image-and-Text-Search

Joint representation of image and text through a Canonical Correlation Analysis

Language: Python - Size: 45.5 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 10 - Forks: 7

Albert-Zhan/php-tesseract-ocr

PHP Tesseract OCR is a C++ extension of PHP for character recognition and OCR learning in PHP environment.

Language: C++ - Size: 66.4 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 4

AnubhavYadavBCA25/Aurora-AI-Project

Aurora: AI Powered Data Analytics Tool which help users to automate and make their "Data Analysis" tasks easy.

Language: HTML - Size: 2.84 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 2

dev-aniketj/TextScannerApp-Android

Simple OCR App, it is use to scanner the text from the Picture.

Language: Java - Size: 4.3 MB - Last synced at: 20 days ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 8

kosiken/lion-image-to-ascii

A C++ Program that prints a given image as ascii characters

Language: C++ - Size: 3.86 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 0

yuanxiaosc/Image_to_Text

Taking the image description task on the MS-COCO data set as an example, the template code of Image_to_Text is shown.

Language: Jupyter Notebook - Size: 4.99 MB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 3

kaist-cvml/I-HallA-v1.0

[AAAI 2025] Official Implementation of I-HallA v1.0

Language: Python - Size: 49.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 1

michelecafagna26/VinVL

Original VinVL (and Oscar) repo with API designed for an easy inference

Language: Python - Size: 40.7 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

RealEngineAI/java-sdk

SDK for RealEngine.ai

Language: Java - Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

RealEngineAI/wp-plugin

WordPress plugin for RealEngine.ai

Language: PHP - Size: 52.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

igolaizola/askimg

askimg answers questions about images using AI

Language: Go - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

lukemccrea/Textify

Textify is a Javascript library that converts your images into text!

Language: JavaScript - Size: 2.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

santhalakshminarayana/image-to-pdf-text-speech

Image conversion to PDF document, text document, speech.

Language: Jupyter Notebook - Size: 8.02 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 5

Alex0Blackwell/image-to-ASCII-converter

Take images from your ./imgs/ folder and make them into ASCII text drawings in True Colour!

Language: Python - Size: 5 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

ahsplore/TalkitOut-TTS-web-application-python

TalkItOut is a Python and Flask-based web application that can convert text to speech, choose your preferred language for audio output, access a built-in dictionary for word meanings, and even extract text from images, complete with audio generation.

Language: HTML - Size: 9.13 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 2

Nexdata-AI/101-People-4538-Images-Japanese-Handwriting-OCR-Data

Japanese Handwriting OCR Dataset

Size: 5.56 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 1

yjg30737/pyqt-image-to-text

PyQt GUI example of image to text using image-to-text model

Language: Python - Size: 8.34 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

ArnabKumarRoy02/Image-Caption-Generator

This project is a part of the semester long research-based Mini Project under Prof. Mr. Vikas Kumar Singh. This returns textual description or annotations for an input image.

Language: Jupyter Notebook - Size: 122 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

RealEngineAI/js-sdk

SDK for RealEngine.ai

Language: TypeScript - Size: 13.7 KB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

judemont/ascii-art-generator

Transform an image into asciis characters.

Language: Python - Size: 57.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

IshaanOhri/Capture

Capture is a python based desktop application that lets you capture the text which otherwise cannot be copied. It saves the time spent on software/website to get the text. Just select and voila, your text is copied to the clipboard.

Language: Python - Size: 5.73 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

arthurdjn/img2poem-pytorch

PyTorch implementation of the paper ‟Beyond Narrative Description: Generating Poetry from Images” by B. Liu et al., 2018.

Language: Python - Size: 12.7 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

aquatiko/Image-Text-Speech-Synthesizer-Converter

Converts image to speech to text using python and it's GUI feature

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 1

blacktop/what-dis

Dumb image-to-text experiment

Language: Go - Size: 1.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3 - Forks: 0

JhonnySalles/MangaExtractor

Image processing and character recognition, transforming into editable text. Extraction of bubbles in comics/manga.

Language: Python - Size: 140 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

PRITHIVSAKTHIUR/Omni-Reasoner-Vision

Omni Reasoner for Vision

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

uk0/Kmars.ai_AI_Image_Analyzer

AI Image Analyzer for ollama mistral.rs molmo in Mac M2 max (Screen Capture Analyzer ,Camera Capture Analyzer)

Language: HTML - Size: 7.28 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

SubhangiSati/KathaSangam-AI-Story-Generator

It is an ultimate platform for creating and exploring imaginative stories. Whether you're an aspiring author, a creative enthusiast, or someone looking for a fun escape, KathaSangam is here to help you craft unique narratives and immerse yourself in a world of limitless possibilities.

Language: Python - Size: 2.26 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

zcemycl/qa-chatgpt-hf-pgvector

E-commerce fashion assistant with Chatgpt, Hugging Face, Ltree and Pgvector.

Language: Python - Size: 2.2 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

rbiswasfc/benetech-mga

2nd Place solution for Benetech - Making Graphs Accessible Kaggle Competition

Language: Python - Size: 130 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

cosmic-heart/Benetech-Chart-Derendering

Benetech Kaggle Competition Work. Fine Tuning Matcha (Multi Modal Transformer) on Line, Scatter, Dot, Horizontal and Vertical bar dataset.

Language: Jupyter Notebook - Size: 8.88 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

nikD305/solutionAI-Image-Video-Solver-Summarizer-React

SolutionAI App which can solve any problems or summarize any Image or Youtube video of any duration to the shortest summary you need.

Language: JavaScript - Size: 37.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

akashkamati/Extract-Text-From-Images-Using-ML-Kit-Text-Recognition-API

Language: Kotlin - Size: 99.6 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

thehanimo/ocr-bot 📦

An action to automatically extract keywords from images in issue bodies, making them searchable 🔍

Language: JavaScript - Size: 3.84 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

ipchelsea/MIRANDA

Proposed an app to assist in de-escalating law enforcement situations and inform users of Miranda rights.

Language: Python - Size: 11.1 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

nobi1007/Characterize

Converts the given image(.jpg, .jpeg, .png) in its similar text image.

Language: HTML - Size: 889 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 2

SeanLachhander/Steganography

Hide a text document (text.txt) within the same folder as this program into an image (host_image.jpg). The GUI will show the host image in the left panel. The hiding will be performed by replacing the ’n’ lower order bits of image with the ASCII values of the ‘m’ number of characters from the text document. The more text you embed into the image, the lesser the quality the image will be. The GUI will show the value of ’n’ and ‘m’, where ’n’ can vary from 0 to 8 and ‘m’ can be any value between 0 and the maximum number of characters that can be embedded in the image.

Language: Java - Size: 47.9 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 1

DavidGDA/image-to-text-web-app

Esta es una app web la cual se encarga de obtener texto de imágenes con texto digital

Language: TypeScript - Size: 0 Bytes - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 2 - Forks: 0

UtpaL2102/Dark-Pattern-detector

Dark pattern detection

Language: Jupyter Notebook - Size: 683 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

rai62/screanswer

A command line tool to answer text on the screen for macOS users

Language: Go - Size: 134 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

Rayyan9477/OCR-Image-to-text

Developed an OCR Image-to-Text application using Python and Streamlit, focusing on accurate text extraction and image preprocessing. Enhanced reliability and performance, enabling seamless conversion of diverse image formats into editable text.

Language: Python - Size: 628 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

Kanaries/photes-io-obsidian-plugin

AI Image to text notes plugin in obsidian

Language: TypeScript - Size: 65.4 KB - Last synced at: 21 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 2

notkiyo/Rina-s-Realm

RinaBot is an interactive Discord bot offering anime and manga insights, character information, and AI chat. It processes images to generate captions and enhances server engagement. Currently under development . Perfect for adding a touch of fun and interactivity to your Discord server!

Language: Python - Size: 374 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

Raihan4520/Stable-Diffusion

This repository demonstrates how to use Hugging Face's pre-trained models for Text-to-Image and Image-to-Text generation using Stable Diffusion.

Language: Jupyter Notebook - Size: 30.8 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

natzcam/screader

screen capture to text to clipboard tool

Language: JavaScript - Size: 10.4 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

email2sunilverma/SImageToTextProcess

Project provides basic idea and approach to implement the Recognizing Text in Images by using apple provided framwork Visson.

Language: Swift - Size: 203 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

itsvijaysingh/Alt-Text-Generator

Generate relevant alt text for images using AI.

Language: JavaScript - Size: 487 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0