Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf-document-processor

UglyToad/PdfPig

Read and extract text and other content from PDFs in C# (port of PDFBox)

Language: C# - Size: 131 MB - Last synced: about 11 hours ago - Pushed: 1 day ago - Stars: 1,473 - Forks: 214

alisafaya/txt-from-pdf

Extracting clean text from pdfs using pdfminer.six and pypdf.

Language: Python - Size: 21.5 KB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 1 - Forks: 0

pdfix/pdfix_sdk_builds

PDFix SDK release builds

Size: 11.2 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 3 - Forks: 1

tachneo/pdfcombiner

PDF Combiner PDF Combiner is a user-friendly, GUI-based tool built in Python for combining and splitting PDF files. It is easy to use, supports drag-and-drop, and allows you to adjust the order of files before combining.

Language: Python - Size: 19.5 KB - Last synced: 19 days ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

OnedocLabs/onedoc

The first developer-oriented document platform. Generate, host and track PDFs with a single API, beautifully.

Language: Python - Size: 209 KB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 47 - Forks: 0

kariemoorman/ocr-scripts

Collection of solutions for OCR tasks such as text extraction, image preprocessing, and document layout analysis.

Language: Jupyter Notebook - Size: 1.37 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 0 - Forks: 0

unidoc/unipdf

Golang PDF library for creating and processing PDF files (pure go)

Language: Go - Size: 112 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 2,368 - Forks: 245

orchetect/PDFGadget

Batch PDF operations for Swift

Language: Swift - Size: 255 KB - Last synced: 4 days ago - Pushed: 3 months ago - Stars: 4 - Forks: 0

Dtronix/PDFiumCore

.NET Standard P/Invoke bindings for PDFium.

Language: C# - Size: 59.8 MB - Last synced: 3 days ago - Pushed: 5 days ago - Stars: 128 - Forks: 18

opendocument-app/pdf2htmlEX-Android

pdf2htmlEX library port for Android - Convert PDF to HTML without losing text or format

Language: Java - Size: 18.9 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 24 - Forks: 11

michaelrsweet/pdfio

PDFio is a simple C library for reading and writing PDF files.

Language: C - Size: 7.21 MB - Last synced: 4 days ago - Pushed: 3 months ago - Stars: 159 - Forks: 36

PRITHIVSAKTHIUR/RAG-PDF-CHATBOT

(PDF) Information and Inference, Retrieval-Augmented Generation [ RAG ]

Language: Python - Size: 236 KB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 1 - Forks: 0

Phreak87/LeptonicaSharp

Full featured wrapper for leptonica 1.77.0

Language: Visual Basic - Size: 182 MB - Last synced: 6 days ago - Pushed: over 4 years ago - Stars: 8 - Forks: 5

Developer78-sgyuijhgygwtdwgyhutre45r5t/JavaPDFMerger

A tool to merge multiple PDFs together written in Java

Language: Java - Size: 34.2 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 1 - Forks: 0

manik-sethi/FluentDocs

A simple and efficient AI powered solution helping underserved immigrants with their essential documentation. Submission for HackDavis 2024

Language: Python - Size: 99.2 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 0 - Forks: 1

Thinqat1985731/Minimum-pdf-tools

Tools to add UI to pdf work

Language: Python - Size: 2.84 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 0 - Forks: 0

qpdf/qpdf

QPDF: A content-preserving PDF document transformer

Language: C++ - Size: 36.6 MB - Last synced: 21 days ago - Pushed: 22 days ago - Stars: 3,020 - Forks: 246

iSOLveIT/mkdocs-pdf-generate

An MkDocs plugin to generate individual PDF files from content pages.

Language: Python - Size: 11.3 MB - Last synced: 8 days ago - Pushed: 7 months ago - Stars: 6 - Forks: 0

KalyanM45/DocGenius-Revolutionizing-PDFs-with-AI

This is a Python application that allows you to load a PDF and ask questions about it using natural language. The application uses a LLM to generate a response about your PDF. The LLM will not answer questions unrelated to the document.

Language: Python - Size: 80.1 KB - Last synced: 4 days ago - Pushed: 29 days ago - Stars: 38 - Forks: 4

otanadzetsotne/pdf_squirrel

Pdf Squirrel offers tools for image-based document analysis, featuring block detection, PDF to image conversion, image normalization, selective blurring, and sentence highlighting. Ideal for developers in document processing and text analysis.

Language: Python - Size: 14.6 KB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

datalogics/pdf-rest-api-samples

pdfRest API Toolkit is a REST API service for processing PDF documents, made by developers, for developers. Rapidly integrate PDF workflows with your existing projects and applications, simply and seamlessly. Get started for free in seconds.

Language: Java - Size: 13.5 MB - Last synced: 9 days ago - Pushed: 10 days ago - Stars: 21 - Forks: 10

GowenGit/docnet

DocNET is as fast PDF editing and reading library for modern .NET applications

Language: C# - Size: 166 MB - Last synced: 7 days ago - Pushed: 7 months ago - Stars: 426 - Forks: 87

sidphbot/Auto-Research

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

Language: Python - Size: 429 KB - Last synced: 2 days ago - Pushed: 5 months ago - Stars: 48 - Forks: 6

DorKatzir/codebreaker

Code-Breaking

Language: PHP - Size: 716 KB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0

hellerbarde/stapler

A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk

Language: Python - Size: 146 KB - Last synced: 4 days ago - Pushed: 10 months ago - Stars: 281 - Forks: 52

GURPREETKAURJETHRA/Multi-PDFs_ChatApp_AI-Agent

Meet MultiPDF 📚 Chat AI App! 🚀 Chat seamlessly with Multiple PDFs using Langchain, Google Gemini Pro & FAISS Vector DB with Seamless Streamlit Deployment. Get instant, accurate responses from Awesome Google Gemini OpenSource language Model. 📚💬 Transform your PDF experience now! 🔥✨

Language: Python - Size: 8.04 MB - Last synced: 12 days ago - Pushed: 13 days ago - Stars: 33 - Forks: 19

praj2408/Realtime-Document-Chat-System

In this project, we used Langchain to create a ChatGPT for your PDF using Streamlit. We built an application that allows you to ask questions about a PDF document and get answers directly from an LLM (Large Language Model), like OpenAI's ChatGPT.

Language: Jupyter Notebook - Size: 4.65 MB - Last synced: 11 days ago - Pushed: 13 days ago - Stars: 40 - Forks: 12

easonlai/chat_with_pdf_table

The contents of this repository showcase how to extract table data from a PDF file and preprocess it to facilitate word embedding. This preprocessing step enhances the readability of table data for language models and enables us to extract more contextual information from the tables.

Language: Jupyter Notebook - Size: 85.9 KB - Last synced: 8 days ago - Pushed: 7 months ago - Stars: 6 - Forks: 2

maximum-software/pdf-forms-for-contact-form-7

Build Contact Form 7 forms from PDF forms. Get PDFs auto-filled and attached to email messages and/or website responses on form submission.

Language: PHP - Size: 2.34 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 8 - Forks: 11

darylalim/bart-large-cnn-abstract-summarization

Summarize abstracts of PDF arXiv papers.

Language: Jupyter Notebook - Size: 2.93 KB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 0 - Forks: 0

munenendereba/pdftextreader

Language: Python - Size: 2.93 KB - Last synced: 14 days ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

ammirsm/automatic-pancake

Active learning agent-based-simulation for systematic reviews and other types of technology assisted review (TAR) which will include PDF documents and other meta-datas in itself and it's based on both fulltext-screening decisions and title-screening decisions.

Language: Python - Size: 1.82 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 2 - Forks: 0

anonfaded/pdf-merger

This tool allows you to merge PDF files through a graphical user interface (GUI) or a command-line interface (CLI) on Windows, Linux, and Mac.

Language: Python - Size: 5.34 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 0 - Forks: 0

Josee9988/Compress-PDFs

A python CLI script to 𝗰𝗼𝗺𝗽𝗿𝗲𝘀𝘀 📦 all the 𝗣𝗗𝗙 files 𝗿𝗲𝗰𝘂𝗿𝘀𝗶𝘃𝗲𝗹𝘆 in a directory using the iLovePDF technology 🥰

Language: Python - Size: 45.9 KB - Last synced: 4 days ago - Pushed: over 2 years ago - Stars: 12 - Forks: 3

hussein-esmail7/course-extractor

Sorts course files into folders based on the text in that file

Language: Python - Size: 5.86 KB - Last synced: 4 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1

lovasoa/pagelabels-py

Python library to manipulate PDF page labels

Language: Python - Size: 44.9 KB - Last synced: 4 days ago - Pushed: over 2 years ago - Stars: 61 - Forks: 11

tstenvold/EmailPaySlip

A crude python script to read a PDF, split it, password it, and send out the payslips by email to employees

Language: Python - Size: 3.91 KB - Last synced: 18 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

johngodoi/BrokersNoteLoader

This application aims to convert some broker's note into a formatted text that can be easily imported to a spreadsheet.

Language: Scala - Size: 4.88 KB - Last synced: 19 days ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

tagtog/demo-webhooks

Quick example to connect a spaCy model to tagtog using webhooks 🤖

Language: Python - Size: 39.1 KB - Last synced: 19 days ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 1

benyoung32/UCDMB_Library

PDF Utilities designed for music preparation

Language: Python - Size: 169 MB - Last synced: 28 days ago - Pushed: 28 days ago - Stars: 0 - Forks: 0

thomasvanholder/browserless

A Ruby wrapper for the Browserless PDF API with support for modern CSS such as TailwindCSS

Language: Ruby - Size: 23.4 KB - Last synced: 23 days ago - Pushed: 9 months ago - Stars: 2 - Forks: 1

IBM/generate-insights-from-data-formats-with-watson

How do we process data in different formats like docx, pdf etc and generate insights to be linked with structured data in database?This pattern helps in establishing relations between structured & unstructured data to generate recommendations using Watson NLU & Watson Studio.

Language: Jupyter Notebook - Size: 1.06 MB - Last synced: 25 days ago - Pushed: almost 4 years ago - Stars: 13 - Forks: 16

svenssonaxel/pdf-sign

A tool to sign PDF files. With Linux support.

Language: Python - Size: 355 KB - Last synced: 19 days ago - Pushed: about 2 months ago - Stars: 109 - Forks: 2

ragesh2000/chat-with-your-pdf

Language: Python - Size: 6.84 KB - Last synced: 25 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

abarker/pdfCropMargins

pdfCropMargins -- a program to crop the margins of PDF files

Language: Python - Size: 9.19 MB - Last synced: 21 days ago - Pushed: 22 days ago - Stars: 323 - Forks: 32

sailist/chatgpt-enhancement-extension

An all-in-one plugin to improve your ChatGPT experience!

Language: TypeScript - Size: 24.4 MB - Last synced: 28 days ago - Pushed: about 1 year ago - Stars: 330 - Forks: 27

IBM/science-result-extractor

Language: Java - Size: 120 MB - Last synced: 25 days ago - Pushed: almost 2 years ago - Stars: 90 - Forks: 17

wmjordan/PDFPatcher

PDF补丁丁——PDF工具箱,可以编辑书签、剪裁旋转页面、解除限制、提取或合并文档,探查文档结构,提取图片、转成图片等等

Language: C# - Size: 30.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 8,438 - Forks: 1,202

dx-han/pieRS

pieRS is an online text processor with offline local storage written in Rust, with support for note, code, PDF, and mindmap.

Size: 17.6 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

pleb631/PdfDet

PdfDet aims to simplify PDF layout detect tasks for users.

Language: Python - Size: 14.7 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1 - Forks: 0

bytescout/pdfco-rails

PDF.co Gem plugin for Ruby on Rails

Language: Ruby - Size: 13.7 KB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 1 - Forks: 1

MBAigner/PDFContentConverter

A tool for converting PDF text as well as structural features into a pandas dataframe.

Language: Python - Size: 163 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 6 - Forks: 3

localauthor/pdf-pagelabels

Transient interface for pagelabels.py in Emacs

Language: Emacs Lisp - Size: 87.9 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

parthgupta1208/PDF2PPTGenerator

PDF2PPT Generator is a Python tool that creates Powerpoint presentations from PDF files by using smart summarization techniques assisted by GPT-3.5-Turbo

Language: Python - Size: 7.4 MB - Last synced: 13 days ago - Pushed: 12 months ago - Stars: 8 - Forks: 6

zombie110year/pdfwork

处理 PDF 的一些工具

Language: Python - Size: 195 KB - Last synced: 18 days ago - Pushed: about 3 years ago - Stars: 2 - Forks: 0

clemensheithecker/pdf-duplex-scan

Double-sided scanning without a duplex scanner. An app to fix the page order of a double-sided PDF scan from a non-duplex scanner.

Language: HTML - Size: 224 KB - Last synced: about 2 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 1

pankajr141/pdf2jpg

Utility to convert PDF into JPG files

Language: Java - Size: 4.22 MB - Last synced: 10 days ago - Pushed: about 1 year ago - Stars: 50 - Forks: 20

CalebHendren/AppendAnswerSheet

This desktop application is meant to append an answer sheet for written/essay questions to scanforms for services such as ZipGrade.

Language: Python - Size: 13.3 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

hoehermann/pypdf_strreplace

Search and replace text in PDF files with PyPDF.

Language: Python - Size: 335 KB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0

pdflexer/pdflexer

.net pdf parsing library

Language: C# - Size: 64.4 MB - Last synced: 9 days ago - Pushed: 3 months ago - Stars: 16 - Forks: 1

casie-aviles/botpdf-llama2-chatbot

A simple Large Language Model (LLM) chatbot project, where users can upload PDF files to receive tailored responses generated directly from the document contents.

Language: Python - Size: 74.2 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0

DinjanAI/PDF_chatbot

📄💬 Meet our PDF Chatbot! Extract insights from PDFs effortlessly. Upload, query, and get instant answers! 🤖🔍 #AI #PDF #Chatbot

Language: Python - Size: 26.5 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

jennis0/burdoc

Advanced PDF parsing for python

Language: HTML - Size: 18 MB - Last synced: about 1 month ago - Pushed: 11 months ago - Stars: 2 - Forks: 0

pdf2htmlEX/pdf2htmlEX

Convert PDF to HTML without losing text or format.

Language: HTML - Size: 133 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 3,218 - Forks: 360

madisonhk/SeniorProject

Senior Project: Summary Report Generator

Language: HTML - Size: 14.6 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

bgorman87/PDF-Flow

PDF Report Processor

Language: Python - Size: 333 MB - Last synced: 22 days ago - Pushed: 3 months ago - Stars: 2 - Forks: 0

Academic-Hammer/PDFConverter

Converting pdf to any format for easily analyzing

Language: Python - Size: 152 KB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 10 - Forks: 3

amadeusferro/English-language-reading-assistant

Introducing an English reading assistant—a web application using NLP to enhance understanding of English documents. It allows users to upload English PDFs, employing NLP to highlight recurring words and their definitions.

Language: Python - Size: 327 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

mcagriaksoy/diff_merge_pdf

A tool for compare, merge, display difference and make OCR between the PDFs.

Language: Python - Size: 1.3 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

MengWoods/sign-pdf-with-transparent-background-signature

Sign PDF. Extract signature from a picture and sign the transparent-background signature to a PDF.

Language: Python - Size: 87.2 MB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 3 - Forks: 1

rk1708-coder/pdf-to-image-converter

Free PDF to Image Converter Using PDF.js

Language: HTML - Size: 615 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

Ogwenya/pdf-stitch

PDF Stitch is a desktop application designed to facilitate the rearrangement of PDF pages with ease.

Language: JavaScript - Size: 716 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

juricaKenda/PDF-Merger-Revised

Pdf merger / extractor desktop application

Size: 7.79 MB - Last synced: 4 months ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0

juricaKenda/PDFmerger

A .pdf file merger that allows merging multiple files into one .pdf file

Language: Java - Size: 5.86 KB - Last synced: 4 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

akifislam/Complex-PDF-MCQ-Scraper

A Script to Analyze thousands of complex PDFs with text, tables, graphs and input them in a xls file within seconds.

Language: Python - Size: 917 KB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2

ksharindam/pdfcook

Prepress preparing tool and PDF editor

Language: C++ - Size: 124 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 15 - Forks: 5

plain-jane-gray/PFAS-web-and-PDF-scrape

Scrapes hazardous waste data from a website and PDF file. Cleans and analyzes the data. Prepares the data for mapping.

Language: Jupyter Notebook - Size: 8.9 MB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

BeHappy0o0o0o0/pdf_information_extraction

提取非扫描版pdf表格信息的py3脚本

Language: Python - Size: 15.6 KB - Last synced: 5 months ago - Pushed: almost 4 years ago - Stars: 4 - Forks: 4

pcdi/cambridge_core_downloader

Download and merge PDFs from Cambridge Core

Language: Python - Size: 166 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 2 - Forks: 1

Jeff-Tian/doc-rotary

gives me a docx/pptx/xlsx, I'll give you a pdf

Language: Python - Size: 102 MB - Last synced: 25 days ago - Pushed: 6 months ago - Stars: 3 - Forks: 4

department-of-veterans-affairs/DAPM-PFAS-PACT-ACT

Scrapes hazardous waste data from a website and PDF file for PACT Act. Cleans the data to prepare it for mapping.

Language: Jupyter Notebook - Size: 15.1 MB - Last synced: 21 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 1

vivekweb2013/pdf-utils

An android app to perform different operations on pdf files

Language: Java - Size: 261 KB - Last synced: 4 days ago - Pushed: almost 3 years ago - Stars: 10 - Forks: 1

codeart-ist/qna-with-pdf

Chat with your pdf files.

Language: TypeScript - Size: 18.6 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

pcerman/gambit-podofo

Gambit scheme binding to the podofo library

Language: C++ - Size: 1.28 MB - Last synced: 5 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

achronos0/pdful

PDF editor/updater library

Language: TypeScript - Size: 3.73 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

fastpdfservices/fastpdf-python

Python SDK for Fast PDF Service

Language: Python - Size: 53.7 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

andruhovski/aspose-pdf-js

Aspose.PDF for JavaScript via C++

Language: HTML - Size: 45.8 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

jmw8033/Pewter

Emailed invoice file handler

Language: Python - Size: 115 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 2 - Forks: 0

houking-can/CCKS2019-Task5

CCKS2019评测任务五-公众公司公告信息抽取,第3名

Language: Python - Size: 54.4 MB - Last synced: 6 months ago - Pushed: over 4 years ago - Stars: 123 - Forks: 26

mykeysid10/Invoice-PDF-QnA-System

Procurement Sector | Information Retrieval | NLP

Language: Python - Size: 6.45 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

leonardo-bm/textract

Extração do texto de um arquivo PDF

Language: Jupyter Notebook - Size: 1000 Bytes - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

libmanuk/DOSPDFtkSpliter

This DOS batch script uses PDFtk for Windows from the DOS commandline to split a multipage PDF file into separate files.

Language: Batchfile - Size: 8.79 KB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

MrBased/Limequick

Orga (ICS2813) essay peer assessment made quick and easy

Language: Python - Size: 261 KB - Last synced: 6 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

hirusha-adi/easy-pdf

manipulate and manage PDF files easily

Language: Python - Size: 4.88 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

MarcBuch/TR-PDF-Parser

Parses invoice PDF files from the german brokerage Trade Republic

Language: Python - Size: 34.2 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 1

kumarvipu1/LinkPDF

Links scanned pdf, specifically engineering drawings based on the internal drawing references.

Language: Jupyter Notebook - Size: 463 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

avr2002/CV-JD-Matching

Extracting details from Resume(CVs) and matching with Job Description(JDs) using pretrained model like DistilBERT and ranking them using cosine similarity.

Language: Jupyter Notebook - Size: 59.2 MB - Last synced: 3 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 1

hr097/cropbox-python-api

Open API for PDF cropping for Meesho and Flipkart. Orders Receipts. Additionally, It is deployed on render

Language: Python - Size: 749 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

patnicolas/floorplan

Evaluate various techniques to extract and organize information from a floor plan

Language: HTML - Size: 4.54 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

koulkoudakis/pdf-watermarker

Simple script adds predetermined watermark to .pdf file

Language: Python - Size: 157 KB - Last synced: 8 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0