GitHub topics: pdf-document-processor
qpdf/qpdf
qpdf: A content-preserving PDF document transformer
Language: C++ - Size: 39.4 MB - Last synced at: about 12 hours ago - Pushed at: 1 day ago - Stars: 3,968 - Forks: 309

unidoc/unipdf
Golang PDF library for creating and processing PDF files (pure go)
Language: Go - Size: 124 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2,788 - Forks: 265

anandaworldwide/ananda-library-chatbot Fork of mayooear/ai-pdf-chatbot-langchain
A ChatGPT chatbot app for multiple Large PDF files, audio files, and YouTube videos. Optionally generate the PDF fileset from a Wordpress database. Transcribe mp3 files en masse. Download YouTube videos en masse and transcribe their audio. Allow users to share the best answers they get with each other through a social sharing interface.
Language: TypeScript - Size: 77.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2 - Forks: 0

dhyeygamdha/PD
A cross-platform Python orchestrator that bootstraps Go, installs ProjectDiscovery tools (Subfinder, HTTPX, URLFinder, Nuclei + templates), and runs a streamlined recon pipeline (subfinder → httpx → urlfinder → nuclei), outputting plain-text results per domain.
Language: Python - Size: 13.7 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

UglyToad/PdfPig
Read and extract text and other content from PDFs in C# (port of PDFBox)
Language: C# - Size: 167 MB - Last synced at: 1 day ago - Pushed at: 17 days ago - Stars: 2,004 - Forks: 258

Dinesh210805/Pdfit
"PdfIt - A Powerful PDF Converter Tool" PdfIt is a versatile command-line tool that allows you to convert various file formats to PDF, merge PDFs, split PDFs into separate pages, and extract text from PDFs. With a user-friendly interface, PdfIt makes working with PDF files easier than ever.
Size: 3.43 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

run-llama/llama_cloud_services
Knowledge Agents and Management in the Cloud
Language: Python - Size: 46.1 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 3,956 - Forks: 405

masesgroup/NetPDF
.NET suite for PDFBox™
Language: C# - Size: 13.5 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

wmjordan/PDFPatcher
PDF补丁丁——PDF工具箱,可以编辑书签、剪裁旋转页面、解除限制、提取或合并文档,探查文档结构,提取图片、转成图片等等
Language: C# - Size: 46.7 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 9,957 - Forks: 1,311

StabRise/ScaleDP
ScaleDP is an Open-Source extension of Apache Spark for Document Processing
Language: Python - Size: 7.88 MB - Last synced at: about 19 hours ago - Pushed at: about 2 months ago - Stars: 11 - Forks: 0

reezuleanu/pdf_deconstructor
Decompose a PDF file based on its headers for RAG ingestion.
Language: Python - Size: 13.7 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

datalogics/pdf-rest-api-samples
pdfRest API Toolkit is a REST API service for processing PDF documents, made by developers, for developers. Rapidly integrate PDF workflows with your existing projects and applications, simply and seamlessly. Get started for free in seconds.
Language: Java - Size: 13.7 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 26 - Forks: 10

Aadit-17/AI-Assistant-for-PDFs
AI Assistant for PDFs
Language: Python - Size: 12.7 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

chinapandaman/PyPDFForm
:fire: The Python library for PDF forms.
Language: Python - Size: 89 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 549 - Forks: 34

PRITHIVSAKTHIUR/RAG-PDF-CHATBOT
(PDF) Information and Inference, Retrieval-Augmented Generation [ RAG ]
Language: Python - Size: 1.06 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 9 - Forks: 0

khadijanazih/pdftool
Extract Specific Data from PDF Files and Rename Files in Folder
Language: Python - Size: 30.3 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

pdfix/pdfix_sdk_builds
PDFix SDK release builds
Size: 11.9 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 6 - Forks: 1

StabRise/spark-pdf
PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it
Language: Scala - Size: 5.72 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 49 - Forks: 3

michaelrsweet/pdfio
PDFio is a simple C library for reading and writing PDF files.
Language: C - Size: 8.74 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 246 - Forks: 53

tiwna255/Adobe-Acrobat-Reader
industry-leading-software-for-viewing,-printing,-and-annotating-PDF-documents.
Size: 0 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

iamrkxigit/Adobe-Acrobat-Reader
industry-leading-software-for-viewing,-printing,-and-annotating-PDF-documents.
Size: 0 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

pankajr141/pdf2jpg
Utility to convert PDF into JPG files
Language: Java - Size: 4.22 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 22

DavidLMS/DescribePDF
A tool to convert PDF files to detailed Markdown descriptions using VLMs
Language: Python - Size: 1.2 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

PATELOM925/ChatPDF-AI
Language: Python - Size: 139 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

BobLd/PdfPigMLNetBlockClassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Language: C# - Size: 1.1 MB - Last synced at: 7 days ago - Pushed at: about 5 years ago - Stars: 28 - Forks: 6

naiveHobo/pdfviewer
PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.
Language: Python - Size: 152 KB - Last synced at: 11 days ago - Pushed at: almost 4 years ago - Stars: 83 - Forks: 27

abarker/pdfCropMargins
pdfCropMargins -- a program to crop the margins of PDF files
Language: Python - Size: 10 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 382 - Forks: 35

GowenGit/docnet
DocNET is as fast PDF editing and reading library for modern .NET applications
Language: C# - Size: 166 MB - Last synced at: 22 days ago - Pushed at: 12 months ago - Stars: 496 - Forks: 88

RobinMillford/Cortex-AI-Multi-Model-Insights-Hub
Cortex AI: Multi-Model Insights Hub is an advanced platform that leverages cutting-edge AI to empower your research, analysis, and data exploration. By integrating multiple Large Language Models (LLMs) with a sophisticated Retrieve-and-Generate (RAG) system
Language: Python - Size: 737 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 4 - Forks: 1

pdflexer/pdflexer
.net pdf parsing library
Language: C# - Size: 64.5 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 23 - Forks: 1

ufal/atrium-page-classification
Classification of historical page images using ViT - for ATRIUM project
Language: Python - Size: 370 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0

GURPREETKAURJETHRA/Multi-PDFs_ChatApp_AI-Agent
Meet MultiPDF 📚 Chat AI App! 🚀 Chat seamlessly with Multiple PDFs using Langchain, Google Gemini Pro & FAISS Vector DB with Seamless Streamlit Deployment. Get instant, accurate responses from Awesome Google Gemini OpenSource language Model. 📚💬 Transform your PDF experience now! 🔥✨
Language: Python - Size: 8.04 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 94 - Forks: 55

HarshishBedi/DocSmart
RAG agent based on Ollama and DeepSeek-R1
Language: Python - Size: 211 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

itsceddyy/Adobe-Acrobat-Reader
industry-leading-software-for-viewing,-printing,-and-annotating-PDF-documents.
Language: JavaScript - Size: 2.93 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

Pulkitpkb/Adobe-Acrobat-Reader
industry-leading-software-for-viewing,-printing,-and-annotating-PDF-documents.
Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Bigdaddykir0/ai-chat-app
A full-stack chat application with a React frontend and Python FastAPI backend, featuring real-time messaging and AI-powered responses.
Language: Python - Size: 14.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Yishak12/Adobe-Acrobat-Reader
industry-leading software for viewing, printing, and annotating PDF documents.
Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

berkbuyukates/Adobe-Acrobat-Reader
industry-leading software for viewing, printing, and annotating PDF documents.
Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

saenchai068/Adobe-Acrobat-Reader
industry-leading software for viewing, printing, and annotating PDF documents.
Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

haydar665/Adobe-Acrobat-Reader
industry-leading software for viewing, printing, and annotating PDF documents.
Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ranscloud/Adobe-Acrobat-Reader
industry-leading software for viewing, printing, and annotating PDF documents.
Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

CalebHendren/zipmerge
Language: HTML - Size: 610 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

hoehermann/pypdf_strreplace
Search and replace text in PDF files with PyPDF.
Language: Python - Size: 573 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 34 - Forks: 3

prashant-g0/pdf-management-tool-python
A Python-based PDF manager that handles tasks like PDF to DOCX conversion, PDF to image, and more. Simple, efficient, and easy to use.
Language: HTML - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

sfneal/pdfconduit
Prepare documents for distribution
Language: Python - Size: 228 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 26 - Forks: 1

Dtronix/PDFiumCore
.NET Standard P/Invoke bindings for PDFium.
Language: C# - Size: 60.3 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 161 - Forks: 23

vivekweb2013/pdf-utils
An android app to perform different operations on pdf files
Language: Java - Size: 261 KB - Last synced at: 29 days ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 1

maximum-software/pdf-forms-for-contact-form-7
Build Contact Form 7 forms from PDF forms. Get PDFs auto-filled and attached to email messages and/or website responses on form submission.
Language: PHP - Size: 2.49 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 11

azuregray/AnywherePrintMachine
A software platform for a futuristic Anywhere Print Machine idea. Print machines placed locally just like ATMs where users can access and get prints at any time with extended automated functionalities with a user-friendly process + real-time app cloud-based ecosystem.
Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

SiddhantSadangi/pdf-workdesk
A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.
Language: Python - Size: 151 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 55 - Forks: 12

carinewlimits/Adobe-Acrobat-Reader
industry-leading software for viewing, printing, and annotating PDF documents.
Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

bgorman87/PDF-Flow
PDF Report Processor
Language: Python - Size: 333 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

hellerbarde/stapler
A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk
Language: Python - Size: 146 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 288 - Forks: 53

izzaziii/invoice-processor
A Python-based tool that processes PDF invoices using Claude AI to extract structured data and store it in MongoDB.
Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Aspirant-ai/PDF-Tools-Hub
Your one-stop solution for all PDF operations
Language: HTML - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

pdf2htmlEX/pdf2htmlEX
Convert PDF to HTML without losing text or format.
Language: HTML - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 4,856 - Forks: 439

jmfeck/python-pdf-tools
Python PDF Tools is a Python-based collection of ready-to-use applications designed for various PDF manipulations. Each tool is set up as an independent app that can be triggered by running a batch file located in the root of its folder. This project is under active development.
Language: Python - Size: 50.9 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

Uni-Creator/RAG-MultiFile-QA
A RAG (Retrieval-Augmented Generation) AI chatbot that allows users to upload multiple document types (PDF, DOCX, TXT, CSV) and ask questions about the content. Built using LangChain, Hugging Face embeddings, and Streamlit, it enables efficient document search and question answering using vector-based retrieval. 🚀
Language: Python - Size: 132 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

CyberHorizon315/Adobe-Acrobat-Reader
industry-leading software for viewing, printing, and annotating PDF documents.
Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Mohith202/Ema-Chatbot
A public chatbot can use PDF from user or uses preloaded dataset and answer Query from user while displaying source.
Language: Python - Size: 41 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

eli64s/pdflex
CLI for merging PDF contexts.
Language: Python - Size: 465 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

vrouwsaxib/PDF-Ranger
Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

papercast-dev/papercast
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.
Language: Python - Size: 218 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 48 - Forks: 1

IBM/science-result-extractor 📦
Language: Java - Size: 120 MB - Last synced at: 5 days ago - Pushed at: almost 3 years ago - Stars: 91 - Forks: 17

mohiteamit/ai-pdf-summarizer
AI-Powered PDF Summarizer: Upload PDFs, extract insights, generate customizable summaries with GPT-4. Web app & API integration.
Language: Python - Size: 1.35 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

opendocument-app/pdf2htmlEX-Android
pdf2htmlEX library port for Android - Convert PDF to HTML without losing text or format
Language: Java - Size: 20.1 MB - Last synced at: about 2 hours ago - Pushed at: 6 months ago - Stars: 32 - Forks: 11

CalebHendren/PageInserter
This desktop application is meant to append an answer sheet for written/essay questions to scanforms for services such as ZipGrade.
Language: Python - Size: 13.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

arunpgcil/ghostscript-pdf-compress.wasm Fork of laurentmmeyer/ghostscript-pdf-compress.wasm
Compress PDF in the browser with ghostscript in WASM
Language: JavaScript - Size: 13.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

orchetect/PDFGadget
Batch PDF operations for Swift
Language: Swift - Size: 418 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 7 - Forks: 0

maximum-software/pdf-forms-for-wpforms
Build WPForms from PDF forms. Get PDFs filled automatically and attached to email messages and/or website responses on form submissions.
Language: PHP - Size: 1.94 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

enzovitale/EditPDF
Wrappers around the iText7 library to perform basic operations on PDF documents.
Language: C# - Size: 6.18 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lovasoa/pagelabels-py
Python library to manipulate PDF page labels
Language: Python - Size: 47.9 KB - Last synced at: 26 days ago - Pushed at: 9 months ago - Stars: 74 - Forks: 12

Thinqat1985731/Minimum-pdf-tools
Tools to add UI to pdf work
Language: Python - Size: 2.85 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

achronos0/pdful
PDF editor/updater library
Language: TypeScript - Size: 3.76 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

VrajVyas11/PDF-Manipulator
A comprehensive PDF tool that allows you to effortlessly edit, merge, split, compress, and convert PDFs. It supports adding pages, extracting images, and viewing PDFs directly within the app. With a user-friendly drag-and-drop interface, it’s fully responsive across all devices, streamlining document management for everyone.
Language: JavaScript - Size: 32.7 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 5 - Forks: 2

vordimous/vue-pdf-splitter
Building a PDF page splitter using vue-pdf and vueUse.
Language: Vue - Size: 8.71 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

vin0x/pdf-to-vehicle-data-ETL
This project extract data from a website (.pdf file) containing car data, manipulate data, store in a AWS RDS, create pipeline with Apache Airflow to automatically refresh and create a Power BI Dashboard.
Language: Jupyter Notebook - Size: 3.57 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

maximum-software/pdf-forms-for-woocommerce
Automatically fill PDF forms with WooCommerce orders and attach generated PDFs to email notifications and order downloads.
Language: PHP - Size: 1.72 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

zminken/pdfmerge
A command line utility coded in python to merge, split, and extract pages from multiple pdfs.
Language: Python - Size: 2.93 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Hareesh108/react-pdf-view-example-hub
📄 React PDF Viewer Example Hub: A collection of PDF handling implementations in React, showcasing various features like document previews, custom toolbars, text selection, and PDF generation.
Language: TypeScript - Size: 1.17 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

SubhangiSati/LangChat-Explorer
"LangChat Explorer: Your intuitive document companion. Effortlessly explore vast information with natural language conversations. Simplify queries, gain insights, and embark on a seamless journey of knowledge discovery. Unleash the power of language with LangChat Explorer."
Language: Python - Size: 471 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

jennis0/burdoc
Advanced PDF parsing for python
Language: HTML - Size: 18.7 MB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 3

BigDataIA-Spring2025-4/DAMG7245_Assignment01
A Streamlit-based app with a FastAPI backend for extracting structured data (text, images, tables) from websites and PDFs. Processed data is stored in AWS S3 and rendered in a markdown-standardized format. APIs are deployed on Google Cloud Run Service
Language: Jupyter Notebook - Size: 90.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

shihjen/PDF_Merger
A lightweight PDF merging application built with Python using PyPDF and Streamlit.
Language: Python - Size: 38.1 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ayoubelmhamdi/pdf_divider
Split PDF pages horizontally into two separate images utilizing Pillow image processing and Poppler-utils.
Language: Python - Size: 4.88 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

sailist/chatgpt-enhancement-extension
An all-in-one plugin to improve your ChatGPT experience!
Language: TypeScript - Size: 24.4 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 332 - Forks: 28

VerisimilitudeX/ocr_pdf2txt
Use Optical Character Recognition technology to convert scanned PDFs into TXT files locally.
Language: Python - Size: 525 KB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

LukasCY/extract_pdf2docx
This program can extract text from academic journal-style PDF files and save it as a DOCX file. It is capable of identifying and merging body paragraphs that span multiple pages, while separately extracting footnotes and converting them into endnotes for subsequent editing and use, such as full-text translation.
Language: Python - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

OnedocLabs/onedoc
The first developer-oriented document platform. Generate, host and track PDFs with a single API, beautifully.
Language: Python - Size: 214 KB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 69 - Forks: 2

ammirsm/automatic-pancake
Active learning agent-based-simulation for systematic reviews and other types of technology assisted review (TAR) which will include PDF documents and other meta-datas in itself and it's based on both fulltext-screening decisions and title-screening decisions.
Language: Python - Size: 1.82 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

yusefmahmoudd/PDF_Organizer
making a program to analyze sets of research papers organize them based on content and summarize their information.
Language: Python - Size: 27.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

atthharvva/PDF-Form-Reader
This Python script extracts information from PDF forms using OCR (Optical Character Recognition) and saves the extracted data into an Excel file. It is particularly designed for processing forms with checkboxes and textual fields. The script can handle variations in form structure and allows for easy customization to accommodate other PDF form type
Language: Python - Size: 4.53 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

harshith8854/PDFusion
Web app to rearrange, merge and manage PDFs
Language: HTML - Size: 1.28 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bayek10/smartcatalog
Transform large furniture catalog PDFs into a searchable database in minutes. Save hours of your time & thus save money
Language: Python - Size: 610 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

govindvarma1/certificate-automation
This project aims to simplify the process of certificate generation and management
Language: JavaScript - Size: 416 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 2

ptyadana/Python-Projects-Dojo
Collections of python projects including machine learning projects, image and pdf processing, password checkers, sending emails, sms, web scraping,flask web app,selenium automation testing,etc
Language: Jupyter Notebook - Size: 37.7 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 16

KaarLarax/Blank-Page-Remover-pdf
Blank Page Remover PDF
Language: Python - Size: 59.6 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

KalyanM45/DocGenius-Revolutionizing-PDFs-with-AI
This is a Python application that allows you to load a PDF and ask questions about it using natural language. The application uses a LLM to generate a response about your PDF. The LLM will not answer questions unrelated to the document.
Language: Python - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 50 - Forks: 6

pcdi/cambridge_core_downloader
Download and merge PDFs from Cambridge Core
Language: Python - Size: 187 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 2

sidphbot/Auto-Research
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
Language: Python - Size: 429 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 57 - Forks: 7
