Topic: "pdf-document-processor"
wmjordan/PDFPatcher
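PDFPatcher (PDF补丁丁): a PDF toolbox that can edit bookmarks, crop and rotate pages, remove restrictions, extract or merge documents, inspect document structure, extract images, convert pages to images, and more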
Language: C# - Size: 46.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9,957 - Forks: 1,311

pdf2htmlEX/pdf2htmlEX
Convert PDF to HTML without losing text or format.
Language: HTML - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 4,856 - Forks: 439

qpdf/qpdf
qpdf: A content-preserving PDF document transformer
Language: C++ - Size: 39.4 MB - Last synced at: 10 minutes ago - Pushed at: about 8 hours ago - Stars: 3,968 - Forks: 309

run-llama/llama_cloud_services
Knowledge Agents and Management in the Cloud
Language: Python - Size: 46.1 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 3,956 - Forks: 405

unidoc/unipdf
Golang PDF library for creating and processing PDF files (pure go)
Language: Go - Size: 124 MB - Last synced at: about 10 hours ago - Pushed at: about 10 hours ago - Stars: 2,788 - Forks: 265

UglyToad/PdfPig
Read and extract text and other content from PDFs in C# (port of PDFBox)
Language: C# - Size: 167 MB - Last synced at: about 20 hours ago - Pushed at: 16 days ago - Stars: 2,004 - Forks: 258

chinapandaman/PyPDFForm
:fire: The Python library for PDF forms.
Language: Python - Size: 89 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 549 - Forks: 34

GowenGit/docnet
DocNET is a fast PDF editing and reading library for modern .NET applications
Language: C# - Size: 166 MB - Last synced at: 21 days ago - Pushed at: 12 months ago - Stars: 496 - Forks: 88

abarker/pdfCropMargins
pdfCropMargins -- a program to crop the margins of PDF files
Language: Python - Size: 10 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 382 - Forks: 35

sailist/chatgpt-enhancement-extension
An all-in-one plugin to improve your ChatGPT experience!
Language: TypeScript - Size: 24.4 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 332 - Forks: 28

hellerbarde/stapler
A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk
Language: Python - Size: 146 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 288 - Forks: 53

michaelrsweet/pdfio
PDFio is a simple C library for reading and writing PDF files.
Language: C - Size: 8.74 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 246 - Forks: 53

Dtronix/PDFiumCore
.NET Standard P/Invoke bindings for PDFium.
Language: C# - Size: 60.3 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 161 - Forks: 23

houking-can/CCKS2019-Task5
CCKS 2019 Evaluation Task 5: information extraction from public company announcements, 3rd place
Language: Python - Size: 54.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 123 - Forks: 26

svenssonaxel/pdf-sign
A tool to sign PDF files. With Linux support.
Language: Python - Size: 403 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 122 - Forks: 3

GURPREETKAURJETHRA/Multi-PDFs_ChatApp_AI-Agent
Meet MultiPDF 📚 Chat AI App! 🚀 Chat seamlessly with Multiple PDFs using Langchain, Google Gemini Pro & FAISS Vector DB with Seamless Streamlit Deployment. Get instant, accurate responses from Awesome Google Gemini OpenSource language Model. 📚💬 Transform your PDF experience now! 🔥✨
Language: Python - Size: 8.04 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 94 - Forks: 55

IBM/science-result-extractor 📦
Language: Java - Size: 120 MB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 91 - Forks: 17

naiveHobo/pdfviewer
PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.
Language: Python - Size: 152 KB - Last synced at: 11 days ago - Pushed at: almost 4 years ago - Stars: 83 - Forks: 27

lovasoa/pagelabels-py
Python library to manipulate PDF page labels
Language: Python - Size: 47.9 KB - Last synced at: 25 days ago - Pushed at: 9 months ago - Stars: 74 - Forks: 12

OnedocLabs/onedoc
The first developer-oriented document platform. Generate, host and track PDFs with a single API, beautifully.
Language: Python - Size: 214 KB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 69 - Forks: 2

sidphbot/Auto-Research
Generate a custom, detailed survey paper with topic-clustered sections and proper citations from a single query in just under 30 minutes!
Language: Python - Size: 429 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 57 - Forks: 7

pankajr141/pdf2jpg
Utility to convert PDF into JPG files
Language: Java - Size: 4.22 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 22

SiddhantSadangi/pdf-workdesk
A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.
Language: Python - Size: 151 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 55 - Forks: 12

KalyanM45/DocGenius-Revolutionizing-PDFs-with-AI
This is a Python application that allows you to load a PDF and ask questions about it using natural language. The application uses an LLM to generate a response about your PDF. The LLM will not answer questions unrelated to the document.
Language: Python - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 50 - Forks: 6

StabRise/spark-pdf
PDF DataSource for Apache Spark; allows reading PDF files directly into a DataFrame and running OCR on them
Language: Scala - Size: 5.72 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 49 - Forks: 3

papercast-dev/papercast
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.
Language: Python - Size: 218 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 48 - Forks: 1

praj2408/Realtime-Document-Chat-System
In this project, we used Langchain to create a ChatGPT for your PDF using Streamlit. We built an application that allows you to ask questions about a PDF document and get answers directly from an LLM (Large Language Model), like OpenAI's ChatGPT.
Language: Jupyter Notebook - Size: 4.65 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 40 - Forks: 12

uroesch/pdftools
A collection of PDF command line tools and wrappers for Linux
Language: Shell - Size: 393 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 2

hoehermann/pypdf_strreplace
Search and replace text in PDF files with PyPDF.
Language: Python - Size: 573 KB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 34 - Forks: 3

opendocument-app/pdf2htmlEX-Android
pdf2htmlEX library port for Android - Convert PDF to HTML without losing text or format
Language: Java - Size: 20.1 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 32 - Forks: 11

taseikyo/backup-utils
:sparkles: A batch of useful code/scripts: run commands automatically, finish repetitive stupid operations, perform format conversions, etc.
Language: Python - Size: 3.07 MB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 32 - Forks: 15

BobLd/PdfPigMLNetBlockClassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Language: C# - Size: 1.1 MB - Last synced at: 6 days ago - Pushed at: about 5 years ago - Stars: 28 - Forks: 6

sfneal/pdfconduit
Prepare documents for distribution
Language: Python - Size: 228 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 26 - Forks: 1

datalogics/pdf-rest-api-samples
pdfRest API Toolkit is a REST API service for processing PDF documents, made by developers, for developers. Rapidly integrate PDF workflows with your existing projects and applications, simply and seamlessly. Get started for free in seconds.
Language: Java - Size: 13.7 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 26 - Forks: 10

ptyadana/Python-Projects-Dojo
Collection of Python projects including machine learning projects, image and PDF processing, password checkers, sending emails and SMS, web scraping, Flask web apps, Selenium automation testing, etc.
Language: Jupyter Notebook - Size: 37.7 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 16

pdflexer/pdflexer
.NET PDF parsing library
Language: C# - Size: 64.5 MB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 23 - Forks: 1

JustinTheWhale/PDF-Dark-Mode
Converts PDFs to have a grey background to be easier on the eyes
Language: Python - Size: 76.1 MB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 17 - Forks: 5

ksharindam/pdfcook
Prepress preparing tool and PDF editor
Language: C++ - Size: 124 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 6

eiceblue/Spire.PDF-for-Java
Spire.PDF for Java is a PDF component for reading, writing, printing, and converting PDF documents in Java applications without using Adobe Acrobat.
Size: 12.2 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 4

akoweb/tcpdf
Persian and Arabic fonts for TCPDF (PHP); Persian fonts for tcpdf
Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 8

simonwongwong/PDF_Merge_and_Edit
Python script to merge and edit sensitive PDF files you don't want to upload to random sites you find on Google
Language: Python - Size: 21.2 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 15 - Forks: 4

IBM/generate-insights-from-data-formats-with-watson 📦
How do we process data in different formats such as docx and pdf, and generate insights to be linked with structured data in a database? This pattern helps in establishing relations between structured and unstructured data to generate recommendations using Watson NLU and Watson Studio.
Language: Jupyter Notebook - Size: 1.06 MB - Last synced at: 4 days ago - Pushed at: almost 5 years ago - Stars: 14 - Forks: 14

Josee9988/Compress-PDFs
A Python CLI script to compress 📦 all the PDF files recursively in a directory using the iLovePDF technology 🥰
Language: Python - Size: 45.9 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 3

vivekweb2013/pdf-utils
An Android app to perform different operations on PDF files
Language: Java - Size: 261 KB - Last synced at: 29 days ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 1

maximum-software/pdf-forms-for-contact-form-7
Build Contact Form 7 forms from PDF forms. Get PDFs auto-filled and attached to email messages and/or website responses on form submission.
Language: PHP - Size: 2.49 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 11

StabRise/ScaleDP
ScaleDP is an Open-Source extension of Apache Spark for Document Processing
Language: Python - Size: 7.88 MB - Last synced at: about 3 hours ago - Pushed at: about 2 months ago - Stars: 11 - Forks: 0

Academic-Hammer/PDFConverter
Convert PDF to any format for easy analysis
Language: Python - Size: 152 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 3

jennis0/burdoc
Advanced PDF parsing for python
Language: HTML - Size: 18.7 MB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 3

PRITHIVSAKTHIUR/RAG-PDF-CHATBOT
(PDF) Information and Inference, Retrieval-Augmented Generation [ RAG ]
Language: Python - Size: 1.06 MB - Last synced at: 4 days ago - Pushed at: 11 months ago - Stars: 9 - Forks: 0

easonlai/chat_with_pdf_table
The contents of this repository showcase how to extract table data from a PDF file and preprocess it to facilitate word embedding. This preprocessing step enhances the readability of table data for language models and enables us to extract more contextual information from the tables.
Language: Jupyter Notebook - Size: 85.9 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 4

umer7/Python-for-PDF
Code used in my Medium Story https://medium.com/@umerfarooq_26378/python-for-pdf-ef0fac2808b0
Language: Jupyter Notebook - Size: 203 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 17

armiro/cv-data-extractor
Extract essential data (e.g. GPA, skills, education, age, ...) from PDF-formatted resume files (under development)
Language: Python - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 9 - Forks: 3

ammirsm/automatic-pancake
Active-learning, agent-based simulation for systematic reviews and other types of technology-assisted review (TAR). It includes PDF documents and other metadata, and is based on both full-text screening decisions and title screening decisions.
Language: Python - Size: 1.82 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

parthgupta1208/PDF2PPTGenerator
PDF2PPT Generator is a Python tool that creates Powerpoint presentations from PDF files by using smart summarization techniques assisted by GPT-3.5-Turbo
Language: Python - Size: 7.4 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 6

MBAigner/PDFContentConverter
A tool for converting PDF text as well as structural features into a pandas dataframe.
Language: Python - Size: 163 KB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 3

houking-can/PDFSDK
Python interface based on Foxit Quick PDF Library
Language: Python - Size: 8.27 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 2

Phreak87/LeptonicaSharp
Full featured wrapper for leptonica 1.77.0
Language: Visual Basic - Size: 182 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 5

orchetect/PDFGadget
Batch PDF operations for Swift
Language: Swift - Size: 418 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 7 - Forks: 0

yakshb/AI-DocumentQnA
Simple LLM-enabled document Q&A app built using Langchain and Streamlit
Language: Python - Size: 18.6 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 21

MoinDalvs/Resume_Screening_and_Parser
Business objective: the document classification solution should significantly reduce manual human effort in HRM. It should achieve a higher level of accuracy and automation with minimal human intervention. Sample data set details: resumes and financial documents.
Language: Jupyter Notebook - Size: 95.9 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

NotableAPP/Formal-stack-pdfs
Make PDFs from images and Markdown; more is coming...
Language: HTML - Size: 3.78 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

pdfix/pdfix_sdk_builds
PDFix SDK release builds
Size: 11.9 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 1

pcdi/cambridge_core_downloader
Download and merge PDFs from Cambridge Core
Language: Python - Size: 187 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 2

iSOLveIT/mkdocs-pdf-generate
An MkDocs plugin to generate individual PDF files from content pages.
Language: Python - Size: 11.3 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

bevanweiss/PdfEditor 📦
PDF Editor (remove JS, find/replace, redact) based on iTextSharp
Language: C# - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 1

zyingzhou/pdfCatalog
Build catalogs for pdf documents automatically.
Language: Python - Size: 66.9 MB - Last synced at: 29 days ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

NowshadRuhan/PDF-Maker-Android-Apps
A simple app for creating PDFs, for example memos. If you need any kind of PDF code or something similar, just contact me.
Language: Java - Size: 159 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 6 - Forks: 1

VrajVyas11/PDF-Manipulator
A comprehensive PDF tool that allows you to effortlessly edit, merge, split, compress, and convert PDFs. It supports adding pages, extracting images, and viewing PDFs directly within the app. With a user-friendly drag-and-drop interface, it’s fully responsive across all devices, streamlining document management for everyone.
Language: JavaScript - Size: 32.7 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 5 - Forks: 2

XieJiSS/pdf-tools
Useful PDF tools to work with PDF translation platforms.
Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 5 - Forks: 0

MengWoods/sign-pdf-with-transparent-background-signature
Sign PDF. Extract signature from a picture and sign the transparent-background signature to a PDF.
Language: Python - Size: 87.2 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 1

carvalhoviniciusluiz/edaily-backend
Edaily API server with configured JWT and GraphQL. :metal:
Language: JavaScript - Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

masesgroup/NetPDF
.NET suite for PDFBox™
Language: C# - Size: 13.5 MB - Last synced at: about 8 hours ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

RobinMillford/Cortex-AI-Multi-Model-Insights-Hub
Cortex AI: Multi-Model Insights Hub is an advanced platform that leverages cutting-edge AI to empower your research, analysis, and data exploration. By integrating multiple Large Language Models (LLMs) with a sophisticated Retrieve-and-Generate (RAG) system
Language: Python - Size: 737 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 4 - Forks: 1

SubhangiSati/LangChat-Explorer
"LangChat Explorer: Your intuitive document companion. Effortlessly explore vast information with natural language conversations. Simplify queries, gain insights, and embark on a seamless journey of knowledge discovery. Unleash the power of language with LangChat Explorer."
Language: Python - Size: 471 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

ysdede/pdf2pacs
Converts PDF medical reports to DICOM. It automatically scrapes and adds patient details to the freshly produced DICOM file.
Language: Python - Size: 19.9 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

ynynl/pdf-merger
Merge, sort, delete, and split PDF files locally in your browser. Inspired by macOS Preview.
Language: JavaScript - Size: 6.47 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

mavaddat/jpdfbookmarks
Create and edit bookmarks on existing PDF files.
Language: Java - Size: 12 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

BeHappy0o0o0o0/pdf_information_extraction
A Python 3 script for extracting table information from non-scanned PDFs
Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 4

bgorman87/PDF-Flow
PDF Report Processor
Language: Python - Size: 333 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

eli64s/pdflex
CLI for merging PDF contexts.
Language: Python - Size: 465 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

maximum-software/pdf-forms-for-wpforms
Build WPForms from PDF forms. Get PDFs filled automatically and attached to email messages and/or website responses on form submissions.
Language: PHP - Size: 1.94 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

MarcBuch/TR-PDF-Parser
Parses invoice PDF files from the German brokerage Trade Republic
Language: Python - Size: 34.2 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

HamzaZaidiX/Pdf-Merger-Tool
PDF merger tool built with Node.js
Language: HTML - Size: 210 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

benckx/optimize-pdf-ereaders
Optimize scanned PDFs for small ebook readers using OCR
Language: Java - Size: 42.8 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

Zeeshanahmad4/NLP-Pdf-Minning-Extracting-text-from-pdf
NLP PDF mining: extracting text from PDFs
Language: Python - Size: 2.86 MB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

simonkeng/pdf_parser
Textual & numeric data extraction with Python using textract, easily shareable with Docker.
Language: C - Size: 15.6 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

anandaworldwide/ananda-library-chatbot Fork of mayooear/ai-pdf-chatbot-langchain
A ChatGPT chatbot app for multiple Large PDF files, audio files, and YouTube videos. Optionally generate the PDF fileset from a Wordpress database. Transcribe mp3 files en masse. Download YouTube videos en masse and transcribe their audio. Allow users to share the best answers they get with each other through a social sharing interface.
Language: TypeScript - Size: 77.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

ufal/atrium-page-classification
Classification of historical page images using ViT - for ATRIUM project
Language: Python - Size: 370 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0

prashant-g0/pdf-management-tool-python
A Python-based PDF manager that handles tasks like PDF to DOCX conversion, PDF to image, and more. Simple, efficient, and easy to use.
Language: HTML - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Uni-Creator/RAG-MultiFile-QA
A RAG (Retrieval-Augmented Generation) AI chatbot that allows users to upload multiple document types (PDF, DOCX, TXT, CSV) and ask questions about the content. Built using LangChain, Hugging Face embeddings, and Streamlit, it enables efficient document search and question answering using vector-based retrieval. 🚀
Language: Python - Size: 132 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

jmw8033/Pewter
Emailed invoice file handler
Language: Python - Size: 146 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

rvbcldud/focus-study
A collection of FOCUS Bible studies in booklet format.
Language: Shell - Size: 4.58 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

life888888/JPdfBookmarks
jpdfbookmarks: fixes JPdfBookmarks GUI mode showing tofu characters (□) when opening a PDF whose bookmarks contain CJK (Chinese, Japanese, Korean) characters; adds native installers (msi, deb, rpm)
Language: Java - Size: 14 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

thomasvanholder/browserless
A Ruby wrapper for the Browserless PDF API with support for modern CSS such as TailwindCSS
Language: Ruby - Size: 23.4 KB - Last synced at: 15 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

EzzatEsam/Zc-Transcript-Analyzer
A simple GPA calculator with a minimalistic GUI, written in Go. Can generate transcript data automatically from the PDF generated by the Zewail City website
Language: Go - Size: 338 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

jlomako/pdfscraper
GitHub Action that extracts a table from a PDF and saves the data to CSV
Language: Python - Size: 11.5 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

hussein-esmail7/course-extractor
Sorts course files into folders based on the text in that file
Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

tagtog/demo-webhooks
Quick example to connect a spaCy model to tagtog using webhooks 🤖
Language: Python - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

zombie110year/pdfwork
Some tools for processing PDFs
Language: Python - Size: 195 KB - Last synced at: 13 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

jethar/mutualfund-stmts-etl
Convert investment statements (like mutual fund) for India to interpretable formats
Language: Python - Size: 67.4 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 5
