An open API service providing repository metadata for many open source software ecosystems.

Topic: "pdf-document-processor"

wmjordan/PDFPatcher

PDF补丁丁——PDF工具箱,可以编辑书签、剪裁旋转页面、解除限制、提取或合并文档,探查文档结构,提取图片、转成图片等等

Language: C# - Size: 46.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9,957 - Forks: 1,311

pdf2htmlEX/pdf2htmlEX

Convert PDF to HTML without losing text or format.

Language: HTML - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 4,856 - Forks: 439

qpdf/qpdf

qpdf: A content-preserving PDF document transformer

Language: C++ - Size: 39.4 MB - Last synced at: 10 minutes ago - Pushed at: about 8 hours ago - Stars: 3,968 - Forks: 309

run-llama/llama_cloud_services

Knowledge Agents and Management in the Cloud

Language: Python - Size: 46.1 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 3,956 - Forks: 405

unidoc/unipdf

Golang PDF library for creating and processing PDF files (pure go)

Language: Go - Size: 124 MB - Last synced at: about 10 hours ago - Pushed at: about 10 hours ago - Stars: 2,788 - Forks: 265

UglyToad/PdfPig

Read and extract text and other content from PDFs in C# (port of PDFBox)

Language: C# - Size: 167 MB - Last synced at: about 20 hours ago - Pushed at: 16 days ago - Stars: 2,004 - Forks: 258

chinapandaman/PyPDFForm

:fire: The Python library for PDF forms.

Language: Python - Size: 89 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 549 - Forks: 34

GowenGit/docnet

DocNET is as fast PDF editing and reading library for modern .NET applications

Language: C# - Size: 166 MB - Last synced at: 21 days ago - Pushed at: 12 months ago - Stars: 496 - Forks: 88

abarker/pdfCropMargins

pdfCropMargins -- a program to crop the margins of PDF files

Language: Python - Size: 10 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 382 - Forks: 35

sailist/chatgpt-enhancement-extension

An all-in-one plugin to improve your ChatGPT experience!

Language: TypeScript - Size: 24.4 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 332 - Forks: 28

hellerbarde/stapler

A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk

Language: Python - Size: 146 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 288 - Forks: 53

michaelrsweet/pdfio

PDFio is a simple C library for reading and writing PDF files.

Language: C - Size: 8.74 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 246 - Forks: 53

Dtronix/PDFiumCore

.NET Standard P/Invoke bindings for PDFium.

Language: C# - Size: 60.3 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 161 - Forks: 23

houking-can/CCKS2019-Task5

CCKS2019评测任务五-公众公司公告信息抽取,第3名

Language: Python - Size: 54.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 123 - Forks: 26

svenssonaxel/pdf-sign

A tool to sign PDF files. With Linux support.

Language: Python - Size: 403 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 122 - Forks: 3

GURPREETKAURJETHRA/Multi-PDFs_ChatApp_AI-Agent

Meet MultiPDF 📚 Chat AI App! 🚀 Chat seamlessly with Multiple PDFs using Langchain, Google Gemini Pro & FAISS Vector DB with Seamless Streamlit Deployment. Get instant, accurate responses from Awesome Google Gemini OpenSource language Model. 📚💬 Transform your PDF experience now! 🔥✨

Language: Python - Size: 8.04 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 94 - Forks: 55

IBM/science-result-extractor 📦

Language: Java - Size: 120 MB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 91 - Forks: 17

naiveHobo/pdfviewer

PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.

Language: Python - Size: 152 KB - Last synced at: 11 days ago - Pushed at: almost 4 years ago - Stars: 83 - Forks: 27

lovasoa/pagelabels-py

Python library to manipulate PDF page labels

Language: Python - Size: 47.9 KB - Last synced at: 25 days ago - Pushed at: 9 months ago - Stars: 74 - Forks: 12

OnedocLabs/onedoc

The first developer-oriented document platform. Generate, host and track PDFs with a single API, beautifully.

Language: Python - Size: 214 KB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 69 - Forks: 2

sidphbot/Auto-Research

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

Language: Python - Size: 429 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 57 - Forks: 7

pankajr141/pdf2jpg

Utility to convert PDF into JPG files

Language: Java - Size: 4.22 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 22

SiddhantSadangi/pdf-workdesk

A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.

Language: Python - Size: 151 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 55 - Forks: 12

KalyanM45/DocGenius-Revolutionizing-PDFs-with-AI

This is a Python application that allows you to load a PDF and ask questions about it using natural language. The application uses a LLM to generate a response about your PDF. The LLM will not answer questions unrelated to the document.

Language: Python - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 50 - Forks: 6

StabRise/spark-pdf

PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it

Language: Scala - Size: 5.72 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 49 - Forks: 3

papercast-dev/papercast

A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.

Language: Python - Size: 218 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 48 - Forks: 1

praj2408/Realtime-Document-Chat-System

In this project, we used Langchain to create a ChatGPT for your PDF using Streamlit. We built an application that allows you to ask questions about a PDF document and get answers directly from an LLM (Large Language Model), like OpenAI's ChatGPT.

Language: Jupyter Notebook - Size: 4.65 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 40 - Forks: 12

uroesch/pdftools

A collection of PDF command line tools and wrappers for Linux

Language: Shell - Size: 393 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 37 - Forks: 2

hoehermann/pypdf_strreplace

Search and replace text in PDF files with PyPDF.

Language: Python - Size: 573 KB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 34 - Forks: 3

opendocument-app/pdf2htmlEX-Android

pdf2htmlEX library port for Android - Convert PDF to HTML without losing text or format

Language: Java - Size: 20.1 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 32 - Forks: 11

taseikyo/backup-utils

:sparkles: A batch of useful code/scripts: run commands automatically, finish repetitive stupid operations, perform format conversions, etc.

Language: Python - Size: 3.07 MB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 32 - Forks: 15

BobLd/PdfPigMLNetBlockClassifier

Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.

Language: C# - Size: 1.1 MB - Last synced at: 6 days ago - Pushed at: about 5 years ago - Stars: 28 - Forks: 6

sfneal/pdfconduit

Prepare documents for distribution

Language: Python - Size: 228 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 26 - Forks: 1

datalogics/pdf-rest-api-samples

pdfRest API Toolkit is a REST API service for processing PDF documents, made by developers, for developers. Rapidly integrate PDF workflows with your existing projects and applications, simply and seamlessly. Get started for free in seconds.

Language: Java - Size: 13.7 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 26 - Forks: 10

ptyadana/Python-Projects-Dojo

Collections of python projects including machine learning projects, image and pdf processing, password checkers, sending emails, sms, web scraping,flask web app,selenium automation testing,etc

Language: Jupyter Notebook - Size: 37.7 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 16

pdflexer/pdflexer

.net pdf parsing library

Language: C# - Size: 64.5 MB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 23 - Forks: 1

JustinTheWhale/PDF-Dark-Mode

Converts PDF's to have a grey background to be easier on the eyes

Language: Python - Size: 76.1 MB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 17 - Forks: 5

ksharindam/pdfcook

Prepress preparing tool and PDF editor

Language: C++ - Size: 124 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 6

eiceblue/Spire.PDF-for-Java

Spire.PDF for Java is a PDF component that enables to read, write, print and convert PDF documents in Java applications without using Adobe Acrobat.

Size: 12.2 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 4

akoweb/tcpdf

persian and arabic fonts for TCPDF - PHP -فونت فارسی برای tcpdf

Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 8

simonwongwong/PDF_Merge_and_Edit

Python script to merge and edit sensitive PDF files you don't want to upload to random sites you find on Google

Language: Python - Size: 21.2 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 15 - Forks: 4

IBM/generate-insights-from-data-formats-with-watson 📦

How do we process data in different formats like docx, pdf etc and generate insights to be linked with structured data in database?This pattern helps in establishing relations between structured & unstructured data to generate recommendations using Watson NLU & Watson Studio.

Language: Jupyter Notebook - Size: 1.06 MB - Last synced at: 4 days ago - Pushed at: almost 5 years ago - Stars: 14 - Forks: 14

Josee9988/Compress-PDFs

A python CLI script to 𝗰𝗼𝗺𝗽𝗿𝗲𝘀𝘀 📦 all the 𝗣𝗗𝗙 files 𝗿𝗲𝗰𝘂𝗿𝘀𝗶𝘃𝗲𝗹𝘆 in a directory using the iLovePDF technology 🥰

Language: Python - Size: 45.9 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 3

vivekweb2013/pdf-utils

An android app to perform different operations on pdf files

Language: Java - Size: 261 KB - Last synced at: 29 days ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 1

maximum-software/pdf-forms-for-contact-form-7

Build Contact Form 7 forms from PDF forms. Get PDFs auto-filled and attached to email messages and/or website responses on form submission.

Language: PHP - Size: 2.49 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 11

StabRise/ScaleDP

ScaleDP is an Open-Source extension of Apache Spark for Document Processing

Language: Python - Size: 7.88 MB - Last synced at: about 3 hours ago - Pushed at: about 2 months ago - Stars: 11 - Forks: 0

Academic-Hammer/PDFConverter

Converting pdf to any format for easily analyzing

Language: Python - Size: 152 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 3

jennis0/burdoc

Advanced PDF parsing for python

Language: HTML - Size: 18.7 MB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 3

PRITHIVSAKTHIUR/RAG-PDF-CHATBOT

(PDF) Information and Inference, Retrieval-Augmented Generation [ RAG ]

Language: Python - Size: 1.06 MB - Last synced at: 4 days ago - Pushed at: 11 months ago - Stars: 9 - Forks: 0

easonlai/chat_with_pdf_table

The contents of this repository showcase how to extract table data from a PDF file and preprocess it to facilitate word embedding. This preprocessing step enhances the readability of table data for language models and enables us to extract more contextual information from the tables.

Language: Jupyter Notebook - Size: 85.9 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 4

umer7/Python-for-PDF

Code used in my Medium Story https://medium.com/@umerfarooq_26378/python-for-pdf-ef0fac2808b0

Language: Jupyter Notebook - Size: 203 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 17

armiro/cv-data-extractor

Extract essential data (e.g. GPA, skills, education, age, ...) from PDF-formatted working Resume files (under develop)

Language: Python - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 9 - Forks: 3

ammirsm/automatic-pancake

Active learning agent-based-simulation for systematic reviews and other types of technology assisted review (TAR) which will include PDF documents and other meta-datas in itself and it's based on both fulltext-screening decisions and title-screening decisions.

Language: Python - Size: 1.82 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

parthgupta1208/PDF2PPTGenerator

PDF2PPT Generator is a Python tool that creates Powerpoint presentations from PDF files by using smart summarization techniques assisted by GPT-3.5-Turbo

Language: Python - Size: 7.4 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 6

MBAigner/PDFContentConverter

A tool for converting PDF text as well as structural features into a pandas dataframe.

Language: Python - Size: 163 KB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 3

houking-can/PDFSDK

Based on Foxit Quick PDF Library,python interface

Language: Python - Size: 8.27 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 2

Phreak87/LeptonicaSharp

Full featured wrapper for leptonica 1.77.0

Language: Visual Basic - Size: 182 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 5

orchetect/PDFGadget

Batch PDF operations for Swift

Language: Swift - Size: 418 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 7 - Forks: 0

yakshb/AI-DocumentQnA

Simple LLM-enabled document Q&A app built using Langchain and Streamlit

Language: Python - Size: 18.6 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 21

MoinDalvs/Resume_Screening_and_Parser

Business objective- The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention Sample Data Set Details: Resumes and financial documents

Language: Jupyter Notebook - Size: 95.9 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

NotableAPP/Formal-stack-pdfs

Make pdf from image , markdown and more is coming...

Language: HTML - Size: 3.78 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

pdfix/pdfix_sdk_builds

PDFix SDK release builds

Size: 11.9 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 1

pcdi/cambridge_core_downloader

Download and merge PDFs from Cambridge Core

Language: Python - Size: 187 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 2

iSOLveIT/mkdocs-pdf-generate

An MkDocs plugin to generate individual PDF files from content pages.

Language: Python - Size: 11.3 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

bevanweiss/PdfEditor 📦

PDF Editor (remove JS, find/replace, redact) based on iTextSharp

Language: C# - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 1

zyingzhou/pdfCatalog

Build catalogs for pdf documents automatically.

Language: Python - Size: 66.9 MB - Last synced at: 29 days ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

NowshadRuhan/PDF-Maker-Android-Apps

Its a simple PDF apps.Which can create PDF. Like you can create memo. This apps can help you to create PDF like this time. If you need and kind of PDF code or same things alse just contact with me.

Language: Java - Size: 159 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 6 - Forks: 1

VrajVyas11/PDF-Manipulator

A comprehensive PDF tool that allows you to effortlessly edit, merge, split, compress, and convert PDFs. It supports adding pages, extracting images, and viewing PDFs directly within the app. With a user-friendly drag-and-drop interface, it’s fully responsive across all devices, streamlining document management for everyone.

Language: JavaScript - Size: 32.7 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 5 - Forks: 2

XieJiSS/pdf-tools

Useful PDF tools to work with PDF translation platforms.

Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 5 - Forks: 0

MengWoods/sign-pdf-with-transparent-background-signature

Sign PDF. Extract signature from a picture and sign the transparent-background signature to a PDF.

Language: Python - Size: 87.2 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 1

carvalhoviniciusluiz/edaily-backend

Edaily API server with configured JWT and GraphQL. :metal:

Language: JavaScript - Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

masesgroup/NetPDF

.NET suite for PDFBox™

Language: C# - Size: 13.5 MB - Last synced at: about 8 hours ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

RobinMillford/Cortex-AI-Multi-Model-Insights-Hub

Cortex AI: Multi-Model Insights Hub is an advanced platform that leverages cutting-edge AI to empower your research, analysis, and data exploration. By integrating multiple Large Language Models (LLMs) with a sophisticated Retrieve-and-Generate (RAG) system

Language: Python - Size: 737 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 4 - Forks: 1

SubhangiSati/LangChat-Explorer

"LangChat Explorer: Your intuitive document companion. Effortlessly explore vast information with natural language conversations. Simplify queries, gain insights, and embark on a seamless journey of knowledge discovery. Unleash the power of language with LangChat Explorer."

Language: Python - Size: 471 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

ysdede/pdf2pacs

Converts pdf medical reports to dicom. It automatically scrapes and adds patient details to the freshly produced dicom file.

Language: Python - Size: 19.9 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

ynynl/pdf-merger

Merge, sort, delete, split pdf files on your local browser. Inspired by MacOS preview.

Language: JavaScript - Size: 6.47 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

mavaddat/jpdfbookmarks

Create and edit bookmarks on existing PDF files.

Language: Java - Size: 12 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

BeHappy0o0o0o0/pdf_information_extraction

提取非扫描版pdf表格信息的py3脚本

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 4

bgorman87/PDF-Flow

PDF Report Processor

Language: Python - Size: 333 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

eli64s/pdflex

CLI for merging PDF contexts.

Language: Python - Size: 465 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

maximum-software/pdf-forms-for-wpforms

Build WPForms from PDF forms. Get PDFs filled automatically and attached to email messages and/or website responses on form submissions.

Language: PHP - Size: 1.94 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

MarcBuch/TR-PDF-Parser

Parses invoice PDF files from the german brokerage Trade Republic

Language: Python - Size: 34.2 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

HamzaZaidiX/Pdf-Merger-Tool

Pdf Merger Tool By Node JS

Language: HTML - Size: 210 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

benckx/optimize-pdf-ereaders

Optimize scanned PDFs for small ebook readers using OCR

Language: Java - Size: 42.8 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

Zeeshanahmad4/NLP-Pdf-Minning-Extracting-text-from-pdf

NLP Pdf Minning Extracting text from pdf

Language: Python - Size: 2.86 MB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

simonkeng/pdf_parser

Textual & numeric data extraction with Python using textract, easily shareable with Docker.

Language: C - Size: 15.6 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

anandaworldwide/ananda-library-chatbot Fork of mayooear/ai-pdf-chatbot-langchain

A ChatGPT chatbot app for multiple Large PDF files, audio files, and YouTube videos. Optionally generate the PDF fileset from a Wordpress database. Transcribe mp3 files en masse. Download YouTube videos en masse and transcribe their audio. Allow users to share the best answers they get with each other through a social sharing interface.

Language: TypeScript - Size: 77.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

ufal/atrium-page-classification

Classification of historical page images using ViT - for ATRIUM project

Language: Python - Size: 370 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0

prashant-g0/pdf-management-tool-python

A Python-based PDF manager that handles tasks like PDF to DOCX conversion, PDF to image, and more. Simple, efficient, and easy to use.

Language: HTML - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Uni-Creator/RAG-MultiFile-QA

A RAG (Retrieval-Augmented Generation) AI chatbot that allows users to upload multiple document types (PDF, DOCX, TXT, CSV) and ask questions about the content. Built using LangChain, Hugging Face embeddings, and Streamlit, it enables efficient document search and question answering using vector-based retrieval. 🚀

Language: Python - Size: 132 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

jmw8033/Pewter

Emailed invoice file handler

Language: Python - Size: 146 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

rvbcldud/focus-study

A collection of FOCUS Bible studies in booklet format.

Language: Shell - Size: 4.58 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

life888888/JPdfBookmarks

jpdfbookmarks - fix JPdfBookmarks GUI mode open a pdf have bookmarks include CJK (Chinese , Japanese , Korean ) characters will show like tofu char (□) , add native installer ( msi , deb, rpm)

Language: Java - Size: 14 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

thomasvanholder/browserless

A Ruby wrapper for the Browserless PDF API with support for modern CSS such as TailwindCSS

Language: Ruby - Size: 23.4 KB - Last synced at: 15 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

EzzatEsam/Zc-Transcript-Analyzer

A simple gpa calculator with minimalistic gui written in golang. Can generate transcript data automatically from the pdf generated by Zewail city website

Language: Go - Size: 338 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

jlomako/pdfscraper

GH action that extracts table from pdf and saves data to csv

Language: Python - Size: 11.5 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

hussein-esmail7/course-extractor

Sorts course files into folders based on the text in that file

Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

tagtog/demo-webhooks

Quick example to connect a spaCy model to tagtog using webhooks 🤖

Language: Python - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

zombie110year/pdfwork

处理 PDF 的一些工具

Language: Python - Size: 195 KB - Last synced at: 13 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

jethar/mutualfund-stmts-etl

Convert investment statements (like mutual fund) for India to interpretable formats

Language: Python - Size: 67.4 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 5