An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdfparser

BobLd/tabula-sharp

Extract tables from PDF files (port of tabula-java)

Language: C# - Size: 9.33 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 175 - Forks: 27

ashutoshvarma/pyxpdf

Fast and memory-efficient Python PDF Parser based on xpdf sources

Language: Cython - Size: 12.2 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 17

lazyFrogLOL/llmdocparser

A package for parsing PDFs and analyzing their content using LLMs.

Language: Python - Size: 1.21 MB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 268 - Forks: 8

naivefeeling/naivepdf

Yet another PDF text and table extractor

Language: Python - Size: 8.87 MB - Last synced at: 24 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

rayeesrather99/Notomatic

An AI-driven web app that generates structured notes from uploaded syllabi using OpenAI's API. Built with React, Node.js, Express.js, and MongoDB, it offers customizable note formats, downloads, and user notifications. Future updates include collaborative notes and LMS integration.

Language: JavaScript - Size: 1.64 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ethannschwartz/gpt-api

Node.js implementation of OpenAI's GPT API.

Language: JavaScript - Size: 3.29 MB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

AbdulRehmanRattu/Resume-Scoring-Assistant-Using-GPT-3.5-and-Tkinter

A Tkinter-based GUI that uses OpenAI's GPT-3.5-turbo to score resumes based on job descriptions. Users can upload PDF or DOCX files, and the application provides a relevance score, enhancing the job application process with AI assistance.

Language: Python - Size: 17.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

BobLd/camelot-sharp

A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).

Language: C# - Size: 3.51 MB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 31 - Forks: 5

yvnggodemis/pdf-parse

PDF Parser built in Rust

Language: Rust - Size: 146 KB - Last synced at: 10 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Erdos1729/webscrapping-identify-download-classify-published-pdfs-from-multiple-urls

This repository helps you scrape data from multiple websites. It identifies, downloads, and classifies the latest PDF files published on a website according to the user's requirements. It can be used to automate various operations involved in market research.

Language: Python - Size: 40 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

parzibyte/extraer-texto-imagenes-pdf-php

Examples of using PdfParser to extract text and images from a PDF document with PHP

Language: PHP - Size: 216 KB - Last synced at: 18 days ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 6

l-4-l/far2l (fork of elfmz/far2l)

Linux port of FAR v2: Multiarc PDF support

Language: C - Size: 67.6 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

JHW5981/split_pdfs

Repository for PDF split

Language: Python - Size: 67.4 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

jsmatias/aiod-paper-metadata-extractor

A Python service to retrieve metadata and extract keywords from scientific papers

Language: Jupyter Notebook - Size: 2.82 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

aishwarya-art/Pdf-to-text-extract

Sample code for PDF-to-text extraction using the PDF Parser library in CodeIgniter 3

Language: PHP - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vpec/lyrapdf

LyraPDF: convert a PDF to JSON or Markdown

Language: Python - Size: 229 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

adilsachwani/ExcelPdfParsing

Excel parsing in Android using Apache POI Library and PDF parsing in Android using iText Library.

Language: Java - Size: 9.61 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0