GitHub topics: pdfs
tabulapdf/tabula-java
Extract tables from PDF files
Language: Java - Size: 9.78 MB - Last synced at: about 18 hours ago - Pushed at: 2 months ago - Stars: 1,930 - Forks: 441

harishdeivanayagam/rowfill
Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers
Language: TypeScript - Size: 1.2 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 277 - Forks: 14

biglocalnews/usc-crime-reports-scraper
A GitHub Action workflow for automating the collection of crime and fire logs posted by the University of Southern California's Department of Public Safety.
Language: Jupyter Notebook - Size: 250 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 1

michaelcchu/music
music scores
Language: LilyPond - Size: 12.3 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

timvink/mkdocs-print-site-plugin
MkDocs Plugin that adds an additional page that combines all pages, allowing easy exports to PDF and standalone HTML.
Language: Python - Size: 2.28 MB - Last synced at: 7 days ago - Pushed at: 22 days ago - Stars: 164 - Forks: 25

cezarysanecki/presentations
Size: 44.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

trimstray/technical-whitepapers
Collection of IT whitepapers, presentations, pdfs; hacking, web app security, db, reverse engineering and more; EN/PL.
Size: 268 MB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 498 - Forks: 101

BobLd/tabula-sharp
Extract tables from PDF files (port of tabula-java)
Language: C# - Size: 9.33 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 178 - Forks: 27

akshaybadola/ref-man
Emacs plugin to manage bibliography having tight integration with org-mode and eww.
Language: Emacs Lisp - Size: 2.02 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

roopsagar-k/QuestionpaperHub
Language: TypeScript - Size: 3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

dave-007/presentations
Repository for source files in my past and upcoming presentations. Powershell, SQL, Azure or AWS command line script files, code for examples and demos, supporting slides as PDFs.
Language: PowerShell - Size: 7.07 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

pdf-association/pdf-corpora
An index of PDF-centric corpora
Size: 241 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 127 - Forks: 9

lzkelley/kalepy
Kernel Density Estimation and (re)sampling
Language: Python - Size: 99.6 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 61 - Forks: 12

hbiede/Voter-Tokens
A Ruby tool to generate tokens for voters based on delegate counts for online elections and then validate those tokens when counting votes
Language: Ruby - Size: 14.2 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

benweston/course-certificates
Serves course certificate PDFs to the benweston repo.
Language: HCL - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Divinemonk/notes-for-hackers
Study material (pdfs, notes, free course download links etc) for HACKERS
Size: 530 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 1

gabrielzschmitz/research
A repository showcasing my research papers, summaries, and bibliographic resources with organized access and reference materials.
Language: TeX - Size: 8.5 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

apurvmishra99/pdf-to-scan
Make your PDFs look like they were scanned
Language: Python - Size: 9.77 KB - Last synced at: 25 days ago - Pushed at: about 5 years ago - Stars: 85 - Forks: 7

Kavex/PDF-Combine
Allows you to combine multi pdfs into into one pdf. Add, Rearrange, or Delete Pages.
Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
Language: Python - Size: 11.3 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 128 - Forks: 65

cszach/awesome-linear-algebra
Resources for linear algebra students.
Language: Ruby - Size: 19.5 KB - Last synced at: 20 days ago - Pushed at: almost 5 years ago - Stars: 9 - Forks: 1

The-Swarm-Corporation/doc-master
A powerful, lightweight Python library for automated file reading and content extraction. Doc Master simplifies the process of reading various file formats into string representations, making it perfect for data processing, content analysis, and document management systems.
Language: Python - Size: 2.33 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 0

HajerHammami/Scan-Tailor-v0.9.11.1-2012-
Size: 3.91 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

codingforentrepreneurs/render-to-pdf
Create PDFs using Templates with Django just like you do with views. View PDFs inline or Force Download.
Language: Python - Size: 6.84 KB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

Visgean/pdfs-rename
Bulk rename PDFs based on metadata or title from the first page
Language: Python - Size: 6.84 KB - Last synced at: 12 days ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 1

Ansh420/FastAPI
API links for the text Extract from Url's and pdf's and finding similar words using cosine.
Language: Python - Size: 390 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

txuswashere/SNES-manuals
Collection of All US SNES Manuals and All PAL Exclusive manuals.
Size: 3.48 GB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

txuswashere/NES-manuals
Manuals for the games released on the NES (Nintendo Entertainment System)
Size: 911 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

AnuragAnalog/365-data-science
A Repository which contains lecture notes, exercise, solutions
Language: TSQL - Size: 39.9 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

BobLd/camelot-sharp
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).
Language: C# - Size: 3.51 MB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 31 - Forks: 5

AmphibiaWeb/aw-assets
All downloadable material from AmphibiaWeb is organized here by website section
Size: 336 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

City-Bureau/aldermanic-menu-money
Extract Chicago aldermanic menu money line items from budget PDFs
Language: Python - Size: 394 KB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

cooljeanius/legislation
drafts of LSRs I intend to file, am filing, or have filed as a legislator
Language: HTML - Size: 77.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 0

Manoj-2702/OmniQuest
Building RAG With OpenAI GPT-4o(omni) Model Using Objectbox Vector Database
Language: Python - Size: 6.84 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

AmmarYasserAllaithy/Mark-as-Completed
A tool helps you in Marking Movies, Series episodes, Course videos, Lectures, Records, PDFs as Completed, to facilitate their manipulating, and save your time and effort. Don't memorize or move them, life is easier. Just complete and let us mark ✔
Language: Java - Size: 916 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Erdos1729/webscrapping-identify-download-classify-published-pdfs-from-multiple-urls
This repository will assist you in scrapping data from multiple websites. It will identify, download and classify the latest pdf files published on a website as per the users requirement. This can be used for automating various operations involved in market research.
Language: Python - Size: 40 KB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

altafshaikh/TeachmeBroStaticFiles
Repo for storing Static Files
Language: CSS - Size: 23.3 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

txuswashere/Yo-Tenia-Un-Juego
https://www.yoteniaunjuego.com/
Size: 185 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

torokernel/papers
This repo contains papers and presentations about toro
Size: 52.9 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

iSingh-11/QuickChat
QuickChat is a web app where a user can login and chat with family and friends. Group chats has admin controls features. A user can add/remove contacts, can create new groups or can join existing groups. A chat thread supports sending texts, images, videos, pdfs and invitation links of groups, each with a small preview before sending.
Language: JavaScript - Size: 391 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

pdf-association/safedocs
Artifacts from the DARPA-funded SafeDocs research program
Language: Python - Size: 1.91 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 2

jakshin/manx
A command-line utility which opens man pages in various convenient ways on macOS
Language: Shell - Size: 438 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

hitthecodelabs/PDFs_TextReplacement
A Python utility for replacing specific text in a PDF file.
Language: Python - Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Francesco601/AWESOME-Operating-System-Resources
A collection of Operating System Resources for students and teachers
Size: 333 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 7

maximeburri/DocsLock
Android documents reader for courses or exams, locked and synced by an administrative panel.
Language: JavaScript - Size: 3.29 MB - Last synced at: 8 months ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 0

aditya-barman/sem1-files
STATISTICS PRACTICAL FILES FOR SEM1
Language: MATLAB - Size: 3.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

HHMagnus/PDFMerge
Merge pdfs in the browser
Language: C# - Size: 19.1 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

thawkin3/html-to-pdf-demo
Demo of exporting HTML content as PDFs using various html-to-pdf libraries
Language: JavaScript - Size: 1.44 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 4

GerHobbelt/Evil-PDF-Library-for-Qiqqa
A Qiqqa Test Library / Test Corpus which contains various PDF document samples, etc. collected from live Qiqqa libraries to showcase issues and check regressions in the software.
Language: HTML - Size: 7.92 GB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

kairavkkp/Merge-PDF
My first PyPi Package. Merge Image and PDF files using customizations within a folder using the Command line.
Language: Python - Size: 1.26 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 3

naseemakhtar994/android-doc-picker Fork of Turtlebody/android-doc-picker
A simple and easy to use documents Picker android library. Choose any documents like pdf, ppt, text, word or media files from your device
Language: Kotlin - Size: 1.34 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

Nalin-Angrish/IScan 📦
IScan - The Indian Scanner App. Made for the Indians, by the Indians
Language: Java - Size: 108 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

Turtlebody/android-doc-picker 📦
A simple and easy to use documents Picker android library. Choose any documents like pdf, ppt, text, word or media files from your device
Language: Kotlin - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 40 - Forks: 21

imvickykumar999/Unacademy
Storage for PDFs and other Study Materials... PLUSNQ45S https://unacademy.com/goal/gate-ese/PESHE/practice?subject_uid=PQQFK
Size: 293 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

hadro/new-york-city-directories
Some basic data and text extraction from the New York City Directories
Size: 13.2 MB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 2

akicodeoficial/material-organizacao-arquitetura-computadores 📦
Um repositório de material para a disciplina organização e arquitetura de computadores.
Size: 5.82 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

hxnyeol/watermark-pdfs
adds watermark to pdf/pdfs, requires the watermark to be in pdf format.
Language: Python - Size: 46.9 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

sagarmk/docviewier
pdf split tool
Language: Python - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

marsnebulasoup/vue-chartjs-exporter
Export charts created by vue-chartjs to PDF files
Language: JavaScript - Size: 4.03 MB - Last synced at: 8 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

singh-l/ebooks_pdfs Fork of shankergit/ebooks_pdfs
Ebooks on varous CS and tech topics
Size: 690 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

cmjagtap/My_books_Collection
This repo contains some Good Books
Size: 137 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 80 - Forks: 44

Qazalbash/GitHub-Library
An open source repository that contains all the ebooks I have. I would realy encourge to contribute the books you have into it.
Size: 881 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

lundegaard/gatsby-plugin-pdf
Gatsby plugin that is able generate PDFs out of your gatsby web pages
Language: JavaScript - Size: 51.8 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 9

datadesk/hsr-document-analysis 📦
An analysis conducted for the April 27, 2019, story How California’s faltering high-speed rail project was ‘captured’ by costly consultants.
Language: Jupyter Notebook - Size: 68.4 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 2

IamThiago-IT/Books
Pdf para leituras 📙
Size: 51.9 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Hawk453/OCR_FOR_PDFS
Optical Character Recognition for Scanned Documents
Language: Python - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

Hynitr/PediaPlus
Online PDF bank for campuses
Language: JavaScript - Size: 96.5 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

applitools/example-image-tester-cli
Applitools Example: Image Tester CLI
Size: 17.4 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

supersonic181/Replacinator
This is a python script written by me (not from scratch) to replace 1st page of any Pdf with the desired 1st page.
Language: Python - Size: 165 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

vardecab/emails-from-pdfs
Extract email addresses from PDFs stored in multiple folders.
Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

leenock/Py-python
Python Getting started
Language: Python - Size: 65.7 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

GerHobbelt/veraPDF-corpus Fork of veraPDF/veraPDF-corpus
veraPDF test corpus for ISO 19005 (PDF/A)
Size: 73.9 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

williamguilhermesouza/SUSEPDataExtract
Extraction of data from SUSEP for Carteira Global Challenge
Language: Python - Size: 24.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

BryanHolbrook/react-pdf-preview
Upload and display PDFs on multiple pages easily in your React app. ✨💖🥳
Language: JavaScript - Size: 727 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

vanessa920/aws-comp-nlp
The goal is to parse a large dataset of public meetings (i.e. City Council, School Board, Planning Commission) and surface critical insights to everyday community members. This may involve imagining recognition, natural language processing, and sentiment analysis. Meeting minutes are often stored as PDFs so we need help running image recognition on the PDFs.
Language: Jupyter Notebook - Size: 58 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

democritus-project/d8s-pdfs
Democritus functions for working with PDFs.
Language: Python - Size: 55.7 KB - Last synced at: 15 days ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

NiedsonEmanoel/LIB-Geradora-de-Certificado-PDF
Biblioteca simples feita com o ejs com o instuito de gerar certificados em pdf.
Language: EJS - Size: 2.05 MB - Last synced at: 20 days ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

dulajkavinda/pdf-lecture
📄Convert PDFs into vanilla text.
Language: JavaScript - Size: 2.92 MB - Last synced at: 11 days ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

draldric/Automated-PDF-Combiner
Language: MATLAB - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

hurshd0/colab_notebook_to_pdf
Converts colab notebooks to pdfs 📙 👉 📄
Language: Python - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

QuispeRosasGabriel/SistemaRestaurante
Sistema Restaurante
Language: TypeScript - Size: 2.85 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

ConflictingTheories/contractJS
A javascript framework for drafting up contracts using javascript. *See License file for information*
Language: JavaScript - Size: 54.7 KB - Last synced at: 18 days ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

markasoftware/mindtap-scraper
A scraper to generate a PDF of the book Introductory Chemistry: A Foundation
Language: JavaScript - Size: 23.4 KB - Last synced at: 6 days ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

kshithijiyer/MyNotes
This is a repository where I'll upload all my notes which can be helpful for anyone and everyone who needs help.
Size: 3.99 MB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

fabyanmartin/PdfFunctions
A C# class library designed to modify/create pdfs from existing pdf or image files.
Language: C# - Size: 16.6 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

alexlitel/dotgovpdfs
A Twitter bot recording new PDFs on .gov sites.
Language: JavaScript - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0
