An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf-to-csv

monambike/pdfconverter-pdftables-to-csv

Python project that converts tables inside PDFs to CSV for convenient data manipulation. It has log and exception handling.

Language: Python - Size: 142 MB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 1

nordinz7/maybankpdf2json-cli

Convert MayBank email statement delivery to CSV or JSON format via CLI

Language: Python - Size: 28.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

bytescout/pdf-extractor-sdk-samples

ByteScout PDF Extractor SDK source code samples

Language: C# - Size: 27.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 5

RyanLiu6/Ena

Converts and categorizes transactions into CSVs for Canadian Financial Institutions. Uses Llama3 to infer categories via Ollama.

Language: Python - Size: 65.4 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

HoangTran0410/saoke_yagi

Sao kê của Mặt Trận Tổ Quốc Việt Nam (MTTQ) về việc hỗ trợ đồng bào sau bão Yagi

Language: JavaScript - Size: 392 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 26 - Forks: 7

floriancochard/extract-data-from-paper

A tool designed to extract numerical data from scanned historical weather documents.

Language: Python - Size: 151 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 13 - Forks: 2

towfique-elahe/pdf-to-structured-csv

A Python-based tool for extracting structured data from PDFs using OCR and regex, and exporting it to CSV. Ideal for processing invoices, logs, or scanned documents into organized, usable datasets.

Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Bizzaro/Teller Fork of scanion/Teller

Extract transaction data from RBC, TD, BMO, Manulife, AMEX and other 🇨🇦 Canadian banks/FI's credit card PDF e-statements to SQLite DB/CSV.

Language: Python - Size: 211 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 9

vresch/cv-parser

Bulk CV parser

Language: JavaScript - Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Deadpool2000/pdf-to-csv

Convert PDF files to CSV

Language: Python - Size: 29.3 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

NanoNets/ocr-python

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

Language: Jupyter Notebook - Size: 5.52 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 4

cbgaindia/parsers

A collection of scripts to parse Indian Budget documents into clean machine readable formats.

Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 7

bkawan/pdf-parser

Language: Python - Size: 3.25 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 0

othyn/docker-tabula-java Fork of sseemayer/docker-tabula-java

A minimal Docker image for running tabulapdf/tabula-java.

Language: Makefile - Size: 71.3 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

odunayo12/New_NGN_BUDGET_DATA

This repo consists of Nigerian Budget Data for data accessible period.

Language: R - Size: 3.16 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

gbroques/wild-edibles-of-missouri-pdf-to-csv

A Node.js script to transform a PDF copy of Wild Edibles of Missouri to a CSV file.

Language: JavaScript - Size: 3.7 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0