Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdfpig

BobLd/Caly

Caly Pdf Reader is a cross-platform PDF document reader application written in C#.

Language: C# - Size: 2.88 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 15 - Forks: 3

GuilhermeStracini/POC-dotnet-ExtractPdfContent

🔬 Proof of Concept of extracting content from PDF files using multiple PDF libraries

Language: C# - Size: 98.6 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 1 - Forks: 0

BobLd/PdfPig.Rendering.Skia

Render PDF documents as images using PdfPig and SkiaSharp

Language: C# - Size: 73.1 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 2

BobLd/tabula-sharp

Extract tables from PDF files (port of tabula-java)

Language: C# - Size: 9.28 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 136 - Forks: 21

BobLd/DocumentLayoutAnalysis

Document layout analysis resources repository for development with PdfPig.

Language: C# - Size: 41.6 MB - Last synced: 3 months ago - Pushed: 8 months ago - Stars: 501 - Forks: 59

edilma/RAG-App-HackTogether

ChatGPT-like application using the RAG pattern that lets me ask questions about my own documents. I used Semantic Kernel in C# to integrate an LLM (OpenAI) and orchestrate AI plugins (Azure Cognitive Services). For the document embeddings I used Qdrant as the vector database, and PdfPig to extract the content from the PDFs.

Language: C# - Size: 1.75 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 3 - Forks: 1

BobLd/PdfPigSvmRegionClassifier

Proof of concept of a simple SVM Region Classifier using PdfPig and Accord.Net. The objective is to classify each text block in a PDF document page as one of title, text, list, table, or image.

Language: C# - Size: 1.13 MB - Last synced: 8 months ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 1

BobLd/camelot-sharp

A C# library to extract tabular data from PDFs (a port of the Python camelot library, using PdfPig).

Language: C# - Size: 3.51 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 21 - Forks: 3

BobLd/PdfPigMLNetBlockClassifier

Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a PDF document page as one of title, text, list, table, or image.

Language: C# - Size: 1.1 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 17 - Forks: 5

BobLd/simple-docstrum

A step-by-step C# implementation of the Docstrum algorithm

Language: Jupyter Notebook - Size: 898 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 19 - Forks: 5