Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: pdfpig
BobLd/Caly
Caly Pdf Reader is a cross-platform pdf document reader application written in C#
Language: C# - Size: 2.88 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 15 - Forks: 3
GuilhermeStracini/POC-dotnet-ExtractPdfContent
🔬 Proof of Concept of extracting content from PDF files using multiple PDF libraries
Language: C# - Size: 98.6 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 1 - Forks: 0
BobLd/PdfPig.Rendering.Skia
Render pdf documents as images using PdfPig and SkiaSharp
Language: C# - Size: 73.1 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 5 - Forks: 2
BobLd/tabula-sharp
Extract tables from PDF files (port of tabula-java)
Language: C# - Size: 9.28 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 136 - Forks: 21
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
Language: C# - Size: 41.6 MB - Last synced: 3 months ago - Pushed: 8 months ago - Stars: 501 - Forks: 59
edilma/RAG-App-HackTogether
ChatGPT-like Application using RAG pattern that allows to ask question to my own documents - I Used Semantic Kernel to integrate a LLM (OpenAI) using C# to orchestrate AI pluggins (Azure Cognitive Services). For the document embeddings I used Qdrant for the vector database and Pdfpig to extract the content from the pdfs
Language: C# - Size: 1.75 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 3 - Forks: 1
BobLd/PdfPigSvmRegionClassifier
Proof of concept of a simple SVM Region Classifier using PdfPig and Accord.Net. The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Language: C# - Size: 1.13 MB - Last synced: 8 months ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 1
BobLd/camelot-sharp
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).
Language: C# - Size: 3.51 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 21 - Forks: 3
BobLd/PdfPigMLNetBlockClassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Language: C# - Size: 1.1 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 17 - Forks: 5
BobLd/simple-docstrum
A step-by-step C# implementation of the Docstrum algorithm
Language: Jupyter Notebook - Size: 898 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 19 - Forks: 5