Topic: "docx-parser"
ispras/dedoc
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
Language: Python - Size: 235 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 233 - Forks: 27

has-abi/docparser
Extract text from your DOCX documents.
Language: Python - Size: 92.8 KB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 2

lukethacoder/docx-to-html
π A GUI based docx to html parser. Useful for ripping out inline styles of docx files.
Language: HTML - Size: 110 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

omar2535/BioLife-AU-01-attendance-parser
Biolife-AU-01 ζε‘ιθ§£ζη¨εΊ
Language: Python - Size: 16.6 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

FayazK/Document-Metadata-Extractor
A Python tool that uses Google's Gemini AI to automatically extract structured metadata from PDF and DOCX documents, saving results to Excel for easy analysis and organizing raw responses as JSON files.
Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

coffeemesh/compareFootnotes
Small script for comparing footnotes on .docx files. Resulting in a .csv
Language: Python - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

cuiyuheng/docling Fork of docling-project/docling
π₯ Transform PDF to JSON or Markdown with ease and speed π£
Size: 28.5 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Valss22/shoqan-testing-platform
A platform for testing in various disciplines with biometric verification and certificates.
Language: Python - Size: 375 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
