An open API service providing repository metadata for many open source software ecosystems.

Topic: "docx-parser"

ispras/dedoc

Dedoc is a library (service) for automated document parsing and conversion to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

Language: Python - Size: 235 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 233 - Forks: 27

has-abi/docparser

Extract text from your DOCX documents.

Language: Python - Size: 92.8 KB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 2

lukethacoder/docx-to-html

📃 A GUI-based docx to html parser. Useful for ripping out inline styles of docx files.

Language: HTML - Size: 110 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

omar2535/BioLife-AU-01-attendance-parser

Biolife-AU-01 time clock (punch clock) parsing program

Language: Python - Size: 16.6 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

FayazK/Document-Metadata-Extractor

A Python tool that uses Google's Gemini AI to automatically extract structured metadata from PDF and DOCX documents, saving results to Excel for easy analysis and organizing raw responses as JSON files.

Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

coffeemesh/compareFootnotes

Small script for comparing footnotes in .docx files. Outputs the result as a .csv file.

Language: Python - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

cuiyuheng/docling (fork of docling-project/docling)

🥚 Transform PDF to JSON or Markdown with ease and speed 🐣

Size: 28.5 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Valss22/shoqan-testing-platform

A platform for testing in various disciplines with biometric verification and certificates.

Language: Python - Size: 375 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0