GitHub topics: docx
md2docx/react-markdown
SSR-ready React Markdown renderer with MDAST reuse, JSX support, and unified plugin pipeline.
Language: TypeScript - Size: 598 KB - Last synced at: 44 minutes ago - Pushed at: about 3 hours ago - Stars: 6 - Forks: 0

EvotecIT/OfficeIMO
Fast and easy to use cross-platform .NET library that creates or modifies Microsoft Word (DocX) and later also Excel (XLSX) files without installing any software. Library is based on Open XML SDK
Language: C# - Size: 20.3 MB - Last synced at: about 2 hours ago - Pushed at: about 3 hours ago - Stars: 370 - Forks: 57

docling-project/docling
Get your documents ready for gen AI
Language: Python - Size: 131 MB - Last synced at: about 5 hours ago - Pushed at: about 20 hours ago - Stars: 38,011 - Forks: 2,618

davidjia1972/md2docx
A modern, user-friendly GUI application for converting Markdown files to DOCX documents using Pandoc. Designed with a clean, Google-like interface and powerful batch processing capabilities.
Language: Python - Size: 2.56 MB - Last synced at: about 16 hours ago - Pushed at: about 18 hours ago - Stars: 2 - Forks: 0

User233389/DocuQuick
日本式の社内文書のテンプレートを作成することができるソフトウェア。
Language: C# - Size: 18.6 MB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

superstarryeyes/lue
Terminal eBook Reader with Text-to-Speech
Language: Python - Size: 453 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 398 - Forks: 12

koodo-reader/koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux, Android, iOS and Web
Language: JavaScript - Size: 73.8 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 23,727 - Forks: 1,802

JJJJJJack/go-template-docx
Template engine for docx documents, with image loading, loops and charts support
Language: Go - Size: 227 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 27 - Forks: 0

Harbour-Enterprises/SuperDoc
🦋️ SuperDoc - modern document editing
Language: JavaScript - Size: 142 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 58 - Forks: 15

rstudio/gt
Easily generate information-rich, publication-quality tables from R
Language: R - Size: 292 MB - Last synced at: about 21 hours ago - Pushed at: 11 days ago - Stars: 2,114 - Forks: 218

miyako/4d-plugin-doctotext
4D implementation of DocToText.
Language: C - Size: 113 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

md2docx/jsx
A React-compatible renderer for MDAST that supports rendering extended Markdown (with HTML, Mermaid, and more) to both JSX and styled DOCX documents—using a unified syntax tree. Like react-markdown, but with document generation built-in
Language: TypeScript - Size: 479 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

Unstructured-IO/unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
Language: HTML - Size: 193 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 12,544 - Forks: 1,030

gabriel-alves051294/python-document-merger
Ferramenta Python para unificar e converter arquivos Word (.doc, .docx). Ideal para automação, limpeza de dados e preparação para IA. Python script to merge and convert .doc/.docx files.
Language: Python - Size: 35.2 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

makbn/JThumbnail
A thumbnail generation Java library for Office,PDF,HTML,Text,MP3,MPEG and Image documents
Language: Java - Size: 18.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 41 - Forks: 23

jesselau76/ebook-GPT-translator
Enjoy reading with your favorite style.
Language: Python - Size: 638 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 1,681 - Forks: 210

Zettlr/Zettlr
Your One-Stop Publication Workbench
Language: TypeScript - Size: 133 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 11,781 - Forks: 723

shcherbak-ai/contextgem
ContextGem: Effortless LLM extraction from documents
Language: Python - Size: 63 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,483 - Forks: 112

HlexNC/Document-Conversion-Solutions
Document Conversion Solutions 📄🔄 - A comprehensive suite of tools for document conversion, including Python APIs, JavaScript solutions, and an npm package. Dedicated to simplifying the export of DOCX, PPTX, and XLSX documents.
Language: JavaScript - Size: 1.67 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 1

md2docx/mdast2docx
Utility to convert MDAST (Markdown Abstract Syntax Tree) to DOCX
Language: TypeScript - Size: 1.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 15 - Forks: 3

n4ze3m/dialoqbase
Create chatbots with ease
Language: TypeScript - Size: 2.31 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 1,777 - Forks: 282

mcanouil/quarto-highlight-text
Quarto extension that allows to highlight text in a document for various formats: HTML, LaTeX, Typst, Reveal.js, Beamer, PowerPoint, and Docx.
Language: Lua - Size: 77.1 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 32 - Forks: 2

xceedsoftware/DocX
Fast and easy to use .NET library that creates or modifies Microsoft Word files without installing Word.
Language: C# - Size: 25.9 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 1,861 - Forks: 481

Fdawgs/docsmith
RESTful API for converting clinical documents and files
Language: Rich Text Format - Size: 25.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 21 - Forks: 2

dvejsada/mcp-ms-office-documents
MCP server providing tools to create Ms Office documents like presentations, emails, spreadshhets and word docs (pptx, docx, eml, xlsx)
Language: Python - Size: 516 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 4 - Forks: 1

syncfusion-content/java-file-format-docs
This repository contains the documentation of Syncfusion file format libraries for Java which is used to create, read, edit, and Word documents.
Language: HTML - Size: 32.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 2

lukasjarosch/go-docx
Replace placeholders inside docx documents with speed and confidence.
Language: Go - Size: 678 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 54

ether/etherpad-lite
Etherpad: A modern really-real-time collaborative document editor.
Language: TypeScript - Size: 41.1 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 17,687 - Forks: 2,955

guigrpa/docx-templates
Template-based docx report creation
Language: TypeScript - Size: 9.59 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 1,013 - Forks: 163

Eddy12597/MUN-Reso-Formatter
Python App for formatting MUN Resolutions
Language: Python - Size: 145 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

VolodymyrBaydalka/docxjs
Docx rendering library
Language: TypeScript - Size: 3.59 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 1,697 - Forks: 224

Novout/betterwrite
:bookmark_tabs: A Creative Word Processor.
Language: TypeScript - Size: 63.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 77 - Forks: 10

QuivrHQ/MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language: Python - Size: 55.2 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 7,122 - Forks: 391

fabriziosalmi/pdf-ocr
Converts scanned PDF documents to multiple formats using Optical Character Recognition
Language: HTML - Size: 28.8 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 5 - Forks: 1

manfromarce/DocSharp
Pure C# library to convert between document formats (Office 97-2003, Open XML, RTF, Markdown)
Language: C# - Size: 7.21 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 20 - Forks: 4

DraviaVemal/openxml-office
Create or Modify Power Point/Presentation (pptx), Excel/Spreadsheet (xlsx) & Word/Document (docx) file with ease in Rust, C#, Python🚧⚒️, Java🚧⚒️,Go🚧⚒️ or Typescript🚧⚒️
Language: Rust - Size: 3.48 MB - Last synced at: 5 days ago - Pushed at: 18 days ago - Stars: 39 - Forks: 3

open-xml-templating/docxtemplater
Generate docx, pptx, and xlsx from templates (Word, Powerpoint and Excel documents), from Node.js or the browser. Demo: https://www.docxtemplater.com/demo. #docx #office #generator #templating #report #json #generate #generation #template #create #pptx #docx #xlsx #react #vuejs #angularjs #browser #typescript #image #html #table #chart
Language: JavaScript - Size: 26.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3,375 - Forks: 371

MrUsabuki/jobhireai-resume-templates
📄 Create professional, ATS-friendly resumes with JobHire.ai's free templates designed for job seekers and recruiters. Edit easily in popular formats.
Size: 112 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

open-xml-templating/docxtemplater-build
Built versions of docxtemplater
Language: JavaScript - Size: 2.74 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 21 - Forks: 124

ispras/dedoc
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
Language: Python - Size: 240 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 589 - Forks: 44

frstlvl/docx2md
Convert Microsoft Word .docx files to Obsidian-friendly Markdown with YAML front matter extracted from document properties.
Language: Python - Size: 41 KB - Last synced at: 1 day ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

jinghaihan/turnpress
Markdown, Docx to VitePress converter, powered by pandoc and turndown.
Language: TypeScript - Size: 705 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 1

docs-plus/docs.plus
A real-time community collaboration platform
Language: TypeScript - Size: 28.7 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 79 - Forks: 9

gamemaker1/office-text-extractor
Yet another library to extract text from MS Office and PDF files
Language: TypeScript - Size: 2.15 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 81 - Forks: 7

wvbe/docxml
TypeScript (component) library for building and parsing a DOCX file
Language: TypeScript - Size: 37.9 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 34 - Forks: 8

zyl-ui/vue-file-viewer
一个基于iframe提供跨框架、多格式、纯前端渲染的文件浏览解决方案(支持格式:pptx,docx,xlsx,pdf,mp4,纯文本和图片)
Language: JavaScript - Size: 27 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 124 - Forks: 21

AstraBert/PdfItDown
Convert Everything to PDF
Language: Python - Size: 7.6 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 161 - Forks: 20

SharvilDhumal/DokuAi
DokuAI is an AI-powered web application that converts PDF and DOCX documents into clean, structured Markdown with embedded images. It features secure authentication, role-based access, and a modern, user-friendly interface for seamless documentation workflows.
Language: JavaScript - Size: 18.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

vace/markdown-docx
Convert Markdown files to DOCX format with support for both browser and Node.js environments. 将 Markdown 文件转换为 DOCX 格式,支持浏览器和 Node.js 环境。
Language: TypeScript - Size: 1.44 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 130 - Forks: 16

cherfia/chromiumly
A lightweight Typescript library that interacts with Gotenberg's different modules to convert a variety of document formats to PDF files.
Language: TypeScript - Size: 2.44 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 123 - Forks: 13

jobhire-ai/jobhireai-resume-templates
Free ATS-friendly resume templates (DOCX). Chronological, Modern, Creative, Harvard, and Two-Column styles. Edit in Word or Google Docs. By JobHire.ai.
Size: 156 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

ONLYOFFICE/snap-desktopeditors
The ONLYOFFICE Desktop Editors snap package for the snap package system
Language: Shell - Size: 116 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 15 - Forks: 9

ONLYOFFICE/snap-documentserver
The ONLYOFFICE Document Server snap package for the snap package system
Language: Shell - Size: 218 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 23 - Forks: 7

ra1g-eu/stringsearch
Search for a string in PDF and Docx files
Language: JavaScript - Size: 900 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

qq15725/modern-openxml
Office Open XML for JavaScript
Language: TypeScript - Size: 88.2 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 0

syncfusion-content/fileformat-docs
This repository contains the documentation of Syncfusion file format .NET libraries which is used to create, read, edit and convert PDF, Excel, Word and PPTX documents.
Language: HTML - Size: 251 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 11 - Forks: 18

inokawa/remark-docx
remark plugin to compile markdown to docx (Microsoft Word, Office Open XML).
Language: TypeScript - Size: 18.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 87 - Forks: 19

eiceblue/Spire.Doc-for-C
Spire.Doc for C++ is a professional Word C++ library specifically designed for developers to create, read, write, convert, merge, split, and compare Word documents on any C++ platforms with fast and high-quality performance.
Language: C++ - Size: 371 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 1

skfrost19/Docx-Viewer
VSCode extension to view docx / ODT files within the editor.
Language: TypeScript - Size: 7.13 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 39 - Forks: 5

bgreenwell/doxx
Expose the contents of .docx files without leaving your terminal. Fast, safe, and smart — no Office required!
Language: Rust - Size: 4.69 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2,208 - Forks: 45

rashidrashiii/BuildMyResume
An AI-powered, open source resume builder — no sign-up, end-to-end encrypted, with Google Gemini content enhancement and PDF export.
Language: TypeScript - Size: 7.13 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 9 - Forks: 0

davidgohel/flextable
table farming
Language: R - Size: 51.2 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 602 - Forks: 87

brsloan/warewoolf
A minimalist novel-writing system/rich text editor designed to be usable without a mouse. For desktop and standalone word processors/digital typewriters/writerDecks.
Language: JavaScript - Size: 3.89 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 264 - Forks: 6

yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language: Rust - Size: 2.88 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 1,217 - Forks: 56

didikprabowo/mbadocx
Go library for programmatically creating and manipulating Microsoft Word (DOCX) documents
Language: Go - Size: 139 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

dotnet/Open-XML-SDK
Open XML SDK by Microsoft
Language: C# - Size: 82.9 MB - Last synced at: 14 days ago - Pushed at: 25 days ago - Stars: 4,300 - Forks: 568

ArtifexSoftware/pdf2docx
Open source Python library for converting PDF to DOCX.
Language: Python - Size: 21.6 MB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 3,066 - Forks: 446

GuoJikun/quicklook
quicklook 是使用 Tauri v2 开发的 Windows 平台的文件预览工具
Language: JavaScript - Size: 5.11 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 12 - Forks: 1

onizet/html2openxml
Html2OpenXml is a small .Net library that convert simple or advanced HTML to plain OpenXml components. This program has started in 2009, initially to convert user's comments into templated Word.
Language: C# - Size: 1.98 MB - Last synced at: 5 days ago - Pushed at: 26 days ago - Stars: 378 - Forks: 120

ruby-docx/docx
a ruby library/gem for interacting with .docx files
Language: Ruby - Size: 504 KB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 470 - Forks: 175

tomwatkins1994/go-docx-template
A simple Go library for merging docx files with data.
Language: Go - Size: 266 KB - Last synced at: 6 days ago - Pushed at: 15 days ago - Stars: 4 - Forks: 1

Hufe921/canvas-editor-plugin
plugins for canvas-editor
Language: TypeScript - Size: 731 KB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 110 - Forks: 42

phlowerteam/act_as_page_extractor
A library that extracts plain text from documents for subsequent processing, such as indexing and search.
Language: Rich Text Format - Size: 368 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4 - Forks: 0

smnandre/pandoc
Pandoc PHP - Advanced Document Converter
Language: PHP - Size: 82 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 2 - Forks: 0

tristan-mcinnis/qualitative-insight-engine
AI-powered qualitative research analysis pipeline with GPT-5 Nano and Pinecone vector storage for automated topic analysis and report generation
Language: TypeScript - Size: 441 KB - Last synced at: 7 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

kalcaddle/kodbox
kodbox is a file manager for web. It is a newly designed product based on kodexplorer. It is also a web code editor, which allows you to develop websites directly within the web browser.You can run kodbox either online or locally,on Linux, Windows or Mac based platforms
Language: PHP - Size: 207 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 2,648 - Forks: 429

lacuna-technologies/docdocgoose
Edit documents directly in your browser. Remove editing or highlighting restrictions and unlock track changes in your documents.
Language: TypeScript - Size: 354 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 3

dfop02/html4docx
Convert html to docx
Language: Python - Size: 160 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 37 - Forks: 7

maehr/academic-pandoc-template
Write beautifully typeset academic texts with distraction-free Markdown and Pandoc.
Language: HTML - Size: 41 MB - Last synced at: 15 days ago - Pushed at: 17 days ago - Stars: 257 - Forks: 47

zurmokeeper/officecrypto-tool
officecrypto-tool is a library for js that can be used to decrypt and encrypt office(excel/ppt/word) files.
Language: JavaScript - Size: 1.08 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 21 - Forks: 5

Anathi-C/Backup-Linux
Effortlessly back up your Linux files with PyBackup. This open-source tool offers encryption, integrity checks, and a user-friendly CLI. 🌐🐙
Language: Python - Size: 48.8 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

HoangAnhTut/thebat_parser
This repository contains a Python script that converts `.eml` email files into structured `.docx` documents. It uses BeautifulSoup for HTML parsing and python-docx for document creation, making email data easy to read and access. 🐱💻📧
Language: Python - Size: 11.7 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

Taha5125/DocxWriter-JSON
DocxWriter is a Python library for generating professional Word documents from JSON. Automate reports, add tables, lists, images, and apply custom styles — all from clean, structured data.
Language: Python - Size: 23.4 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

ThaSMorato/docx-parser
A modern JavaScript library for parsing and processing Microsoft Word DOCX documents with support for both buffer and stream operations. Features incremental parsing, checkbox detection, footnote support, and document validation.
Language: TypeScript - Size: 383 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

sussybakala/HackRx-6.0-Intelligent-Query-Retrieval
Language: Python - Size: 14.6 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

orcastor/addon-previewer
🖼【多模态文档预览插件(纯本地版,效果和兼容性更好,可适配存储)】跨平台支持Office(docx/xlsx/pptx) / WPS / iWork / PDF / CAD / 代码文档、图片、视频、音频、压缩包等大部分文件的预览 A local multimodal previewer for most kinds of documents.
Language: Go - Size: 540 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 7

kevv1m/tikara
The metadata and text content extractor for almost every file type.
Size: 1000 Bytes - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

BenedicteGiraud/docx-merge-js
A fast and lightweight Node.js library written in TypeScript for merging two Microsoft Word (.docx) documents into one. Easily insert content at specific positions or based on placeholder patterns.
Language: TypeScript - Size: 85 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 1

88250/lute-docx
📝一款将 Markdown 文本转换为 Word 文档 (.docx) 的小工具。
Language: Go - Size: 348 KB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 46 - Forks: 8

kotwys/parsedict
Parse .docx files that are structured by formatting
Language: Python - Size: 28.3 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

keskinonur/doxx-go
A Go port of bgreenwell/doxx - A lightning-fast, terminal-native document viewer for Microsoft Word files.
Language: Go - Size: 674 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

smartinmedia/Net-Core-DocX-HTML-To-PDF-Converter
.NET Core library to create custom reports based on Word docx or HTML documents and convert to PDF
Language: C# - Size: 1.94 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 334 - Forks: 81

sajari/docconv
Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text
Language: Go - Size: 1.62 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,720 - Forks: 239

ag2307/CSV-GPT
Document Question Answering Chatbot
Language: Python - Size: 8.39 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

luuducly/WordTemplater
An useful cross-platform library to export Word template with formatting, repeating data, image, QR code...
Language: C# - Size: 2.03 MB - Last synced at: 22 days ago - Pushed at: 6 months ago - Stars: 15 - Forks: 2

Drntth/rag-ai-assistant
RAG AI Assistant is a modular system for advanced document-based Q&A. It uses a vector database (PostgreSQL + pgvector) for fast, context-aware search and supports multiple chat/embedding models. A document pipeline cleans and converts DOCX/TXT files for embedding, but the main focus is on AI-powered question answering.
Language: Python - Size: 37.1 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

dead8309/markitdown-ts
Convert various file formats to Markdown for indexing, text analysis, and other applications that benefit from structured text. TS port of the python ibrary.
Language: HTML - Size: 5.3 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 60 - Forks: 2

neka-nat/mineru-api
MinerU API server
Language: Python - Size: 7.16 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 67 - Forks: 23

unidoc/unioffice
Pure go library for creating and processing Office Word (.docx), Excel (.xlsx) and Powerpoint (.pptx) documents
Language: Go - Size: 156 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 4,646 - Forks: 487

transpect/docx2tex
Converts Microsoft Word docx to LaTeX
Language: XSLT - Size: 1.07 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 585 - Forks: 53
