An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: docx

md2docx/react-markdown

SSR-ready React Markdown renderer with MDAST reuse, JSX support, and unified plugin pipeline.

Language: TypeScript - Size: 598 KB - Last synced at: 44 minutes ago - Pushed at: about 3 hours ago - Stars: 6 - Forks: 0

EvotecIT/OfficeIMO

Fast and easy to use cross-platform .NET library that creates or modifies Microsoft Word (DocX) and later also Excel (XLSX) files without installing any software. The library is based on the Open XML SDK.

Language: C# - Size: 20.3 MB - Last synced at: about 2 hours ago - Pushed at: about 3 hours ago - Stars: 370 - Forks: 57

docling-project/docling

Get your documents ready for gen AI

Language: Python - Size: 131 MB - Last synced at: about 5 hours ago - Pushed at: about 20 hours ago - Stars: 38,011 - Forks: 2,618

davidjia1972/md2docx

A modern, user-friendly GUI application for converting Markdown files to DOCX documents using Pandoc. Designed with a clean, Google-like interface and powerful batch processing capabilities.

Language: Python - Size: 2.56 MB - Last synced at: about 16 hours ago - Pushed at: about 18 hours ago - Stars: 2 - Forks: 0

User233389/DocuQuick

Software for creating templates for Japanese-style internal company documents.

Language: C# - Size: 18.6 MB - Last synced at: about 17 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

superstarryeyes/lue

Terminal eBook Reader with Text-to-Speech

Language: Python - Size: 453 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 398 - Forks: 12

koodo-reader/koodo-reader

A modern ebook manager and reader with sync and backup capabilities for Windows, macOS, Linux, Android, iOS and Web

Language: JavaScript - Size: 73.8 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 23,727 - Forks: 1,802

JJJJJJack/go-template-docx

Template engine for docx documents, with image loading, loops and charts support

Language: Go - Size: 227 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 27 - Forks: 0

Harbour-Enterprises/SuperDoc

🦋️ SuperDoc - modern document editing

Language: JavaScript - Size: 142 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 58 - Forks: 15

rstudio/gt

Easily generate information-rich, publication-quality tables from R

Language: R - Size: 292 MB - Last synced at: about 21 hours ago - Pushed at: 11 days ago - Stars: 2,114 - Forks: 218

miyako/4d-plugin-doctotext

4D implementation of DocToText.

Language: C - Size: 113 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

md2docx/jsx

A React-compatible renderer for MDAST that supports rendering extended Markdown (with HTML, Mermaid, and more) to both JSX and styled DOCX documents—using a unified syntax tree. Like react-markdown, but with document generation built-in

Language: TypeScript - Size: 479 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.

Language: HTML - Size: 193 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 12,544 - Forks: 1,030

gabriel-alves051294/python-document-merger

Python tool for merging and converting Word files (.doc, .docx). Ideal for automation, data cleaning, and preparing data for AI.

Language: Python - Size: 35.2 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

makbn/JThumbnail

A thumbnail generation Java library for Office, PDF, HTML, Text, MP3, MPEG and Image documents

Language: Java - Size: 18.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 41 - Forks: 23

jesselau76/ebook-GPT-translator

Enjoy reading with your favorite style.

Language: Python - Size: 638 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 1,681 - Forks: 210

Zettlr/Zettlr

Your One-Stop Publication Workbench

Language: TypeScript - Size: 133 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 11,781 - Forks: 723

shcherbak-ai/contextgem

ContextGem: Effortless LLM extraction from documents

Language: Python - Size: 63 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,483 - Forks: 112

HlexNC/Document-Conversion-Solutions

Document Conversion Solutions 📄🔄 - A comprehensive suite of tools for document conversion, including Python APIs, JavaScript solutions, and an npm package. Dedicated to simplifying the export of DOCX, PPTX, and XLSX documents.

Language: JavaScript - Size: 1.67 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 1

md2docx/mdast2docx

Utility to convert MDAST (Markdown Abstract Syntax Tree) to DOCX

Language: TypeScript - Size: 1.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 15 - Forks: 3

n4ze3m/dialoqbase

Create chatbots with ease

Language: TypeScript - Size: 2.31 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 1,777 - Forks: 282

mcanouil/quarto-highlight-text

Quarto extension for highlighting text in a document across various formats: HTML, LaTeX, Typst, Reveal.js, Beamer, PowerPoint, and Docx.

Language: Lua - Size: 77.1 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 32 - Forks: 2

xceedsoftware/DocX

Fast and easy to use .NET library that creates or modifies Microsoft Word files without installing Word.

Language: C# - Size: 25.9 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 1,861 - Forks: 481

Fdawgs/docsmith

RESTful API for converting clinical documents and files

Language: Rich Text Format - Size: 25.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 21 - Forks: 2

dvejsada/mcp-ms-office-documents

MCP server providing tools to create MS Office documents like presentations, emails, spreadsheets and Word docs (pptx, docx, eml, xlsx)

Language: Python - Size: 516 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 4 - Forks: 1

syncfusion-content/java-file-format-docs

This repository contains the documentation of Syncfusion file format libraries for Java, which are used to create, read, and edit Word documents.

Language: HTML - Size: 32.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 2

lukasjarosch/go-docx

Replace placeholders inside docx documents with speed and confidence.

Language: Go - Size: 678 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 54

ether/etherpad-lite

Etherpad: A modern really-real-time collaborative document editor.

Language: TypeScript - Size: 41.1 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 17,687 - Forks: 2,955

guigrpa/docx-templates

Template-based docx report creation

Language: TypeScript - Size: 9.59 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 1,013 - Forks: 163

Eddy12597/MUN-Reso-Formatter

Python App for formatting MUN Resolutions

Language: Python - Size: 145 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

VolodymyrBaydalka/docxjs

Docx rendering library

Language: TypeScript - Size: 3.59 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 1,697 - Forks: 224

Novout/betterwrite

📑 A Creative Word Processor.

Language: TypeScript - Size: 63.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 77 - Forks: 10

QuivrHQ/MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Language: Python - Size: 55.2 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 7,122 - Forks: 391

fabriziosalmi/pdf-ocr

Converts scanned PDF documents to multiple formats using Optical Character Recognition

Language: HTML - Size: 28.8 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 5 - Forks: 1

manfromarce/DocSharp

Pure C# library to convert between document formats (Office 97-2003, Open XML, RTF, Markdown)

Language: C# - Size: 7.21 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 20 - Forks: 4

DraviaVemal/openxml-office

Create or modify PowerPoint/Presentation (pptx), Excel/Spreadsheet (xlsx) & Word/Document (docx) files with ease in Rust, C#, Python🚧⚒️, Java🚧⚒️, Go🚧⚒️ or TypeScript🚧⚒️

Language: Rust - Size: 3.48 MB - Last synced at: 5 days ago - Pushed at: 18 days ago - Stars: 39 - Forks: 3

open-xml-templating/docxtemplater

Generate docx, pptx, and xlsx from templates (Word, Powerpoint and Excel documents), from Node.js or the browser. Demo: https://www.docxtemplater.com/demo. #docx #office #generator #templating #report #json #generate #generation #template #create #pptx #docx #xlsx #react #vuejs #angularjs #browser #typescript #image #html #table #chart

Language: JavaScript - Size: 26.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3,375 - Forks: 371

MrUsabuki/jobhireai-resume-templates

📄 Create professional, ATS-friendly resumes with JobHire.ai's free templates designed for job seekers and recruiters. Edit easily in popular formats.

Size: 112 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

open-xml-templating/docxtemplater-build

Built versions of docxtemplater

Language: JavaScript - Size: 2.74 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 21 - Forks: 124

ispras/dedoc

Dedoc is a library (service) for automating document parsing and converting documents to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

Language: Python - Size: 240 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 589 - Forks: 44

frstlvl/docx2md

Convert Microsoft Word .docx files to Obsidian-friendly Markdown with YAML front matter extracted from document properties.

Language: Python - Size: 41 KB - Last synced at: 1 day ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

jinghaihan/turnpress

Markdown, Docx to VitePress converter, powered by pandoc and turndown.

Language: TypeScript - Size: 705 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 1

docs-plus/docs.plus

A real-time community collaboration platform

Language: TypeScript - Size: 28.7 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 79 - Forks: 9

gamemaker1/office-text-extractor

Yet another library to extract text from MS Office and PDF files

Language: TypeScript - Size: 2.15 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 81 - Forks: 7

wvbe/docxml

TypeScript (component) library for building and parsing a DOCX file

Language: TypeScript - Size: 37.9 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 34 - Forks: 8

zyl-ui/vue-file-viewer

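An iframe-based file viewing solution offering cross-framework, multi-format, pure front-end rendering (supported formats: pptx, docx, xlsx, pdf, mp4, plain text, and images)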

Language: JavaScript - Size: 27 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 124 - Forks: 21

AstraBert/PdfItDown

Convert Everything to PDF

Language: Python - Size: 7.6 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 161 - Forks: 20

SharvilDhumal/DokuAi

DokuAI is an AI-powered web application that converts PDF and DOCX documents into clean, structured Markdown with embedded images. It features secure authentication, role-based access, and a modern, user-friendly interface for seamless documentation workflows.

Language: JavaScript - Size: 18.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

vace/markdown-docx

Convert Markdown files to DOCX format with support for both browser and Node.js environments.

Language: TypeScript - Size: 1.44 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 130 - Forks: 16

cherfia/chromiumly

A lightweight TypeScript library that interacts with Gotenberg's different modules to convert a variety of document formats to PDF files.

Language: TypeScript - Size: 2.44 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 123 - Forks: 13

jobhire-ai/jobhireai-resume-templates

Free ATS-friendly resume templates (DOCX). Chronological, Modern, Creative, Harvard, and Two-Column styles. Edit in Word or Google Docs. By JobHire.ai.

Size: 156 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

ONLYOFFICE/snap-desktopeditors

The ONLYOFFICE Desktop Editors snap package for the snap package system

Language: Shell - Size: 116 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 15 - Forks: 9

ONLYOFFICE/snap-documentserver

The ONLYOFFICE Document Server snap package for the snap package system

Language: Shell - Size: 218 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 23 - Forks: 7

ra1g-eu/stringsearch

Search for a string in PDF and Docx files

Language: JavaScript - Size: 900 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

qq15725/modern-openxml

Office Open XML for JavaScript

Language: TypeScript - Size: 88.2 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 0

syncfusion-content/fileformat-docs

This repository contains the documentation of Syncfusion file format .NET libraries, which are used to create, read, edit and convert PDF, Excel, Word and PPTX documents.

Language: HTML - Size: 251 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 11 - Forks: 18

inokawa/remark-docx

remark plugin to compile markdown to docx (Microsoft Word, Office Open XML).

Language: TypeScript - Size: 18.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 87 - Forks: 19

eiceblue/Spire.Doc-for-C

Spire.Doc for C++ is a professional Word C++ library specifically designed for developers to create, read, write, convert, merge, split, and compare Word documents on any C++ platform with fast and high-quality performance.

Language: C++ - Size: 371 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 1

skfrost19/Docx-Viewer

VSCode extension to view docx / ODT files within the editor.

Language: TypeScript - Size: 7.13 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 39 - Forks: 5

bgreenwell/doxx

Expose the contents of .docx files without leaving your terminal. Fast, safe, and smart — no Office required!

Language: Rust - Size: 4.69 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2,208 - Forks: 45

rashidrashiii/BuildMyResume

An AI-powered, open source resume builder — no sign-up, end-to-end encrypted, with Google Gemini content enhancement and PDF export.

Language: TypeScript - Size: 7.13 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 9 - Forks: 0

davidgohel/flextable

table farming

Language: R - Size: 51.2 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 602 - Forks: 87

brsloan/warewoolf

A minimalist novel-writing system/rich text editor designed to be usable without a mouse. For desktop and standalone word processors/digital typewriters/writerDecks.

Language: JavaScript - Size: 3.89 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 264 - Forks: 6

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Language: Rust - Size: 2.88 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 1,217 - Forks: 56

didikprabowo/mbadocx

Go library for programmatically creating and manipulating Microsoft Word (DOCX) documents

Language: Go - Size: 139 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

dotnet/Open-XML-SDK

Open XML SDK by Microsoft

Language: C# - Size: 82.9 MB - Last synced at: 14 days ago - Pushed at: 25 days ago - Stars: 4,300 - Forks: 568

ArtifexSoftware/pdf2docx

Open source Python library for converting PDF to DOCX.

Language: Python - Size: 21.6 MB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 3,066 - Forks: 446

GuoJikun/quicklook

quicklook is a file preview tool for Windows, built with Tauri v2

Language: JavaScript - Size: 5.11 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 12 - Forks: 1

onizet/html2openxml

Html2OpenXml is a small .NET library that converts simple or advanced HTML to plain OpenXml components. This program was started in 2009, initially to convert users' comments into templated Word documents.

Language: C# - Size: 1.98 MB - Last synced at: 5 days ago - Pushed at: 26 days ago - Stars: 378 - Forks: 120

ruby-docx/docx

A Ruby library/gem for interacting with .docx files

Language: Ruby - Size: 504 KB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 470 - Forks: 175

tomwatkins1994/go-docx-template

A simple Go library for merging docx files with data.

Language: Go - Size: 266 KB - Last synced at: 6 days ago - Pushed at: 15 days ago - Stars: 4 - Forks: 1

Hufe921/canvas-editor-plugin

plugins for canvas-editor

Language: TypeScript - Size: 731 KB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 110 - Forks: 42

phlowerteam/act_as_page_extractor

A library that extracts plain text from documents for subsequent processing, such as indexing and search.

Language: Rich Text Format - Size: 368 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4 - Forks: 0

smnandre/pandoc

Pandoc PHP - Advanced Document Converter

Language: PHP - Size: 82 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 2 - Forks: 0

tristan-mcinnis/qualitative-insight-engine

AI-powered qualitative research analysis pipeline with GPT-5 Nano and Pinecone vector storage for automated topic analysis and report generation

Language: TypeScript - Size: 441 KB - Last synced at: 7 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

kalcaddle/kodbox

kodbox is a file manager for the web. It is a newly designed product based on kodexplorer. It is also a web code editor, which allows you to develop websites directly within the web browser. You can run kodbox either online or locally, on Linux, Windows or Mac based platforms

Language: PHP - Size: 207 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 2,648 - Forks: 429

lacuna-technologies/docdocgoose

Edit documents directly in your browser. Remove editing or highlighting restrictions and unlock track changes in your documents.

Language: TypeScript - Size: 354 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 3

dfop02/html4docx

Convert html to docx

Language: Python - Size: 160 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 37 - Forks: 7

maehr/academic-pandoc-template

Write beautifully typeset academic texts with distraction-free Markdown and Pandoc.

Language: HTML - Size: 41 MB - Last synced at: 15 days ago - Pushed at: 17 days ago - Stars: 257 - Forks: 47

zurmokeeper/officecrypto-tool

officecrypto-tool is a JavaScript library that can be used to decrypt and encrypt Office (Excel/PPT/Word) files.

Language: JavaScript - Size: 1.08 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 21 - Forks: 5

Anathi-C/Backup-Linux

Effortlessly back up your Linux files with PyBackup. This open-source tool offers encryption, integrity checks, and a user-friendly CLI. 🌐🐙

Language: Python - Size: 48.8 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

HoangAnhTut/thebat_parser

This repository contains a Python script that converts `.eml` email files into structured `.docx` documents. It uses BeautifulSoup for HTML parsing and python-docx for document creation, making email data easy to read and access. 🐱💻📧

Language: Python - Size: 11.7 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

Taha5125/DocxWriter-JSON

DocxWriter is a Python library for generating professional Word documents from JSON. Automate reports, add tables, lists, images, and apply custom styles — all from clean, structured data.

Language: Python - Size: 23.4 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

ThaSMorato/docx-parser

A modern JavaScript library for parsing and processing Microsoft Word DOCX documents with support for both buffer and stream operations. Features incremental parsing, checkbox detection, footnote support, and document validation.

Language: TypeScript - Size: 383 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

sussybakala/HackRx-6.0-Intelligent-Query-Retrieval

Language: Python - Size: 14.6 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

orcastor/addon-previewer

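🖼 Multimodal document preview plugin (fully local edition, with better rendering quality and compatibility, adaptable to different storage). Cross-platform preview support for most file types, including Office (docx/xlsx/pptx), WPS, iWork, PDF, CAD, code documents, images, video, audio, and archives.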

Language: Go - Size: 540 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 7

kevv1m/tikara

The metadata and text content extractor for almost every file type.

Size: 1000 Bytes - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

BenedicteGiraud/docx-merge-js

A fast and lightweight Node.js library written in TypeScript for merging two Microsoft Word (.docx) documents into one. Easily insert content at specific positions or based on placeholder patterns.

Language: TypeScript - Size: 85 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 1

88250/lute-docx

📝 A small tool for converting Markdown text to Word documents (.docx).

Language: Go - Size: 348 KB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 46 - Forks: 8

kotwys/parsedict

Parse .docx files that are structured by formatting

Language: Python - Size: 28.3 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

keskinonur/doxx-go

A Go port of bgreenwell/doxx - A lightning-fast, terminal-native document viewer for Microsoft Word files.

Language: Go - Size: 674 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

smartinmedia/Net-Core-DocX-HTML-To-PDF-Converter

.NET Core library to create custom reports based on Word docx or HTML documents and convert to PDF

Language: C# - Size: 1.94 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 334 - Forks: 81

sajari/docconv

Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

Language: Go - Size: 1.62 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 1,720 - Forks: 239

ag2307/CSV-GPT

Document Question Answering Chatbot

Language: Python - Size: 8.39 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

luuducly/WordTemplater

A useful cross-platform library for exporting Word templates with formatting, repeating data, images, QR codes...

Language: C# - Size: 2.03 MB - Last synced at: 22 days ago - Pushed at: 6 months ago - Stars: 15 - Forks: 2

Drntth/rag-ai-assistant

RAG AI Assistant is a modular system for advanced document-based Q&A. It uses a vector database (PostgreSQL + pgvector) for fast, context-aware search and supports multiple chat/embedding models. A document pipeline cleans and converts DOCX/TXT files for embedding, but the main focus is on AI-powered question answering.

Language: Python - Size: 37.1 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

dead8309/markitdown-ts

Convert various file formats to Markdown for indexing, text analysis, and other applications that benefit from structured text. TS port of the Python library.

Language: HTML - Size: 5.3 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 60 - Forks: 2

neka-nat/mineru-api

MinerU API server

Language: Python - Size: 7.16 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 67 - Forks: 23

unidoc/unioffice

Pure Go library for creating and processing Office Word (.docx), Excel (.xlsx) and PowerPoint (.pptx) documents

Language: Go - Size: 156 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 4,646 - Forks: 487

transpect/docx2tex

Converts Microsoft Word docx to LaTeX

Language: XSLT - Size: 1.07 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 585 - Forks: 53