Pure Python HWPX automation: read, edit, generate, and validate documents without Hancom Office.
-
Updated
May 6, 2026 - Python
Pure Python HWPX automation: read, edit, generate, and validate documents without Hancom Office.
OfficeCLI is AI document generation CLI for PPTX, DOCX, XLSX, Reports, and Images. Generate editable Office files from prompts with npm install, hosted trial, and optional agent skills.
📄 Professional MCP server for converting 29+ file formats to Markdown - Perfect for Claude Desktop and AI workflows!
Document metrics, structure extraction, and code exploration for real repositories
🔍 FastAPI-powered document text extraction service supporting PDF, images, and Microsoft Office files with OCR capabilities. Extract text from multiple document formats through simple REST API endpoints. Docker-ready with Tesseract OCR integration.
This tool can be used to view details of Office Open Xml formatted files (Word, Excel, PowerPoint) for troubleshooting purposes...Click here to download the tool:
📄 Convert 29+ file formats to clean Markdown using the Model Context Protocol for seamless integration with AI workflows.
Convert complex Excel files into AI-readable JSON/HTML
Read, fill, and edit Korean HWP (Hancom Office) documents in Python. Extract text for LLM / RAG pipelines, fill government & university forms programmatically, and rewrite the binary without corrupting it.
A powerful, privacy-focused web application for side-by-side comparison of Word documents with intelligent diff highlighting, comprehensive analytics, and multilingual support including Arabic and RTL languages.
Python desktop app for converting documents and images to PDF. Supports multiple formats, file merging, and cross-platform operation with LibreOffice integration.
Fork of microsoft/markitdown: Python tool for converting files and Office documents to Markdown for LLM workflows.
A batch tool to validate and prepare files for strict parsers like CKFinder. It passes regular files through while actively detecting and structurally rebuilding corrupted PDFs.
政企会议纪要skill
Windows-first OpenClaw skill for document archive and content search across Office and PDF folders.
MCP server for creating, reading, editing and converting Office documents (docx, xlsx, pptx) — 77 async tools with version history, charts, spell-check, find/replace, and security layers
Windows-first document accessibility checker for PDF, Office, HTML, Markdown, text, and CSV files, with Section 508/WCAG-oriented reports.
Python alpha tool for preliminary technical analysis of Microsoft Office metadata, with forensic-aware documentation and synthetic validation tests.
Add a description, image, and links to the office-documents topic page so that developers can more easily learn about it.
To associate your repository with the office-documents topic, visit your repo's landing page and select "manage topics."