LLMS Central - The Robots.txt for AI

artifex.com

Last updated: 7/3/2026 (valid)

Independent Directory - Important Information

This llms.txt file was publicly accessible and retrieved from artifex.com. LLMS Central does not claim ownership of this content and hosts it for informational purposes only to help AI systems discover and respect website policies.

This listing is not endorsed or sponsored by artifex.com. We are an independent directory service with no affiliation with the listed domain.

Copyright & Terms: Users should respect the original terms of service of artifex.com. If you believe there is a copyright or terms of service violation, please contact us at support@llmscentral.com for prompt removal. Domain owners can also claim their listing.

Current llms.txt Content

# Artifex Software

> Artifex Software builds open-source and commercially licensed document processing technology, including Ghostscript (PostScript/PDF interpreter), MuPDF (lightweight PDF SDK), PyMuPDF (Python bindings), and PyMuPDF4LLM (PDF-to-Markdown extraction for RAG and LLM pipelines). Founded in 1989, Artifex powers document workflows for over 40,000 customers and 1B+ deployments worldwide.

Artifex's core technology stack covers four product families:
- **PyMuPDF** family — Python-first PDF and document processing, including PyMuPDF4LLM (LLM/RAG-optimized extraction), PyMuPDF Pro (Office formats), and PyMuPDF Layout (GNN-based structural analysis).
- **MuPDF** family — fast, lightweight PDF rendering SDK with Core (C/C++), .NET, and JavaScript bindings.
- **Ghostscript** family — Page Description Language (PDL) conversion engine with .NET binding and an Enterprise edition that handles Microsoft Office formats.
- **SaaS APIs** — PDF.co for low-code PDF automation and DocRaptor for HTML-to-PDF conversion.

All open-source projects are AGPL-licensed with commercial licensing available for proprietary use.

## Products

- [Products overview](https://artifex.com/products): Full product catalog across all four families.
- [PyMuPDF](https://pymupdf.io/): High-performance Python library for PDF reading, extraction, and manipulation.
- [PyMuPDF4LLM](https://pymupdf.io/4llm): PDF-to-Markdown/JSON/text extraction purpose-built for LLM and RAG pipelines with CPU-only AI layout analysis using a Graph Neural Network on PDF structure (no image rendering required).
- [PyMuPDF Pro](https://pymupdf.io/pro): PyMuPDF plus Microsoft Office (DOC/DOCX, PPT/PPTX, XLS/XLSX) and Hangul (HWP/HWPX) support.
- [MuPDF](https://mupdf.com/): Core PDF rendering and manipulation SDK in C/C++.
- [MuPDF.NET](https://mupdfnet.readthedocs.io/en/latest/): MuPDF for .NET applications, distributed via NuGet.
- [MuPDF.js](https://mupdfjs.readthedocs.io/en/latest/): MuPDF for browser and Node.js environments.
- [MuPDF WebViewer](https://webviewer.mupdf.com): Embeddable browser-based PDF viewer built on the MuPDF engine. AI Ready. Embed in minutes.
- [Ghostscript](https://ghostscript.com/): PostScript and PDF interpreter with rendering engine and graphics library.
- [Ghostscript Enterprise](https://ghostscript.readthedocs.io/en/latest/Ghostscript-Enterprise.html): Commercial Ghostscript that processes Microsoft Office documents.
- [PDF.co](https://pdf.co/): Low-code REST API for PDF conversion, editing, extraction, and parsing; integrates with n8n, Zapier, and Make.
- [DocRaptor](https://docraptor.com/): HTML-to-PDF conversion API powered by the Prince rendering engine.

## Documentation

- [PyMuPDF docs](https://pymupdf.readthedocs.io/): Full API reference and guides for the Python library.
- [PyMuPDF4LLM docs](https://docs.pdf4llm.com/): Installation, quickstart, guides, and API reference for both Python and .NET.
- [MuPDF docs](https://mupdf.readthedocs.io/): Quick start guide and reference for the core C/C++ SDK.
- [MuPDF.NET docs](https://mupdfnet.readthedocs.io/en/latest/): API reference for the .NET binding.
- [MuPDF.js docs](https://mupdfjs.readthedocs.io/en/latest/): API reference for the JavaScript binding.
- [MuPDF WebViewer docs](https://webviewer-docs.mupdf.com): Guide and API for using WebViewer.
- [Ghostscript docs](https://ghostscript.readthedocs.io/en/latest/Readme.html): Full documentation for the Ghostscript interpreter.
- [Developer hub](https://artifex.com/developers): Central landing page for developer resources across products.

## Package distributions

- [PyMuPDF on PyPI](https://pypi.org/project/PyMuPDF/): `pip install PyMuPDF`
- [PyMuPDF4LLM on PyPI](https://pypi.org/project/pymupdf4llm/): `pip install pymupdf4llm`
- [PDF4LLM on NuGet](https://www.nuget.org/packages/PDF4LLM/): .NET package for PDF4LLM.
- [MuPDF.js on NPM](https://www.npmjs.com/package/mupdf): `npm i mupdf`
- [MuPDF WebViewer on NPM](https://www.npmjs.com/package/mupdf-webviewer): `npm i mupdf-webviewer`
- [Ghostscript.NET on NuGet](https://www.nuget.org/packages/Ghostscript.NET): .NET binding for Ghostscript.
- [MuPDF releases](https://mupdf.com/releases): Source and binary releases of MuPDF.
- [Ghostscript releases](https://ghostscript.com/releases): Source and binary releases of Ghostscript.
- [Downloads](https://artifex.com/downloads): Centralized downloads page.

## Solutions

- [RAG/LLM solutions](https://artifex.com/solutions/llm): How Artifex tooling fits into retrieval-augmented generation pipelines.
- [Document Management](https://artifex.com/solutions/document-management): Document processing for enterprise content systems.
- [Document Rendering](https://artifex.com/solutions/document-rendering): High-fidelity rendering for viewers, printers, and apps.
- [PDF Toolkit](https://artifex.com/solutions/pdf-toolkit): General-purpose PDF manipulation use cases.
- [SaaS](https://artifex.com/solutions/saas): Embedding Artifex tech in SaaS products.
- [Automation](https://artifex.com/solutions/automation): Document workflow automation.
- [Printers and MFPs](https://artifex.com/solutions/printers): Embedded use in printing hardware.
- [RIP](https://artifex.com/solutions/rip): Raster Image Processor applications.
- [Cloud Computing](https://artifex.com/solutions/cloud): Cloud-deployed document processing.
- [Mobile](https://artifex.com/solutions/mobile): Mobile PDF rendering and manipulation.

## Licensing and company

- [Licensing](https://artifex.com/licensing): Commercial licensing options for AGPL-licensed products.
- [Contact](https://artifex.com/contact): Contact form for sales and support inquiries.
- [Support](https://artifex.com/support): Support resources and channels.
- [Legal](https://artifex.com/legal): Legal information and policies.
- [Privacy Policy](https://artifex.com/legal/privacy): Privacy policy.

## Optional

- [Blog](https://artifex.com/blog): Technical articles, tutorials, and product announcements.
- [Building a Multimodal LLM Application with PyMuPDF4LLM](https://artifex.com/blog/building-a-multimodal-llm-application-with-pymupdf4llm): Tutorial covering text extraction for LLM pipelines.
- [RAG/LLM and PDF: Conversion to Markdown with PyMuPDF](https://artifex.com/blog/rag-llm-and-pdf-conversion-to-markdown-text-with-pymupdf): Why markdown matters for RAG and how to produce it.
- [Building a RAG Chatbot GUI with the ChatGPT API and PyMuPDF](https://artifex.com/blog/building-a-rag-chatbot-gui-with-the-chatgpt-api-and-pymupdf): End-to-end RAG chatbot tutorial.
- [Extracting Text from Multi-Column Pages: A Practical PyMuPDF Guide](https://artifex.com/blog/extracting-text-from-multi-column-pages-a-practical-pymupdf-guide): Handling complex layouts.
- [SOC 2 Type II compliance announcement](https://artifex.com/blog/celebrating-our-achievement-achieving-soc-2-type-ii-compliance): Compliance milestone.
- [Community forum](https://forum.mupdf.com): MuPDF and PyMuPDF user forum.
- [Discord](https://pymupdf.pro/discord/4llm): PyMuPDF4LLM community Discord.
- [GitHub: PyMuPDF](https://github.com/pymupdf/PyMuPDF): Source repository for PyMuPDF.
- [GitHub: Artifex Software](https://github.com/ArtifexSoftware): Organization-level GitHub presence.
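
The llms.txt content above follows a simple convention: an H1 title, a blockquote summary, H2 sections, and bullet links of the form `- [title](url): description`. As a rough illustration of how a consumer might read that structure, here is a minimal Python sketch that groups the bullet links by section. The function name `parse_llms_txt` and the regular expression are assumptions made for this example; they are not part of any Artifex or LLMS Central tooling, and the sample text is a shortened excerpt of the file above.

```python
import re
from collections import defaultdict

# Matches bullet links such as "- [Title](https://example.com): description".
# The description part is optional.
LINK_RE = re.compile(
    r"^- \[(?P<title>[^\]]+)\]\((?P<url>[^)\s]+)\)(?::\s*(?P<desc>.*))?$"
)


def parse_llms_txt(text):
    """Group the bullet links of an llms.txt document by their H2 section.

    Hypothetical helper for illustration only. Links that appear before
    the first "## " heading are ignored.
    """
    sections = defaultdict(list)
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("## "):
            current = line[3:].strip()
        elif current is not None:
            match = LINK_RE.match(line)
            if match:
                sections[current].append(
                    (match["title"], match["url"], match["desc"] or "")
                )
    return dict(sections)


# Shortened excerpt of the llms.txt content shown above.
sample = """# Artifex Software

## Products

- [MuPDF](https://mupdf.com/): Core PDF rendering and manipulation SDK in C/C++.
- [Ghostscript](https://ghostscript.com/): PostScript and PDF interpreter with rendering engine and graphics library.

## Package distributions

- [PyMuPDF on PyPI](https://pypi.org/project/PyMuPDF/): `pip install PyMuPDF`
"""

links = parse_llms_txt(sample)
for section, entries in links.items():
    print(section, len(entries))
```

A line-based parser like this is enough for the flat link lists used here; a file with nested lists or multi-line descriptions would need a real Markdown parser instead.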

Version History

Version 1, 7/3/2026, 6:02:24 AM (valid)
7262 bytes

Categories

blog, documentation, docs, technology

Content Types

articles, posts, pages, products, api, documentation, tutorials, guides

API Access

Canonical URL: https://llmscentral.com/artifex.com/llms.txt
API Endpoint: /api/llms?domain=artifex.com
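
The canonical URL and API endpoint listed above share an obvious pattern, so a client could build both from a domain name. The following Python sketch shows one way to do that with the standard library. It assumes the relative endpoint `/api/llms` resolves against the same host as the canonical URL (`https://llmscentral.com`); the helper names are hypothetical, and the response format of the endpoint is not documented on this page, so the sketch stops at URL construction and makes no request.

```python
from urllib.parse import urlencode, urljoin

# Host taken from the canonical URL listed above.
BASE_URL = "https://llmscentral.com"


def canonical_url(domain):
    """Return the hosted llms.txt URL for a domain (hypothetical helper)."""
    return f"{BASE_URL}/{domain}/llms.txt"


def api_url(domain):
    """Return the API endpoint URL for a domain (hypothetical helper).

    Assumption: the relative path /api/llms lives on BASE_URL.
    """
    return urljoin(BASE_URL, "/api/llms") + "?" + urlencode({"domain": domain})


print(canonical_url("artifex.com"))
print(api_url("artifex.com"))
```

Using `urlencode` rather than string concatenation keeps the query string valid if a domain value ever contains characters that need escaping.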