Tools
kreuzberg
Polyglot document intelligence framework with a Rust core. Extracts text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Provides bindings for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno) and can be used via CLI, REST API, or MCP server.
Why use it
Polyglot document intelligence framework with a Rust core. Extracts text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Provides bindings for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno) and can be used via CLI, REST API, or MCP server.
Best for
beginner
Deployment options
compose · docker
Resource requirements
server
Alternative to
No mapping yet
Common setup stack
Reverse proxy · HTTPS certs · auth gateway · backups