kreuzberg

Name: kreuzberg
Author: kreuzberg-dev

Extract text, tables, metadata, and images from 75+ document formats (PDF, Office, images, HTML, email, archives, academic) using Kreuzberg. Use when writing code that calls Kreuzberg APIs in Python, Node.js/TypeScript, Rust, or CLI. Covers installation, extraction (sync/async),

Installation

Pick a client and clone the repository into its skills directory.

Quick info

Category: Security

About this skill

How to use

1. Install Kreuzberg for your programming language: in Python, run pip install kreuzberg, optionally adding an OCR backend (pip install kreuzberg[easyocr]); in Node.js, run npm install @kreuzberg/node; in Rust, add the kreuzberg dependency at version 4 to Cargo.toml with the features you need (e.g. tokio-runtime for synchronous and batch processing).
2. Prepare the document to process. Supported inputs are PDF files, Office documents, images, HTML, emails, and archives.
3. Write code that calls the Kreuzberg API: in Python, use extract_file() with the file path as its argument; in Node.js, call the library's extractor function in the same way.
4. Configure extraction options as needed: enable OCR for scans, set the output format, and configure text chunking or language detection.
5. Handle the result: the extracted text is available as result.content, and metadata and images are returned in the result structure.
6. To process many files, use the library's batch-processing function, optionally implementing custom post-processor or validator plugins.
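The steps above can be sketched in Python roughly as follows. This is a minimal illustration, not the library's documented usage: the text above only names extract_file() and result.content, so the sketch assumes extract_file is an async function that accepts a file path, and the helpers collect_text and main are hypothetical names introduced here. It loops over files one at a time rather than using the batch function, whose name the text does not give.

```python
import asyncio


async def collect_text(paths, extract):
    """Run `extract` on each path; map path -> extracted text, or None on failure."""
    texts = {}
    for path in paths:
        try:
            result = await extract(path)
            # The steps above say the extracted text lives in result.content.
            texts[str(path)] = result.content
        except Exception as exc:
            # One unreadable file should not stop the remaining files.
            print(f"skipped {path}: {exc}")
            texts[str(path)] = None
    return texts


def main(paths):
    # Imported lazily so collect_text stays usable without the package installed.
    # Assumption: kreuzberg.extract_file is async and takes a file path.
    from kreuzberg import extract_file

    return asyncio.run(collect_text(paths, extract_file))
```

Calling main(["report.pdf", "scan.png"]) would then return a dictionary of extracted text keyed by path, with None for any file that failed. Passing the extractor in as a parameter keeps the loop testable with a stand-in function and makes it easy to swap in a configured or batch extractor later.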
