markitdown

Name: markitdown
Author: K-Dense-AI

by K-Dense-AI

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: K-Dense-AI
Category: Frontend
Views: 173

GitHub repo

About this skill

Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when converting documents to markdown, extracting text from PDFs/Office files, transcribing audio, performing OCR on images, extracting YouTube transcripts, or processing batches of files. Supports 20+ formats including DOCX, XLSX, PPTX, PDF, HTML, EPUB, CSV, JSON, images with OCR, and audio with transcription.

How to use

Zainstaluj MarkItDown jako zależność w swoim projekcie Python. Narzędzie wymaga Pythona 3.8+ i jest dostępne w repozytorium GitHub (microsoft/markitdown). 2. Przygotuj plik do konwersji — może to być dokument Office (DOCX, XLSX, PPTX), PDF, obraz (PNG, JPG, GIF), plik audio (MP3, WAV), HTML, CSV, JSON, XML, EPUB lub link do YouTube'a. 3. Uruchom konwersję za pomocą skryptu lub API MarkItDown, podając ścieżkę do pliku wejściowego. Narzędzie automatycznie wykryje format i zastosuje odpowiednią metodę przetwarzania. 4. Dla obrazów zawierających tekst lub skanów dokumentów aktywuj OCR — MarkItDown wyodrębni tekst i strukturę. Dla plików audio narzędzie przeprowadzi transkrypcję do tekstu. 5. Otrzymasz wynik w formacie Markdown, gotowy do bezpośredniego użytku w promptach dla modeli AI lub jako źródło do dalszej edycji. 6. W przypadku przetwarzania wielu plików możesz zautomatyzować proces, przetwarzając całe foldery lub archiwa ZIP — MarkItDown obsługuje przetwarzanie wsadowe.

Related skills

2d-games

by davila7

2D game development principles. Sprites, tilemaps, physics, camera.

Frontend

2674

keyword-research

by openclaw

Discovers high-value keywords with search intent analysis, difficulty assessment, and content opportunity mapping. Essential for starting any SEO or GEO content strategy.

Frontend

24138

browser-automation

by browserbase

Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications. Triggers include \

Frontend

21175

frontend-ui-ux

by code-yeongyu

Designer-turned-developer who crafts stunning UI/UX even without design mockups

Frontend

1884

jimeng-mcp-skill

by wwwzhouhui

使用jimeng-mcp-server进行AI图像和视频生成。当用户请求从文本生成图像、合成多张图片、从文本描述创建视频或为静态图像添加动画时使用此技能。支持四大核心能力：文生图、图像合成、文生视频、图生视频。需要jimeng-mcp-server在本地运行或通过SSE/HTTP访问。

Frontend

17126

shadcn-ui-setup

by maneeshanif

Install and configure Shadcn/ui component library with Radix UI primitives, Aceternity UI effects, set up components, and manage the component registry. Use when adding Shadcn/ui to a Next.js project or installing specific UI components for Phase 2.

Frontend

23167