ocr

Name: ocr
Author: trpc-group

Extract text from images using Tesseract OCR

Installation

Pick a client and clone the repository into its skills directory.

Quick info

Author: trpc-group
Category: Data Science
Views: 44

How to use

Install the required dependencies: Python 3.8 or newer, the pytesseract library, Pillow (PIL), and the tesseract-ocr package for your operating system. On most Linux systems, installing the tesseract-ocr package through the package manager is enough; on macOS, use Homebrew; on Windows, download the installer from the official Tesseract project site.
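
The prerequisites listed above can be probed from Python before running the skill. This is a minimal sketch using only the standard library. The helper name check_dependencies is hypothetical and not part of the skill, and the sketch assumes the Tesseract executable is available on PATH under the name tesseract.

```python
import importlib.util
import shutil
import sys


def check_dependencies():
    """Report which OCR prerequisites appear to be available.

    Hypothetical helper for illustration; not part of the skill itself.
    """
    return {
        # The skill asks for Python 3.8 or newer.
        "python>=3.8": sys.version_info >= (3, 8),
        # find_spec returns None when a top-level package is not installed.
        "pytesseract": importlib.util.find_spec("pytesseract") is not None,
        "Pillow": importlib.util.find_spec("PIL") is not None,
        # Assumption: the Tesseract executable is named "tesseract" on PATH.
        "tesseract": shutil.which("tesseract") is not None,
    }


if __name__ == "__main__":
    for name, available in check_dependencies().items():
        print(f"{name}: {'found' if available else 'missing'}")
```
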
Prepare the image to process. The skill supports the PNG, JPG, JPEG, GIF, BMP, TIFF, and WEBP formats. You can work with local files or supply the URL of an image on the web.
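
An input can be screened against the format list before it is handed to the skill. The sketch below is an assumption about how such a check might look, not the skill's own validation: the names SUPPORTED_EXTENSIONS, is_url, and is_supported_image are hypothetical, and matching on the file extension alone is a simplification.

```python
import os
from urllib.parse import urlparse

# Extensions taken from the format list above; anything else is rejected.
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".gif", ".bmp", ".tiff", ".webp"}


def is_url(source):
    """Return True when the source looks like an http(s) URL."""
    return urlparse(source).scheme in ("http", "https")


def is_supported_image(source):
    """Check the extension of a local path or of a URL's path component."""
    # For URLs, ignore the query string and fragment when reading the extension.
    path = urlparse(source).path if is_url(source) else source
    extension = os.path.splitext(path)[1].lower()
    return extension in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for candidate in ("scan.PNG", "report.pdf", "https://example.com/page.webp?size=large"):
        print(candidate, is_supported_image(candidate))
```
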
Run the OCR script with the basic arguments: python3 scripts/ocr.py path_to_image.png output_file.txt. By default, recognition runs in English and the result is saved as plain text.
If the image contains text in another language, add the --lang option with a language code, for example --lang chi_sim for Simplified Chinese or --lang jpn for Japanese. You can also combine several language codes with a plus sign: --lang eng+chi_sim.
To improve recognition accuracy on low-quality images, add the --preprocess flag, which automatically applies preprocessing such as grayscale conversion and thresholding.
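
To make the two named preprocessing steps concrete, here is a pure-Python sketch that works on nested lists of RGB tuples. It does not reproduce how the skill implements --preprocess, which the description does not state. The luminance weights (0.299, 0.587, 0.114) and the fixed threshold of 128 are assumptions chosen for illustration, and the function names are hypothetical.

```python
def to_grayscale(pixels):
    """Convert rows of (R, G, B) tuples to rows of 0-255 luminance values."""
    # Integer form of the weights 0.299 R + 0.587 G + 0.114 B (an assumption).
    return [
        [(r * 299 + g * 587 + b * 114) // 1000 for (r, g, b) in row]
        for row in pixels
    ]


def binarize(gray, threshold=128):
    """Map each luminance value to 0 (black) or 255 (white)."""
    # Assumption: a single global threshold; real pipelines may adapt it per image.
    return [[255 if value >= threshold else 0 for value in row] for row in gray]


if __name__ == "__main__":
    image = [
        [(255, 255, 255), (200, 100, 50)],
        [(30, 30, 30), (0, 0, 0)],
    ]
    print(binarize(to_grayscale(image)))  # prints [[255, 0], [0, 0]]
```

Thresholding reduces each pixel to pure black or white, which tends to help when the scan has uneven lighting or a faint background.
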
If you need JSON output with a confidence score for each word, add the --format json option. For images at a URL, use the ocr_url.py script instead of ocr.py and pass the full URL as the first argument.
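
The options described in these steps can be assembled into one command line programmatically. The following is a sketch under stated assumptions: build_ocr_command is a hypothetical helper, not part of the skill, and it assumes the flags may follow the two positional arguments. Only the script path and flags named in the steps (scripts/ocr.py, --lang, --preprocess, --format json) are used.

```python
import shlex


def build_ocr_command(image, output, langs=None, preprocess=False, as_json=False):
    """Build the argument list for scripts/ocr.py (hypothetical helper)."""
    command = ["python3", "scripts/ocr.py", image, output]
    if langs:
        # Several language codes are joined with a plus sign, e.g. eng+chi_sim.
        command += ["--lang", "+".join(langs)]
    if preprocess:
        command.append("--preprocess")
    if as_json:
        command += ["--format", "json"]
    return command


if __name__ == "__main__":
    command = build_ocr_command(
        "scan.png", "scan.json", langs=["eng", "chi_sim"], preprocess=True, as_json=True
    )
    # Show the command as it would be typed in a shell.
    print(shlex.join(command))
```

The returned list can be passed directly to subprocess.run(command, check=True), which avoids shell quoting problems with paths that contain spaces.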

Related skills

xlsx

by anthropics

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2)

Data Science

40128

pptx

by anthropics

Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks

Data Science

134310

deepwiki-rs

by sopaco

AI-powered Rust documentation generation engine for comprehensive codebase analysis, C4 architecture diagrams, and automated technical documentation. Use when Claude needs to analyze source code, understand software architecture, generate technical specs, or create professional

Data Science

18144

nano-banana-pro

by garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., \

Data Science

535772

claude-automation-recommender

by anthropics

Analyze a codebase and recommend Claude Code automations (hooks, subagents, skills, plugins, MCP servers). Use when user asks for automation recommendations, wants to optimize their Claude Code setup, mentions improving Claude Code workflows, asks how to first set up Claude Code

Data Science

1787

threejs

by mrgoonie

Build 3D web apps with Three.js (WebGL/WebGPU). Use for 3D scenes, animations, custom shaders, PBR materials, VR/XR experiences, games, data visualizations, product configurators.

Data Science

1743