voice-transcribe

Name: voice-transcribe
Author: openclaw

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: openclaw
Category: Data Science
Views: 6

GitHub repo

About this skill

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

How to use

Zainstaluj narzędzie uv, jeśli jeszcze go nie masz, zgodnie z instrukcją na https://docs.astral.sh/uv/.
Pobierz skill voice-transcribe z repozytorium https://github.com/openclaw/skills/tree/main/skills/darinkishore/voice-transcribe i umieść go w swoim katalogu projektów.
W pliku .env w katalogu voice-transcribe dodaj swój klucz API OpenAI: OPENAI_API_KEY=sk-... (uzyskaj klucz z panelu OpenAI).
Aby transkrybować plik audio, uruchom komendę: uv run transcribe /ścieżka/do/pliku.mp3 (zastąp ścieżkę rzeczywistą lokalizacją pliku). Obsługiwane formaty to mp3, mp4, m4a, wav, webm, ogg i opus.
Jeśli AI źle transkrybuje określone słowa, dodaj je do pliku vocab.txt (po jednym słowie w linii), aby dać modelowi wskazówkę — przydatne dla nazw własnych i terminów specjalistycznych. Dla gwarantowanego poprawienia błędu dodaj regułę do replacements.txt w formacie: błędny tekst -> poprawny tekst.
Wynik transkrypcji możesz przekierować do schowka lub innego narzędzia, np. uv run transcribe /tmp/memo.ogg | pbcopy.

Related skills

docx

by anthropics

Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction. When Claude needs to work with professional documents (.docx files) for: (1) Creating new documents, (2) Modifying or editing content,

Data Science

39142

a-stock-analysis

by openclaw

A股实时行情与分时量能分析。获取沪深股票实时价格、涨跌、成交量，分析分时量能分布（早盘/尾盘放量）、主力动向（抢筹/出货信号）、涨停封单。支持持仓管理和盈亏分析。Use when: (1) 查询A股实时行情, (2) 分析主力资金动向, (3) 查看分时成交量分布, (4) 管理股票持仓, (5) 分析持仓盈亏。

Data Science

48153

openrouter

by rawveg

OpenRouter API - Unified access to 400+ AI models through one API

Data Science

17138

pdf-processing

by Ming-Kai-LC

Comprehensive PDF processing techniques for handling large files that exceed Claude Code's reading limits, including chunking strategies, text/table extraction, and OCR for scanned documents. Use when working with PDFs larger than 10-15MB or more than 30-50 pages.

Data Science

23134

prompt-optimizer

by solatis

Optimize system prompts for Claude Code agents using proven prompt engineering patterns. Use when users request prompt improvement, optimization, or refinement for agent workflows, tool instructions, or system behaviors.

Data Science

15109

skill-creator

by anthropics

Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.

Data Science

59147