openrouter-transcribe

Name: openrouter-transcribe
Author: openclaw

by openclaw

Transcribe audio files via OpenRouter using audio-capable models (Gemini, GPT-4o-audio, etc).

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: openclaw
Category: Data Science
Views: 12

GitHub repo

About this skill

Transcribe audio files via OpenRouter using audio-capable models (Gemini, GPT-4o-audio, etc).

How to use

Ustaw zmienną środowiskową OPENROUTER_API_KEY na Twój klucz API OpenRouter, lub skonfiguruj go w pliku ~/.clawdbot/clawdbot.json w sekcji skills.openrouter-transcribe.apiKey.
Upewnij się, że masz zainstalowane wymagane narzędzia: ffmpeg, curl, base64 i jq. Są one niezbędne do konwersji audio, kodowania i komunikacji z API.
Uruchom podstawową transkrypcję, podając ścieżkę do pliku audio: {baseDir}/scripts/transcribe.sh /ścieżka/do/audio.m4a. Wynik pojawi się w standardowym wyjściu (stdout).
Aby użyć inny model, dodaj flagę --model, na przykład: {baseDir}/scripts/transcribe.sh audio.ogg --model openai/gpt-4o-audio-preview. Domyślnie używany jest google/gemini-2.5-flash.
Jeśli chcesz dostosować instrukcje transkrypcji, użyj flagi --prompt: {baseDir}/scripts/transcribe.sh audio.m4a --prompt "Transkrybuj ze wskazaniem mówców". Aby zapisać wynik do pliku zamiast wyświetlać go na ekranie, dodaj flagę --out: {baseDir}/scripts/transcribe.sh audio.m4a --out /tmp/transkrypcja.txt.
Opcjonalnie możesz dodać flagę --title, aby ustawić niestandardowy identyfikator w panelu OpenRouter: {baseDir}/scripts/transcribe.sh audio.m4a --title "MojaAplikacja". Skrypt automatycznie konwertuje audio do WAV (mono, 16 kHz), koduje je w base64 i wysyła do OpenRouter, a następnie wyodrębnia transkrypcję z odpowiedzi.

Related skills

openrouter

by rawveg

OpenRouter API - Unified access to 400+ AI models through one API

Data Science

17138

xlsx

by anthropics

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2)

Data Science

40128

nano-banana-pro

by garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., \

Data Science

535772

infographic-creation

by antvis

Create beautiful infographics based on the given text content. Use this when users request creating infographics.

Data Science

60199

data-storytelling

by wshobson

Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.

Data Science

26105

notebooklm

by leegonzales

Query Google NotebookLM for source-grounded, citation-backed answers from uploaded documents. Reduces hallucinations through Gemini's document-only responses. Browser automation with library management and persistent authentication.

Data Science

142112