Toolverse
All skills

video-processor

by basher83

Process video files with audio extraction, format conversion (mp4, webm), and Whisper

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author
basher83
Category
Data Science
Views
112

About this skill

Process video files with audio extraction, format conversion (mp4, webm), and Whisper

How to use

  1. Zainstaluj wymagane narzędzia systemowe. Na macOS uruchom brew install ffmpeg. Na Ubuntu lub Debian użyj apt-get install ffmpeg. Sprawdź poprawność instalacji poleceniem ffmpeg -version.
  2. Zainstaluj model Whisper do transkrypcji. Uruchom pip install -U openai-whisper i zweryfikuj instalację poleceniem whisper --help.
  3. Aby wyodrębnić audio z pliku wideo, użyj komendy uv run .claude/skills/video-processor/scripts/video_processor.py extract-audio input.mp4 output.wav. Możesz zmienić format wyjściowy (domyślnie wav) na mp3, aac lub flac za pomocą opcji --format.
  4. Do konwersji wideo do formatu MP4 uruchom uv run .claude/skills/video-processor/scripts/video_processor.py to-mp4 input.avi output.mp4. Opcjonalnie dostosuj kodek wideo (domyślnie libx264) lub preset jakości (domyślnie medium) parametrami --codec i --preset.
  5. Aby przekonwertować wideo do formatu WebM, użyj komendy uv run .claude/skills/video-processor/scripts/video_processor.py to-webm input.mp4 output.webm z tymi samymi opcjami dostrajania kodeka i presetu.
  6. Po wyodrębnieniu audio możesz transkrybować zawartość poleceniem whisper output.wav, które wygeneruje plik tekstowy z transkryptem.

Related skills

nano-banana-pro

by garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., \

Data Science
535772

pdf-processing

by Ming-Kai-LC

Comprehensive PDF processing techniques for handling large files that exceed Claude Code's reading limits, including chunking strategies, text/table extraction, and OCR for scanned documents. Use when working with PDFs larger than 10-15MB or more than 30-50 pages.

Data Science
23134

pptx

by anthropics

Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks

Data Science
134310

moon-dev-trading-agents

by moondevonyt

Master Moon Dev's Ai Agents Github with 48+ specialized agents, multi-exchange support, LLM abstraction, and autonomous trading capabilities across crypto markets

Data Science
102232

rust-coding-skill

by UtakataKyosui

Guides Claude in writing idiomatic, efficient, well-structured Rust code using proper data modeling, traits, impl organization, macros, and build-speed best practices.

Data Science
248325

codex

by Lucklyric

Invoke Codex CLI for complex coding tasks requiring high reasoning capabilities. This skill should be invoked when users explicitly mention \

Data Science
16163