siliconflow-vision

Name: siliconflow-vision
Author: openclaw

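Image recognition and analysis tool. It uses a vision large model to recognize image content and outputs detailed, objective recognition results for the main model to analyze. When a user sends an image, the main model must call this skill directly, then analyze and answer based on the recognition results. Supports multiple providers, including SiliconFlow (default), OpenAI, and Anthropic.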

Installation

Pick a client and clone the repository into its skills directory.

Quick info

Category: Data Science

How to use

Install the skill in your Claude/Codex/Copilot agent environment, making sure you have access to the openclaw/skills repository and the siliconflow-vision folder.
Configure the API keys in config/default.json for your chosen provider. SiliconFlow is used by default with the siliconflow_api_key key, but you can also set openai_api_key for OpenAI or anthropic_api_key for Anthropic.
When the user sends an image, the main agent should automatically invoke the skill with the command: python scripts/analyze_image.py /path/to/image.jpg
For more precise analysis of complex images, charts, or memes, use smart mode: python scripts/analyze_image.py image.png -m smart. This mode takes longer (about 2 minutes) but gives more accurate results.
To tailor the question to a specific task, add the -q parameter, for example: python scripts/analyze_image.py photo.jpg -q "Extract all text from the image". You can also use the -s flag for shortened output or --provider openai to switch providers.
The main agent analyzes the results returned by the skill and answers the user based on them. The skill provides recognition only; analysis, reasoning, and answering questions remain the main agent's job.
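
As a rough illustration of how a main agent might drive the commands above, here is a minimal Python sketch. The helper names build_command and analyze_image are hypothetical and not part of the skill. The sketch also assumes that scripts/analyze_image.py prints its recognition result to standard output and exits non-zero on failure, which this page does not confirm. The 300-second timeout is likewise an assumption, chosen to leave headroom for smart mode's roughly 2 minutes.

```python
import subprocess
import sys
from typing import List, Optional


def build_command(
    image_path: str,
    smart: bool = False,
    question: Optional[str] = None,
    short: bool = False,
    provider: Optional[str] = None,
    script: str = "scripts/analyze_image.py",
) -> List[str]:
    """Assemble the argument list using only the flags described above."""
    cmd = [sys.executable, script, image_path]
    if smart:
        cmd += ["-m", "smart"]          # slower, more accurate mode
    if question:
        cmd += ["-q", question]         # custom question for the vision model
    if short:
        cmd.append("-s")                # shortened output
    if provider:
        cmd += ["--provider", provider]  # e.g. "openai"
    return cmd


def analyze_image(image_path: str, timeout: float = 300.0, **options) -> str:
    """Run the skill and return its stdout (assumed to hold the recognition text)."""
    result = subprocess.run(
        build_command(image_path, **options),
        capture_output=True,
        text=True,
        timeout=timeout,  # assumption: generous limit for smart mode
        check=True,       # assumption: the script exits non-zero on failure
    )
    return result.stdout
```

For example, analyze_image("chart.png", smart=True, question="Extract all text from the image") would run the smart-mode command with a custom question. Passing the arguments as a list rather than a shell string avoids quoting problems when the question contains spaces or quotes.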
