evaluating-machine-learning-models

Name: evaluating-machine-learning-models
Author: jeremylongshore

This skill allows an AI assistant to evaluate machine learning models using a comprehensive suite of metrics. It should be used when the user requests model performance analysis, validation, or testing. The AI assistant can use this skill to assess model accuracy, p... Use when

Installation

Pick a client and clone the repository into its skills directory.

Quick info

Author: jeremylongshore
Category: Data Science
Views: 3

About this skill

How to use

Install the skill in your Claude Code, Codex, or OpenClaw environment, making sure you have access to the Read, Write, Edit, Grep, Glob, and Bash tools.
Prepare the model for evaluation: make sure the model is available in your project or repository, along with test or validation data.
Ask Claude to evaluate the model in natural language, e.g. "Evaluate the accuracy of my image classification model" or "Compare the performance of these two models".
The skill automatically analyzes your request, identifies the model to evaluate, and selects appropriate metrics based on context.
Claude runs the evaluation using the /eval-model command from the model-evaluation-suite package, generating metrics such as accuracy, precision, recall, and F1-score.
Review the results: Claude presents the generated metrics, highlights key performance indicators, and suggests areas for optimizing the model or making a deployment decision.
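
For reference, the metrics named in the steps above (accuracy, precision, recall, F1-score) can be sketched in plain Python for the binary classification case. This is an illustration only: the internals of /eval-model are not shown on this page, and the function name binary_classification_metrics and the sample labels are hypothetical.

```python
from typing import Dict, Sequence


def binary_classification_metrics(
    y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1
) -> Dict[str, float]:
    """Hypothetical helper: compute accuracy, precision, recall, and F1.

    Not the skill's actual implementation; a sketch of the standard
    definitions for a binary classifier.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    # Confusion-matrix counts relative to the chosen positive label.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    total = len(pairs)
    # Each ratio falls back to 0.0 when its denominator is zero.
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Made-up labels for illustration: 3 TP, 1 FP, 1 FN, 3 TN.
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = binary_classification_metrics(y_true, y_pred)
print(metrics)
```

On imbalanced data, accuracy alone can look good while recall is poor, which is why the steps above list precision, recall, and F1-score alongside it.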

Related skills

skill-creator

by anthropics

Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.

Data Science

59147

notebooklm

by leegonzales

Query Google NotebookLM for source-grounded, citation-backed answers from uploaded documents. Reduces hallucinations through Gemini's document-only responses. Browser automation with library management and persistent authentication.

Data Science

142112

a-stock-analysis

by openclaw

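Real-time A-share quotes and intraday volume analysis. Fetches real-time prices, price changes, and trading volume for Shanghai and Shenzhen stocks, and analyzes intraday volume distribution (heavy volume at the open or close), major-player activity (accumulation and distribution signals), and limit-up sealing orders. Supports position management and profit/loss analysis. Use when: (1) querying real-time A-share quotes, (2) analyzing major capital flows, (3) viewing intraday volume distribution, (4) managing stock positions, (5) analyzing position profit and loss.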

Data Science

48153

nano-banana-pro

by garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., \

Data Science

535772

claude-automation-recommender

by anthropics

Analyze a codebase and recommend Claude Code automations (hooks, subagents, skills, plugins, MCP servers). Use when user asks for automation recommendations, wants to optimize their Claude Code setup, mentions improving Claude Code workflows, asks how to first set up Claude Code

Data Science

1787

infographic-creation

by antvis

Create beautiful infographics based on the given text content. Use this when users request creating infographics.

Data Science

60199