hugging-face-evaluation

by huggingface

Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis API, and running custom model evaluations with vLLM/lighteval. Works with the model-index metadata format.

Installation

Pick a client and clone the repository into its skills directory.

Quick info

Category
Data Science
Views
14

How to use

  1. Install the skill as a dependency in your Claude/Copilot project, making sure you have Python 3.8+ and the uv tool for environment management.

  2. Before creating a pull request, check the existing open PRs in the repository to avoid duplicates. This is a critical step before using the --create-pr flag.

  3. Choose a method for adding evaluation results: extract existing tables from the model's README, import scores from the Artificial Analysis API, or run your own evaluation.

  4. For custom evaluations with vLLM, install the additional dependencies (torch, transformers, accelerate, lighteval). They are installed automatically if you use uv run with PEP 723 scripts.

  5. The skill updates the model-index metadata in a format compatible with Papers with Code and Hugging Face leaderboards.

  6. If you run evaluations on Hugging Face Jobs, the skill supports integration with uv for dependency management and GPU inference through vLLM.
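
To make the README-extraction path and the model-index format more concrete, here is a minimal sketch of how a Markdown results table might be turned into a model-index structure. It is not the skill's own code: the helper names (parse_markdown_table, build_model_index), the column names, the example model ID, the scores, and the choice of "text-generation" and "accuracy" as task and metric types are all assumptions made for illustration. The comment header at the top follows the PEP 723 inline-metadata layout that uv run reads. The sketch needs only the standard library, so its dependency list is empty.

```python
# /// script
# requires-python = ">=3.8"
# dependencies = []
# ///
"""Sketch: convert a Markdown eval table into a model-index entry.

All helper names and example values here are hypothetical.
"""
import json


def parse_markdown_table(text):
    """Return row dicts from the first pipe-delimited table found in text."""
    rows = []
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            if header is not None:
                break  # the table has ended
            continue
        cells = [c.strip() for c in line.strip("|").split("|")]
        if header is None:
            header = cells  # first table line holds the column names
        elif all(set(c) <= set("-: ") for c in cells):
            continue  # skip the |---|---| separator row
        else:
            rows.append(dict(zip(header, cells)))
    return rows


def build_model_index(model_name, rows, dataset_col="Benchmark", score_col="Score"):
    """Wrap parsed rows in the model-index layout (task, dataset, metrics)."""
    results = []
    for row in rows:
        name = row[dataset_col]
        results.append({
            # Task and metric types are assumed; pick the ones that fit your model.
            "task": {"type": "text-generation"},
            "dataset": {"name": name, "type": name.lower().replace(" ", "_")},
            "metrics": [
                {"type": "accuracy", "name": "accuracy", "value": float(row[score_col])}
            ],
        })
    return {"model-index": [{"name": model_name, "results": results}]}


# Made-up README content, used only to exercise the helpers.
README = """
# Example model

| Benchmark | Score |
|-----------|-------|
| MMLU | 71.3 |
| GSM8K | 64.0 |

Further notes.
"""

rows = parse_markdown_table(README)
metadata = build_model_index("example-org/example-model", rows)
print(json.dumps(metadata, indent=2))
```

The resulting dictionary mirrors what would sit in the YAML front matter of a model card. In practice the skill handles writing it back to the README, and a real table would probably need per-row metric types rather than a single assumed one.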

Related skills

pdf-processing

by Ming-Kai-LC

Comprehensive PDF processing techniques for handling large files that exceed Claude Code's reading limits, including chunking strategies, text/table extraction, and OCR for scanned documents. Use when working with PDFs larger than 10-15MB or more than 30-50 pages.

Data Science
23134

rust-coding-skill

by UtakataKyosui

Guides Claude in writing idiomatic, efficient, well-structured Rust code using proper data modeling, traits, impl organization, macros, and build-speed best practices.

Data Science
248325

pptx

by anthropics

Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks

Data Science
134310

ml-paper-writing

by davila7

Write publication-ready ML/AI papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM. Use when drafting papers from research repos, structuring arguments, verifying citations, or preparing camera-ready submissions. Includes LaTeX templates, reviewer guidelines, and citation

Data Science
2681

infographic-creation

by antvis

Create beautiful infographics based on the given text content. Use this when users request creating infographics.

Data Science
60199

data-storytelling

by wshobson

Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.

Data Science
26105