redteam-plugin-development

Name: redteam-plugin-development
Author: promptfoo

Standards for creating redteam plugins and graders. Use when creating new plugins, writing graders, or modifying attack templates.

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: promptfoo
Category: Security
Views: 36

GitHub repo

About this skill

Standards for creating redteam plugins and graders. Use when creating new plugins, writing graders, or modifying attack templates.

How to use

Zainstaluj skill redteam-plugin-development w swoim środowisku Claude'a lub Codex'a, pobierając go z repozytorium promptfoo na GitHub.
Przed utworzeniem nowej wtyczki lub gradera zapoznaj się z wymaganiami dotyczącymi znormalizowanych tagów — zawsze używaj <UserQuery>{{prompt}}</UserQuery> dla promptu użytkownika i <purpose>{{purpose}}</purpose> dla celu systemu, nigdy nie stosuj przestarzałych wariantów takich jak <UserPrompt> czy <prompt>.
Podczas pisania gradera oceniającego odpowiedzi AI strukturyzuj kod według podanego szablonu: otwórz z opisem celu systemu, opcjonalnie dodaj listę dozwolonych jednostek, umieść prompt użytkownika w odpowiednich tagach, a następnie zdefiniuj warunki FAIL i PASS z przypisanymi wynikami.
Jeśli modyfikujesz szablony ataków, postępuj zgodnie ze strukturą DEFAULT_EXAMPLES — każdy przykład powinien zawierać cel systemu i przykładowy prompt ataku, wszystko opakowane w tagi <Example>.
Weryfikuj swoją wtyczkę lub grader pod kątem zgodności z wytycznymi — upewnij się, że wszystkie wymagane tagi są obecne, a struktura rubryk zawiera jasne kryteria oceny.
Testuj wtyczkę w kontekście red team promptfoo, aby potwierdzić, że grader prawidłowo ocenia odpowiedzi modelu i zwraca strukturę zawierającą reason, pass i score.

Related skills

typescript-review

by metabase

Review TypeScript and JavaScript code changes for compliance with Metabase coding standards, style violations, and code quality issues. Use when reviewing pull requests or diffs containing TypeScript/JavaScript code.

Security

17133

qmd

by tobi

Search personal markdown knowledge bases, notes, meeting transcripts, and documentation using QMD - a local hybrid search engine. Combines BM25 keyword search, vector semantic search, and LLM re-ranking. Use when users ask to search notes, find documents, look up information in

Security

1951

openapi-spec-generation

by wshobson

Generate and maintain OpenAPI 3.1 specifications from code, design-first specs, and validation patterns. Use when creating API documentation, generating SDKs, or ensuring API contract compliance.

Security

18109

ui-audit

by openclaw

AI skill for automated UI audits. Evaluate interfaces against proven UX principles for visual hierarchy, accessibility, cognitive load, navigation, and more. Based on Making UX Decisions by Tommy Geoco.

Security

1223

llama-cpp

by zechenzhangAGI

Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization (1.5-8 bit) for reduced memory and 4-10× speedup vs PyTorch on CPU.

Security

11252

brand-voice

by anthropics

Apply and enforce brand voice, style guide, and messaging pillars across content. Use when reviewing content for brand consistency, documenting a brand voice, adapting tone for different audiences, or checking terminology and style guide compliance.

Security

48158