nnsight-remote-interpretability

Name: nnsight-remote-interpretability
Author: davila7

Provides guidance for interpreting and manipulating neural network internals using nnsight with optional NDIF remote execution. Use when needing to run interpretability experiments on massive models (70B+) without local GPU resources, or when working with any PyTorch

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: davila7
Category: Security

GitHub repo

About this skill

How to use

Zainstaluj nnsight i jego zależności: upewnij się, że masz PyTorch w wersji 2.0.0 lub wyższej oraz nnsight w wersji 0.5.0 lub wyższej dostępne w swoim środowisku Python.
Załaduj model PyTorch, z którym chcesz pracować — może to być dowolna architektura, od transformerów po Mambę czy modele niestandardowe.
Użyj kontekstu trace() do przechwycenia aktywacji: otwórz blok with model.trace(prompt) i wskaż warstwy oraz tensory, które chcesz zapisać za pomocą .save(). Na przykład model.transformer.h[5].output[0].save() przechwyci wyjście piątej warstwy.
Aby uruchomić eksperyment lokalnie na małym modelu, pozostaw domyślne ustawienia. Dla dużych modeli (70B+) dodaj parametr remote=True do trace(), co automatycznie przesyła obliczenia na zdalne zasoby NDIF.
Wykonaj interwencje na aktywacjach — możesz modyfikować wartości, przeprowadzać eksperymenty ablacyjne lub dzielić aktywacje między różne prompty, wszystko w ramach tego samego bloku trace().
Wyodrębnij wyniki z obiektu zwróconego przez trace() i analizuj uzyskane tensory — nnsight zwraca pełny dostęp do wewnętrznych stanów modelu bez konieczności reimplementacji.

Related skills

brand-voice

by anthropics

Apply and enforce brand voice, style guide, and messaging pillars across content. Use when reviewing content for brand consistency, documenting a brand voice, adapting tone for different audiences, or checking terminology and style guide compliance.

Security

48158

skill-writer

by pytorch

Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or design a new Skill, or needs help with SKILL.md files, frontmatter, or skill structure.

Security

15116

llama-cpp

by zechenzhangAGI

Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization (1.5-8 bit) for reduced memory and 4-10× speedup vs PyTorch on CPU.

Security

11252

zendesk

by vm0-ai

Zendesk Support REST API for managing tickets, users, organizations, and support operations. Use this skill to create tickets, manage users, search, and automate customer support workflows.

Security

11100

google-analytics

by davila7

Analyze Google Analytics data, review website performance metrics, identify traffic patterns, and suggest data-driven improvements. Use when the user asks about analytics, website metrics, traffic analysis, conversion rates, user behavior, or performance optimization.

Security

1260

manim

by davila7

Comprehensive guide for Manim Community - Python framework for creating mathematical animations and educational videos with programmatic control

Security

1588