Toolverse
All skills

rwkv-architecture

by davila7

RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux Foundation AI project. Production at Windows, Office, NeMo. RWKV-7 (March 2025). Models up to 14B parameters.

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author
davila7
Category
Security

About this skill

RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux Foundation AI project. Production at Windows, Office, NeMo. RWKV-7 (March 2025). Models up to 14B parameters.

How to use

  1. Zainstaluj PyTorch z obsługą CUDA: pip install torch --upgrade --extra-index-url https://download.pytorch.org/whl/cu121

  2. Zainstaluj zależności: pip install pytorch-lightning==1.9.5 deepspeed wandb ninja --upgrade

  3. Zainstaluj bibliotekę RWKV: pip install rwkv

  4. Pobierz wstępnie wytrenowany model (np. RWKV-4-Pile-14B) i ustaw ścieżkę w kodzie.

  5. Załaduj model w Pythonie, ustawiając zmienne środowiskowe RWKV_JIT_ON=1 i RWKV_CUDA_ON=1 dla przyspieszenia, następnie inicjalizuj RWKV z wybraną strategią (np. 'cuda fp16').

  6. Użyj modelu w trybie GPT (przetwarzanie równoległe wielu tokenów) lub RNN (sekwencyjne przetwarzanie token po tokenie) — obie metody dają identyczne wyniki logitów, ale RNN jest bardziej efektywny dla generowania tekstu w czasie rzeczywistym.

Related skills

backend-security-coder

by sickn33

Expert in secure backend coding practices specializing in input validation, authentication, and API security. Use PROACTIVELY for backend security implementations or security code reviews.

Security
1133

typescript-review

by metabase

Review TypeScript and JavaScript code changes for compliance with Metabase coding standards, style violations, and code quality issues. Use when reviewing pull requests or diffs containing TypeScript/JavaScript code.

Security
17133

skill-writer

by pytorch

Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or design a new Skill, or needs help with SKILL.md files, frontmatter, or skill structure.

Security
15116

ui-audit

by openclaw

AI skill for automated UI audits. Evaluate interfaces against proven UX principles for visual hierarchy, accessibility, cognitive load, navigation, and more. Based on Making UX Decisions by Tommy Geoco.

Security
1223

senior-security

by davila7

Comprehensive security engineering skill for application security, penetration testing, security architecture, and compliance auditing. Includes security assessment tools, threat modeling, crypto implementation, and security automation. Use when designing security architecture,

Security
2482

llama-cpp

by zechenzhangAGI

Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization (1.5-8 bit) for reduced memory and 4-10× speedup vs PyTorch on CPU.

Security
11252