pufferlib

Name: pufferlib
Author: K-Dense-AI

by K-Dense-AI

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: K-Dense-AI
Category: Security
Views: 1

GitHub repo

About this skill

High-performance reinforcement learning framework optimized for speed and scale. Use when you need fast parallel training, vectorized environments, multi-agent systems, or integration with game environments (Atari, Procgen, NetHack). Achieves 2-10x speedups over standard implementations. For quick prototyping or standard algorithm implementations with extensive documentation, use stable-baselines3 instead.

How to use

Zainstaluj PufferLib za pomocą pip, a następnie zaimportuj bibliotekę oraz moduł PuffeRL w swoim skrypcie Pythona. 2. Przygotuj środowisko treningowe — możesz użyć istniejącego ze zbiorów Gymnasium, PettingZoo lub Procgen, albo zdefiniować własne, korzystając z API PufferEnv. 3. Skonfiguruj parametry treningu, takie jak urządzenie (CPU/GPU), współczynnik uczenia i architekturę sieci (CNN, LSTM lub niestandardowa). 4. Uruchom trening z linii poleceń poleceniem puffer train z nazwą środowiska i parametrami, na przykład puffer train procgen-coinrun --train.device cuda --train.learning-rate 3e-4. 5. Dla treningu rozproszonego na wielu GPU użyj torchrun z parametrem --nproc_per_node, aby przyspieszyć eksperymentację na dużych zbiorach danych. 6. Monitoruj postęp treningu i dostosowuj hiperparametry w zależności od osiąganych wyników.

Related skills

qmd

by tobi

Search personal markdown knowledge bases, notes, meeting transcripts, and documentation using QMD - a local hybrid search engine. Combines BM25 keyword search, vector semantic search, and LLM re-ranking. Use when users ask to search notes, find documents, look up information in

Security

1951

google-analytics

by davila7

Analyze Google Analytics data, review website performance metrics, identify traffic patterns, and suggest data-driven improvements. Use when the user asks about analytics, website metrics, traffic analysis, conversion rates, user behavior, or performance optimization.

Security

1260

better-auth-best-practices

by novuhq

Skill for integrating Better Auth - the comprehensive TypeScript authentication framework.

Security

1148

llama-cpp

by zechenzhangAGI

Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization (1.5-8 bit) for reduced memory and 4-10× speedup vs PyTorch on CPU.

Security

11252

obsidian

by gapmiss

Comprehensive guidelines for Obsidian.md plugin development including all 27 ESLint rules, TypeScript best practices, memory management, API usage (requestUrl vs fetch), UI/UX standards, and submission requirements. Use when working with Obsidian plugins, main.ts files,

Security

14111

gmail-manager

by jeffvincent

Manage Gmail - send, read, search emails, manage labels and drafts. Use when user wants to interact with their Gmail account for email operations.

Security

17128