nowait-reasoning-optimizer

Name: nowait-reasoning-optimizer
Author: davila7

Implements the NOWAIT technique for efficient reasoning in R1-style LLMs. Use when optimizing inference of reasoning models (QwQ, DeepSeek-R1, Phi4-Reasoning, Qwen3, Kimi-VL, QvQ), reducing chain-of-thought token usage by 27-51% while preserving accuracy. Triggers on \

Installation

Pick a client and clone the repository into its skills directory.

Quick info

Author: davila7
Category: Testing

About this skill

How to use

Clone the repository that contains the skill component from the main branch of the claude-code-templates project. The skill lives in the cli-tool/components/skills/productivity/nowait directory.
Install the dependencies required by your reasoning model. Make sure you have access to the tokenizer of the model you want to optimize (QwQ, DeepSeek-R1, Phi4-Reasoning, Qwen3, Kimi-VL, or QvQ).
Import NOWAITLogitProcessor from the scripts/nowait_processor module in your Python code. Initialize the processor by passing your model's tokenizer as an argument.
When generating text, pass the initialized processor to the logits_processor parameter of the model's generate() method. Set max_new_tokens to the value you need (e.g. 32768 for longer reasoning sequences).
Run the model with the NOWAIT processor enabled. The technique automatically suppresses self-reflection tokens (such as "Wait", "Hmm", "Alternatively"), shortening the output with no additional training.
Monitor the change in output length and token cost. For models trained with reinforcement learning (RL), you can expect a 16-60% reduction depending on the model series. Avoid applying NOWAIT to distilled models (Qwen3-4B/8B/14B), where performance problems may appear.
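
The suppression and monitoring steps above can be sketched in plain Python. This is an illustration of the general idea only, not the skill's actual NOWAITLogitProcessor: the function names, the toy vocabulary, the logit values, and the three-keyword list (taken from the examples in the text) are all assumptions made for the example.

```python
import math

# Keywords taken from the examples in the text; the skill's real list may differ.
REFLECTION_KEYWORDS = ("wait", "hmm", "alternatively")


def find_suppressed_ids(vocab, keywords=REFLECTION_KEYWORDS):
    """Return the ids of every token whose text contains a reflection keyword."""
    suppressed = set()
    for token, token_id in vocab.items():
        # Drop common tokenizer space markers and compare case-insensitively.
        text = token.replace("Ġ", "").replace("▁", "").strip().lower()
        if any(keyword in text for keyword in keywords):
            suppressed.add(token_id)
    return suppressed


def suppress_logits(logits, suppressed_ids):
    """Return a copy of the logits with suppressed ids set to -inf."""
    return [
        -math.inf if token_id in suppressed_ids else score
        for token_id, score in enumerate(logits)
    ]


def token_reduction_percent(baseline_tokens, nowait_tokens):
    """Percentage of output tokens saved relative to the baseline run."""
    return 100.0 * (baseline_tokens - nowait_tokens) / baseline_tokens


# Hypothetical toy vocabulary and logits, for illustration only.
toy_vocab = {"The": 0, "ĠWait": 1, "Ġanswer": 2, "Hmm": 3, "Ġis": 4, "ĠAlternatively": 5}
toy_logits = [0.1, 2.5, 0.7, 1.9, 0.3, 1.2]

suppressed = find_suppressed_ids(toy_vocab)
filtered = suppress_logits(toy_logits, suppressed)

# With the reflection tokens masked out, greedy decoding picks another token.
best_token_id = max(range(len(filtered)), key=filtered.__getitem__)
```

In a real pipeline the masking would normally happen inside a logits processor that is called at every decoding step, with the suppressed ids computed once from the tokenizer's vocabulary. The substring match used here is deliberately crude and would also catch unrelated tokens such as "awaiting", so a production keyword filter would need a stricter rule. token_reduction_percent can be applied to the output lengths of a baseline run and a NOWAIT run to check whether the observed savings fall in the range quoted above.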

Related skills

vitest

by antfu

Vitest fast unit testing framework powered by Vite with Jest-compatible API. Use when writing tests, mocking, configuring coverage, or working with test filtering and fixtures.

Testing

1236

performing-penetration-testing

by jeremylongshore

This skill enables automated penetration testing of web applications. It uses the penetration-tester plugin to identify vulnerabilities, including OWASP Top 10 threats, and suggests exploitation techniques. Use this skill when the user requests a \

Testing

1546

langchain

by zechenzhangAGI

Framework for building LLM-powered applications with agents, chains, and RAG. Supports multiple providers (OpenAI, Anthropic, Google), 500+ integrations, ReAct agents, tool calling, memory management, and vector store retrieval. Use for building chatbots, question-answering

Testing

21123

hono

by openstatusHQ

Efficiently develop Hono applications using Hono CLI. Supports documentation search, API reference lookup, request testing, and bundle optimization.

Testing

1257

testing-workflow

by amo-tech-ai

Comprehensive testing workflow for E2E, integration, and unit tests. Use when testing applications layer-by-layer, validating user journeys, or running test suites.

Testing

1076

langgraph-docs

by langchain-ai

Use this skill for requests related to LangGraph in order to fetch relevant documentation to provide accurate, up-to-date guidance.

Testing

23127