gguf-quantization

Name: gguf-quantization
Author: davila7

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: davila7
Category: Security
Views: 20

GitHub repo

About this skill

How to use

Sklonuj repozytorium llama.cpp z GitHuba i przejdź do katalogu projektu.
Zbuduj projekt za pomocą make — wybierz wersję dla swojego sprzętu: make dla CPU, make GGML_CUDA=1 dla NVIDIA, lub make GGML_METAL=1 dla Apple Silicon.
Zainstaluj opcjonalne wiązania Pythona poleceniem pip install llama-cpp-python, jeśli planujesz używać modelu z kodu Python.
Pobierz model w formacie GGUF z repozytorium HuggingFace (szukaj tagów GGUF) lub skonwertuj istniejący model za pomocą skryptu konwersji z llama.cpp.
Uruchom model lokalnie za pomocą LM Studio, Ollama lub innego narzędzia obsługującego GGUF, wskazując pobrany plik.
Dostosuj parametry kwantyzacji (Q2_K do Q8_0) w zależności od dostępnej pamięci i wymaganej dokładności — niższe wartości (Q2_K) zużywają mniej RAM, wyższe (Q8_0) zachowują lepszą jakość.

Related skills

windows-ui-automation

by martinholovsky

Security

10115

google-analytics

by davila7

Analyze Google Analytics data, review website performance metrics, identify traffic patterns, and suggest data-driven improvements. Use when the user asks about analytics, website metrics, traffic analysis, conversion rates, user behavior, or performance optimization.

Security

1260

software-security

by project-codeguard

A software security skill that integrates with Project CodeGuard to help AI coding agents write secure code and prevent common vulnerabilities. Use this skill when writing, reviewing, or modifying code to ensure secure-by-default practices are followed.

Security

1678

architect-review

by sickn33

Master software architect specializing in modern architecture patterns, clean architecture, microservices, event-driven systems, and DDD. Reviews system designs and code changes for architectural integrity, scalability, and maintainability. Use PROACTIVELY for architectural

Security

2773

ui-audit

by openclaw

AI skill for automated UI audits. Evaluate interfaces against proven UX principles for visual hierarchy, accessibility, cognitive load, navigation, and more. Based on Making UX Decisions by Tommy Geoco.

Security

1223

zendesk

by vm0-ai

Zendesk Support REST API for managing tickets, users, organizations, and support operations. Use this skill to create tickets, manage users, search, and automate customer support workflows.

Security

11100