benchmark-kernel

Name: benchmark-kernel
Author: flashinfer-ai

by flashinfer-ai

Guide for benchmarking FlashInfer kernels with CUPTI timing

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: flashinfer-ai
Category: Backend

GitHub repo

About this skill

Guide for benchmarking FlashInfer kernels with CUPTI timing

How to use

Zainstaluj CUPTI dla najdokładniejszych pomiarów: uruchom pip install -U cupti-python. Wymaga CUDA 13 lub nowszej. Jeśli CUPTI nie jest dostępny, skill automatycznie przejdzie na CUDA Events i wyświetli ostrzeżenie.
Wybierz rutynę testową z dostępnych opcji: Attention (BatchDecodeWithPagedKVCacheWrapper, BatchPrefillWithPagedKVCacheWrapper, BatchPrefillWithRaggedKVCacheWrapper, BatchMLAPagedAttentionWrapper), GEMM (bmm_fp8, gemm_fp8_nt_groupwise, group_gemm_fp8_nt_groupwise, mm_fp4) lub MOE (trtllm_fp4_block_scale_moe, trtllm_fp8_block_scale_moe, trtllm_fp8_per_tensor_scale_moe, cutlass_fused_moe).
Uruchom benchmark dla wybranej rutyny za pomocą skryptu flashinfer_benchmark.py z parametrami określającymi typ testu i konfigurację.
Skill zmierzy czysty czas wykonania jądra GPU bez narzutu komunikacji host-device (CUPTI) lub z minimalnym narzutem (CUDA Events).
Wyniki będą zapisane w formacie CSV do analizy i porównania wydajności między różnymi backendami.

Related skills

openspec

by ruan-cat

OpenSpec 中文版规范助手 - 规范驱动的 AI 编程开发，帮助初始化、创建提案、编写规格、校验格式并归档变更。触发条件：当用户提及 openspec、规范文档、需求管理、变更提案、spec-driven development 等关键词时主动调用。

Backend

27122

youtube-transcript

by michalparkola

Download YouTube video transcripts when user provides a YouTube URL or asks to download/get/fetch a transcript from YouTube. Also use when user wants to transcribe or get captions/subtitles from a YouTube video.

Backend

53214

postgresql-psql

by timelessco

Comprehensive guide for PostgreSQL psql - the interactive terminal client for PostgreSQL. Use when connecting to PostgreSQL databases, executing queries, managing databases/tables, configuring connection options, formatting output, writing scripts, managing transactions, and

Backend

34173

find-skills

by openstatusHQ

Helps users discover and install agent skills when they ask questions like \

Backend

150111

travel-planner

by ailabs-393

This skill should be used whenever users need help planning trips, creating travel itineraries, managing travel budgets, or seeking destination advice. On first use, collects comprehensive travel preferences including budget level, travel style, interests, and dietary

Backend

4379

supabase-developer

by daffy0208

Build full-stack applications with Supabase (PostgreSQL, Auth, Storage, Real-time, Edge Functions). Use when implementing authentication, database design with RLS, file storage, real-time features, or serverless functions.

Backend

78170