rag-implementation

Name: rag-implementation
Author: wshobson

Build Retrieval-Augmented Generation (RAG) systems for LLM applications with vector databases and semantic search. Use when implementing knowledge-grounded AI, building document Qu0026A systems, or integrating LLMs with external knowledge bases.

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: wshobson
Category: Data Science
Views: 98

GitHub repo

About this skill

How to use

Wybierz bazę wektorową dostosowaną do Twoich potrzeb: Pinecone dla zarządzanego rozwiązania, Weaviate dla hybrydowego wyszukiwania, Milvus dla wdrożenia on-premise, Chroma dla lekkiego prototypu, Qdrant dla filtrowanego wyszukiwania lub FAISS dla lokalnego wdrożenia.
Wybierz model do generowania embeddings — text-embedding-ada-002 od OpenAI do zastosowań ogólnych, all-MiniLM-L6-v2 dla szybkości, e5-large-v2 dla wysokiej jakości i wielojęzyczności, lub bge-large-en-v1.5 dla najlepszej wydajności.
Przygotuj dokumenty i przekonwertuj je na wektory za pomocą wybranego modelu embeddings, a następnie załaduj do bazy wektorowej.
Implementuj strategię wyszukiwania — użyj dense retrieval dla podobieństwa semantycznego, sparse retrieval dla dopasowania słów kluczowych, hybrid search do połączenia obu podejść, multi-query do generowania wariantów zapytania lub HyDE do generowania hipotetycznych dokumentów.
Dodaj reranking do poprawy jakości wyników — zastosuj cross-encoders lub inne metody rerankerów, aby przenieść najistotniejsze dokumenty na początek listy.
Zintegruj wyniki wyszukiwania z dużym modelem językowym, aby generować odpowiedzi ugruntowane w pobranej wiedzy, zmniejszając halucynacje i zapewniając cytowanie źródeł.

Related skills

excalidraw

by ryanquinn3

Data Science

124204

claude-automation-recommender

by anthropics

Analyze a codebase and recommend Claude Code automations (hooks, subagents, skills, plugins, MCP servers). Use when user asks for automation recommendations, wants to optimize their Claude Code setup, mentions improving Claude Code workflows, asks how to first set up Claude Code

Data Science

1787

pptx

by anthropics

Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks

Data Science

134310

pdf-processing

by Ming-Kai-LC

Comprehensive PDF processing techniques for handling large files that exceed Claude Code's reading limits, including chunking strategies, text/table extraction, and OCR for scanned documents. Use when working with PDFs larger than 10-15MB or more than 30-50 pages.

Data Science

23134

web-artifacts-builder

by anthropics

Suite of tools for creating elaborate, multi-component claude.ai HTML artifacts using modern frontend web technologies (React, Tailwind CSS, shadcn/ui). Use for complex artifacts requiring state management, routing, or shadcn/ui components - not for simple single-file HTML/JSX

Data Science

37124

nano-banana-pro

by garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., \

Data Science

535772