rag-engineer

Name: rag-engineer
Author: davila7

Expert in building Retrieval-Augmented Generation systems. Masters embedding models, vector databases, chunking strategies, and retrieval optimization for LLM applications. Use when: building RAG, vector search, embeddings, semantic search, document retrieval.

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: davila7
Category: Data Science
Views: 15

GitHub repo

About this skill

How to use

Zainstaluj umiejętność rag-engineer w swoim środowisku Claude lub kompatybilnym systemie agenta. Upewnij się, że masz dostęp do dokumentacji modeli embeddingów i podstawową wiedzę o NLP.
Przygotuj swoje dokumenty do indeksowania, stosując semantic chunking — dziel tekst na fragmenty oparte na znaczeniu, a nie na arbitralnych limitach tokenów. Zachowaj strukturę dokumentu (nagłówki, paragrafy) i dodaj metadane dla przyszłego filtrowania.
Wygeneruj embeddingi dla każdego fragmentu dokumentu, wybierając odpowiedni model embeddingów. Przechowuj je w bazie wektorowej, która wspiera wyszukiwanie podobieństwa.
Zaimplementuj wyszukiwanie hybrydowe, łączące wyszukiwanie semantyczne (przez podobieństwo wektorów) z wyszukiwaniem słów kluczowych (BM25/TF-IDF). Użyj Reciprocal Rank Fusion do połączenia wyników z obu podejść.
Optymalizuj okno kontekstu, testując różne rozmiary fragmentów i strategie retrieval. Rozważ hierarchiczne wyszukiwanie — indeksuj dokumenty na wielu poziomach (paragraf, sekcja, dokument) i wykonaj dwuetapową retrieval dla lepszej precyzji.
Ewaluuj jakość retrieval przed wdrożeniem — garbage in, garbage out. Upewnij się, że fragmenty zwracane przez system rzeczywiście zawierają odpowiedzi na pytania użytkowników.

Related skills

arxiv-search

by langchain-ai

Search arXiv preprint repository for papers in physics, mathematics, computer science, quantitative biology, and related fields

Data Science

76172

rust-coding-skill

by UtakataKyosui

Guides Claude in writing idiomatic, efficient, well-structured Rust code using proper data modeling, traits, impl organization, macros, and build-speed best practices.

Data Science

248325

stock-analyzer

by FrancyJGLisboa

Provides comprehensive technical analysis for stocks and ETFs using RSI, MACD, Bollinger Bands, and other indicators. Activates when user requests stock analysis, technical indicators, trading signals, or market data for specific ticker symbols.

Data Science

23128

xlsx

by anthropics

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2)

Data Science

40128

skill-installer

by openai

Install Codex skills into $CODEX_HOME/skills from a curated list or a GitHub repo path. Use when a user asks to list installable skills, install a curated skill, or install a skill from another repo (including private repos).

Data Science

23118

excalidraw

by ryanquinn3

Data Science

124204