data-engineer

Name: data-engineer
Author: sickn33

Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms. Use PROACTIVELY for data pipeline design, analytics infrastructure, or modern data stack implementation.

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: sickn33
Category: DevOps
Views: 19

GitHub repo

About this skill

How to use

Załaduj umiejętność data-engineer do swojego agenta lub Claude'a. Umiejętność aktywuje się automatycznie, gdy będziesz projektować potoki danych, magazyny lub architektury lakehouse.
Zdefiniuj źródła danych, umowy dotyczące danych (data contracts) i wymagane SLA. Opisz, skąd pochodzą dane, jak często się aktualizują i jakie są wymagania dotyczące opóźnień.
Wybierz architekturę i narzędzia: określ, czy potrzebujesz przetwarzania batch'owego czy streamingowego, jakie magazyny danych (Snowflake, BigQuery, Redshift) i narzędzia orkiestracji (Airflow, dbt) będą pasować do Twoich wymagań.
Zaplanuj ingestion, transformacje i walidację danych. Umiejętność pomoże Ci zbudować etapy oczyszczania, transformacji i kontroli jakości przed zapisem do systemów produkcyjnych.
Wdrażaj zabezpieczenia: upewnij się, że dane osobowe (PII) są chronione, zastosuj least-privilege access i waliduj dane przed zapisem w produkcji.
Monitoruj niezawodność, koszty i wydajność potoków. Umiejętność wspiera ustawienie alertów, śledzenie lineage danych i optymalizację kosztów infrastruktury cloud.

Related skills

task-master

by sfc-gh-dflippo

AI-powered task management for structured, specification-driven development. Use this skill when you need to manage complex projects with PRDs, break down tasks into subtasks, track dependencies, and maintain organized development workflows across features and branches.

DevOps

14126

cloudflare-manager

by qdhenry

Comprehensive Cloudflare account management for deploying Workers, KV Storage, R2, Pages, DNS, and Routes. Use when deploying cloudflare services, managing worker containers, configuring KV/R2 storage, or setting up DNS/routing. Requires CLOUDFLARE_API_KEY in .env and Bun

DevOps

20122

planning-with-files

by davila7

Implements Manus-style file-based planning for complex tasks. Creates task_plan.md, findings.md, and progress.md. Use when starting complex multi-step tasks, research projects, or any task requiring u003e5 tool calls.

DevOps

2365

docker-containerization

by openclaw

This skill should be used when containerizing applications with Docker, creating Dockerfiles, docker-compose configurations, or deploying containers to various platforms. Ideal for Next.js, React, Node.js applications requiring containerization for development, production, or

DevOps

1334

postmortem-writing

by wshobson

Write effective blameless postmortems with root cause analysis, timelines, and action items. Use when conducting incident reviews, writing postmortem documents, or improving incident response processes.

DevOps

1385

crawl4ai

by basher83

This skill should be used when users need to scrape websites, extract structured data, handle JavaScript-heavy pages, crawl multiple URLs, or build automated web data pipelines. Includes optimized extraction patterns with schema generation for efficient, LLM-free extraction.

DevOps

11128