mlops-observability

Name: mlops-observability
Author: fmind

Guide to implement full stack observability including reproducibility, lineage, monitoring, alerting, and explainability.

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: fmind
Category: DevOps
Views: 14

GitHub repo

About this skill

Guide to implement full stack observability including reproducibility, lineage, monitoring, alerting, and explainability.

How to use

Zainstaluj wymagane biblioteki Python: MLflow, Evidently, SHAP i plyer. Upewnij się, że masz Docker i narzędzie uv do zarządzania zależnościami.
Gwarantuj powtarzalność wyników, ustawiając seedy dla random, numpy, torch i tensorflow na początku każdego eksperymentu. Zablokuj wersje zależności w pliku uv.lock i używaj Docker do spójności środowiska. Śledź hash commitu git dla każdego uruchomienia.
Śledzenie lineażu danych: utwórz datasety MLflow za pomocą mlflow.data.from_pandas, loguj wejścia kontekstu z mlflow.log_input, wersjonuj pliki danych (np. data/v1.csv) lub użyj DVC do śledzenia zmian.
Skonfiguruj monitorowanie i detekcję dryfu: włącz MLflow Evaluate do walidacji modeli względem progów jakości, użyj Evidently do porównania danych treningowych (reference) z danymi produkcyjnymi (current), włącz metryki systemowe MLflow (CPU/GPU) za pomocą log_system_metrics=True.
Ustaw alerty: dla lokalnych uruchomień użyj plyer do powiadomień na pulpicie, dla produkcji skonfiguruj integrację z PagerDuty (alerty krytyczne) lub Slack (ostrzeżenia), zdefiniuj progi statyczne lub dynamiczne (anomalie).
Wdrażaj w produkcji, regularnie porównując metryki modelu i rozkłady danych, reagując na alerty przed degradacją wydajności.

Related skills

unity-developer

by sickn33

Build Unity games with optimized C# scripts, efficient rendering, and proper asset management. Masters Unity 6 LTS, URP/HDRP pipelines, and cross-platform deployment. Handles gameplay systems, UI implementation, and platform optimization. Use PROACTIVELY for Unity performance

DevOps

66164

clawhub

by openclaw

Use the ClawHub CLI to search, install, update, and publish agent skills from clawhub.com. Use when you need to fetch new skills on the fly, sync installed skills to latest or a specific version, or publish new/updated skill folders with the npm-installed clawhub CLI.

DevOps

14106

file-organizer

by ComposioHQ

Intelligently organizes your files and folders across your computer by understanding context, finding duplicates, suggesting better structures, and automating cleanup tasks. Reduces cognitive load and keeps your digital workspace tidy without manual effort.

DevOps

1399

chrome-devtools

by mrgoonie

Browser automation, debugging, and performance analysis using Puppeteer CLI scripts. Use for automating browsers, taking screenshots, analyzing performance, monitoring network traffic, web scraping, form automation, and JavaScript debugging.

DevOps

29141

macos-cleaner

by daymade

Analyze and reclaim macOS disk space through intelligent cleanup recommendations. This skill should be used when users report disk space issues, need to clean up their Mac, or want to understand what's consuming storage. Focus on safe, interactive analysis with user confirmation

DevOps

1331

3d-games

by davila7

3D game development principles. Rendering, shaders, physics, cameras.

DevOps

1355