* Add .worktrees/ to .gitignore
* feat: add Jaeger alerting rules (8 rules from official jaeger-mixin)
Rules cover agent HTTP errors, RPC errors, client/agent/collector span drops,
sampling update failures, throttling update failures, and query request failures.
All rules sourced from https://github.com/jaegertracing/jaeger/tree/main/monitoring/jaeger-mixin
* fix: rename Jaeger agent RPC alert to Jaeger client RPC
The jaeger_client_jaeger_rpc_http_requests metric is client-side,
not agent-side. Rename alert to match the actual metric source.
* smartctl_exporter publishes both drive_trip and current drive temperatures. Since most of the alerts are going to be permanent, it does not make sense to wait for the alert to be on for a certain time. Temperature sensors likewise vary, using the last sample is not sufficient to alert on potential issues.
* Add an option to run GitHub Action manually
* Add an option to force running the action for testing purposes
* Set variables correctly
* Set variables correctly
* Publish
* Clean up some more metrics
* Publish
* Minor bug fixes
* Publish
* Removed queries that throw errors when systems are upgraded. Also fixed and simplified a few Postgres queries.
* Publish
* Refined some more queries
* Publish
* PostgreSQL now has optimized autovacuum behavior
* Publish
* PostgreSQL now has optimized autovacuum behavior
* Publish
* Publish
* Query fails if instance names are not unique across jobs. This fixes it.
* Publish
* Ruby is out of date
---------
Co-authored-by: samber <samber@users.noreply.github.com>