mirror of https://github.com/samber/awesome-prometheus-alerts.git synced 2026-06-25 02:46:59 +08:00

🚨 Collection of Prometheus alerting rules

alert alerting alerting-rules alertmanager awesome collection exporter grafana monitoring prometheus prometheus-alerting-rules promql query rule supervision

Find a file

Samuel Berthe 04a8ae2fe3 fix(data): PromQL type fixes, job filter cleanup, query correctness review - Replace rate()/increase() with deriv()/delta() on gauge metrics: node_vmstat_pgmajfault, cassandra_stats (criteo exporter), gitlab_ci_pipeline_failure_reasons, flink_taskmanager_job_task_numRecordsIn - Fix histogram_quantile on non-_bucket metric: cilium_policy_implementation_delay - Fix Thanos bucket replicate latency: use _count instead of _bucket for guard clause - Fix Thanos query latency: use _count instead of _bucket for guard clause - Restore job filter in Thanos objstore guard clauses (compact + store) - Remove redundant job= filters from unique metrics: ~30 Thanos rules, kube_persistentvolume_status_phase, otelcol_process_runtime_* - Fix high-cardinality Istio latency grouping (drop source labels from by()) - Add division-by-zero guard to host context switch ratio - Raise noisy ClickHouse thresholds: RejectedInserts > 2, DelayedInserts > 10 - Remove redundant for: 1m from HAProxy check failure rules - Add job rename comments to up{job=...} rules (Hadoop, OpenStack, SNMP, OTel) - Remove external mixin references from comments - Fix Tempo dropped spans metric name: add missing _total suffix - Fix Thanos bucket replicate run latency: add missing le label in by()		2026-04-06 20:12:18 +02:00
.github	fix ci	2026-03-16 19:17:05 +01:00
_data	fix(data): PromQL type fixes, job filter cleanup, query correctness review	2026-04-06 20:12:18 +02:00
_layouts	fix: fix favicon path	2026-03-15 23:54:05 +01:00
assets	Website: Support dark mode (#501 )	2026-03-01 22:54:42 +01:00
dist	Publish	2026-03-18 17:06:34 +00:00
.gitignore	Feat/jaeger alerting rules (#521 )	2026-03-16 14:09:03 +01:00
.travis.yml	💄 awesome-lint	2019-02-11 22:09:50 +01:00
_config.yml	chore: move from "https://awesome-prometheus-alerts.grep.to " to "https://samber.github.io/awesome-prometheus-alerts/"	2023-04-23 23:32:26 +02:00
alertmanager.md	Update alertmanager.md	2024-10-06 17:31:23 +02:00
blackbox-exporter.md	Remove Screeb	2025-08-29 15:20:21 +02:00
CLAUDE.md	fix(data): prevent division by 0	2026-03-18 18:06:00 +01:00
CONTRIBUTING.md	Update CONTRIBUTING.md	2025-11-13 16:24:49 +01:00
docker-compose.yml	feat(ui): adding copy buttons	2019-10-26 16:41:11 +02:00
Gemfile	build(deps): bump webrick from 1.7.0 to 1.8.2 (#435 )	2024-09-27 22:24:21 +02:00
Gemfile.lock	build(deps-dev): bump nokogiri from 1.18.10 to 1.19.1 (#506 )	2026-03-16 01:51:52 +01:00
index.md	Update index.md (#353 )	2023-05-03 01:13:46 +02:00
LICENSE	Changing license	2019-02-11 21:05:55 +01:00
package.json	💄 awesome-lint	2019-02-11 22:09:50 +01:00
README.md	refactor(categories): change categories and move some exporters (#528 )	2026-03-17 13:30:13 +01:00
rules.md	fix: corrects download URL for rules files (#494 )	2026-01-30 01:40:38 +01:00
sleep-peacefully.md	Update sleep-peacefully.md (#487 )	2025-12-08 15:19:11 +01:00

README.md

👋 Awesome Prometheus Alerts

Most alerting rules are common to every Prometheus setup. We need a place to find them all. 🤘 🚨 📊

Collection available here: https://samber.github.io/awesome-prometheus-alerts

^{Sponsored by:}

Cut Kubernetes & AI costs, boost application stability.

Better Stack lets you centralize, search, and visualize your logs.

🚨 Rules

Basic resource monitoring

Databases

Message brokers

Proxies, load balancers and service meshes

Runtimes

Data engineering

Orchestrators

CI/CD

Network and security

Storage

Cloud providers

Observability

Other

🤝 Contributing

Contributions from community (you!) are most welcome!

There are many ways to contribute: writing code, alerting rules, documentation, reporting issues, discussing better error tracking...

Instructions here

🏋️ Improvements

Create an alert rule builder in Jekyll for custom alerts (severity, thresholds, instances...)
Add resolution suggestions to rule descriptions, for faster incident resolution (#85).

💫 Show your support

Give a ⭐️ if this project helped you!

📝 License

Licensed under the Creative Commons 4.0 License, see LICENSE file for more detail.

README.md Unescape Escape