🚨 Collection of Prometheus alerting rules
Find a file
Samuel Berthe c2615fae52
fix/promql rules review 2 (#534)
* fix(data): fix queries and thresholds across multiple exporters

- Ceph: fix OSD latency metric name (ceph_osd_apply_latency_ms), replace
  ceph_osd_utilization with ceph_health_detail{name="OSD_NEARFULL"}, add for: durations
- ZFS: improve description, remove incorrect ON() join on readonly check
- Thanos: filter gRPC errors to actual error codes only (drop NotFound, Cancelled, etc.)
- Loki/Promtail: fix histogram_quantile to aggregate by (namespace, job, route, le)
- Mimir: raise rate()>0 thresholds to >0.05, add missing for: durations
- OTel Collector: raise rate()>0 thresholds to >0.05, add deprecation comments
- Tempo/Cortex: raise >0 thresholds to avoid transient spikes
- APC UPS: add division-by-zero guard on battery voltage ratio
- DigitalOcean: raise increase()>0 to >3
- Grafana Alloy: fix missing name: field on exporter
- Graph Node: add threshold comments

* fix(data): remove official mixin reference from Ceph OSD comment

* fix(data): remove official mixin references from comments
2026-04-06 21:14:15 +02:00
.github fix ci 2026-03-16 19:17:05 +01:00
_data fix/promql rules review 2 (#534) 2026-04-06 21:14:15 +02:00
_layouts fix: fix favicon path 2026-03-15 23:54:05 +01:00
assets Website: Support dark mode (#501) 2026-03-01 22:54:42 +01:00
dist Publish 2026-04-06 18:38:45 +00:00
.gitignore Feat/jaeger alerting rules (#521) 2026-03-16 14:09:03 +01:00
.travis.yml 💄 awesome-lint 2019-02-11 22:09:50 +01:00
_config.yml chore: move from "https://awesome-prometheus-alerts.grep.to" to "https://samber.github.io/awesome-prometheus-alerts/" 2023-04-23 23:32:26 +02:00
alertmanager.md Update alertmanager.md 2024-10-06 17:31:23 +02:00
blackbox-exporter.md Remove Screeb 2025-08-29 15:20:21 +02:00
CLAUDE.md docs: update CLAUDE.md with lessons from PromQL review 2026-04-06 21:08:48 +02:00
CONTRIBUTING.md Update CONTRIBUTING.md 2025-11-13 16:24:49 +01:00
docker-compose.yml feat(ui): adding copy buttons 2019-10-26 16:41:11 +02:00
Gemfile build(deps): bump webrick from 1.7.0 to 1.8.2 (#435) 2024-09-27 22:24:21 +02:00
Gemfile.lock build(deps-dev): bump activesupport from 7.2.3 to 7.2.3.1 (#531) 2026-03-24 08:24:41 +01:00
index.md Update index.md (#353) 2023-05-03 01:13:46 +02:00
LICENSE Changing license 2019-02-11 21:05:55 +01:00
package.json 💄 awesome-lint 2019-02-11 22:09:50 +01:00
README.md Update README.md 2026-03-31 16:41:19 +02:00
rules.md fix: corrects download URL for rules files (#494) 2026-01-30 01:40:38 +01:00
sleep-peacefully.md Update sleep-peacefully.md (#487) 2025-12-08 15:19:11 +01:00

👋 Awesome Prometheus Alerts Awesome

Most alerting rules are common to every Prometheus setup. We need a place to find them all. 🤘 🚨 📊

Collection available here: https://samber.github.io/awesome-prometheus-alerts

Contents

🚨 Rules

Basic resource monitoring

Databases

Message brokers

Proxies, load balancers and service meshes

Runtimes

Data engineering

Orchestrators

CI/CD

Network and security

Storage

Cloud providers

Observability

Other

🤝 Contributing

Contributions from community (you!) are most welcome!

There are many ways to contribute: writing code, alerting rules, documentation, reporting issues, discussing better error tracking...

Instructions here

🏋️ Improvements

  • Create an alert rule builder in Jekyll for custom alerts (severity, thresholds, instances...)
  • Add resolution suggestions to rule descriptions, for faster incident resolution (#85).

💫 Show your support

Give a if this project helped you!

support us

📝 License

CC4

Licensed under the Creative Commons 4.0 License, see LICENSE file for more detail.