🚨 Collection of Prometheus alerting rules
Jan Gosmann df6d71bad5
Make ElasticsearchNoNewDocuments alert more robust (#334)
Use `elasticsearch_indices_indexing_index_total` instead of
`elasticsearch_indices_docs` because `elasticsearch_indices_docs` might
not update without an index refresh [1]. Refreshes happen every second
by default, *but* only if there have been search requests within the
last 30 seconds [2]. If there are no search requests for a sufficiently
long duration, the alert based on `elasticsearch_indices_docs` will fire
mistakenly.

Apart from that, `elasticsearch_indices_docs` has the gauge metric type
(while `elasticsearch_indices_indexing_index_total` is of the counter
type) and the `increase` function is not intended to be used with
gauges. Drops in the document count would be treated as a reset to 0,
thus showing an increase by all remaining documents.
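
The reset handling described above can be illustrated with a small sketch. It is a simplified model of how `increase` treats drops, not the actual Prometheus implementation (which also extrapolates to the window boundaries). The function name `naive_increase` and the sample values are made up for illustration.

```python
from typing import Sequence


def naive_increase(samples: Sequence[float]) -> float:
    """Counter-style increase over a window, without Prometheus' extrapolation.

    Any drop between consecutive samples is treated as a counter reset to 0,
    so the value after the drop is counted in full.
    """
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        if cur < prev:
            total += cur  # assumed reset: counter restarted from 0
        else:
            total += cur - prev
    return total


# Hypothetical gauge-like document count: 100 documents are deleted,
# nothing new is indexed.
docs_gauge = [1000.0, 1000.0, 900.0, 900.0]
# Hypothetical indexing counter over the same window: no new documents.
indexing_counter = [5000.0, 5000.0, 5000.0, 5000.0]

print(naive_increase(docs_gauge))        # 900.0
print(naive_increase(indexing_counter))  # 0.0
```

In this model the gauge reports an increase equal to all remaining documents even though nothing was indexed, while the counter correctly reports no increase.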

[1]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-stats.html#index-stats-api-path-params
[2]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-refresh.html
2023-01-30 17:06:40 +01:00

👋 Awesome Prometheus Alerts

Most alerting rules are common to every Prometheus setup. We need a place to find them all. 🤘 🚨 📊

Collection available here: https://awesome-prometheus-alerts.grep.to

Contents

🚨 Rules

Basic resource monitoring

Databases and brokers

Reverse proxies and load balancers

Runtimes

Orchestrators

Network, security and storage

Other

🤝 Contributing

Contributions from the community (you!) are most welcome!

There are many ways to contribute: writing code, alerting rules, documentation, reporting issues, discussing better error tracking...

Instructions are in CONTRIBUTING.md.

🏋️ Improvements

  • Create an alert rule builder in Jekyll for custom alerts (severity, thresholds, instances...)
  • Add resolution suggestions to rule descriptions, for faster incident resolution (#85).

💫 Show your support

Give a ⭐️ if this project helped you!


👏 Thanks

Thanks to the GitLab operations team, which provided 50+ rules. \o/

📝 License


Licensed under the Creative Commons 4.0 License; see the LICENSE file for more details.