Commit graph

336 commits

Author SHA1 Message Date
Sören König
40478c50cc
Add under-utilized HPA alert (#330)
This alert fires when an HPA spends more than half of the time scaled to its minReplicas, which indicates possible cost savings.
It also assumes that a minimum number of replicas should still be running for redundancy.
2023-01-16 00:36:59 +01:00
Samuel Berthe
160d0adcc2
Update rules.yml 2023-01-13 18:35:37 +01:00
Panos Rontogiannis
8f48bbfb25
Cert rules issues (#329)
* add comment for BlackboxSslCertificateExpired rule

* use last_over_time to make certificate rules less prone to flapping

* add lower bound thresholds on BlackboxSslCertificateWillExpireSoon rules to avoid overlap

* changed upper bound threshold for BlackboxSslCertificateWillExpireSoon to 20 days

* make BlackboxSslCertificateWillExpireSoon description clearer

* use days in certificate rules queries to improve notification values

Co-authored-by: Panos Rontogiannis <pronto@admin.grnet.gr>
2023-01-06 11:27:46 +01:00
Samuel Berthe
032eb896f5
rearrange 2022-12-06 10:37:09 +01:00
michaelact
447bb94c4d
Add under-utilized host and hardware alerts (#320)
* chore: add under-utilized alerts

* docs: add under-utilized alerts

* chore: add alert consideration times

* chore: delete generated alert rules file

* chore: not using for, instead in rule
2022-12-06 10:26:50 +01:00
Samuel Berthe
c00dd87733
fix kube rule 2022-12-04 23:12:35 +01:00
Samuel Berthe
a381fb5e22
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-12-04 23:12:05 +01:00
Samuel Berthe
a0c32093cb
oops 2022-12-04 23:12:00 +01:00
MatthieuFin
a5f32a0fab
fix(rule): fixing KubernetesPodNotHealthy (#215 #253) (#263) 2022-12-04 23:08:24 +01:00
michaelact
4466a07962
fix: add space for labels KubernetesJobFailed alert rule (#321)
Co-authored-by: xb4dc0d3
2022-11-30 12:28:23 +01:00
Samuel Berthe
1b25cbe568
See #323 2022-11-30 12:26:36 +01:00
Samuel Berthe
5956d28148
data: fix haproxy rule #319 2022-11-15 09:47:34 +01:00
Samuel Berthe
f484d30d66
data: fix haproxy rule #319 2022-11-11 14:46:56 +01:00
Valery Voronov
1e46eacbe7
fix: added NodeNetworkUnavailable alerts, rm unused OOD alert (#318) 2022-10-31 15:47:27 +01:00
Nicolai Antiferov
9419e3fe7e
fix: Update elasticsearch_exporter repository (#317)
Was migrated some time ago to https://github.com/prometheus-community/elasticsearch_exporter

Fix #316
2022-10-31 10:10:46 +01:00
Samuel Berthe
cdf4551ab7
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-10-24 16:55:36 +02:00
Samuel Berthe
19c4223ce7
fix(minio): update queries 2022-10-24 16:54:38 +02:00
meoww-bot
98d8a7b53b
fix: check inodes space for all mountpoints (#315) 2022-10-24 13:47:12 +02:00
Samuel Berthe
6ba9eb104c
feat: adding cloudflare exporter (#310) 2022-10-03 16:57:24 +02:00
Yonah Dissen
55b049eb28
add argocd rules (#309)
* add argocd rules

* fix(argocd): move contrib into _data/rules.yml instead of dist/...

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2022-10-02 18:05:30 +02:00
meoww-bot
86d5efe399
Fix broken link (#305) 2022-08-30 09:51:07 +02:00
Samuel Berthe
40c0ff32f0
oops 2022-08-28 17:47:17 +02:00
Brett
0887515f98
Added query for node warmup before reporting it's down (#304)
Co-authored-by: Brett Yoakum <yoakum@adobe.com>
2022-08-28 16:31:15 +02:00
Samuel Berthe
b49a49c920
Update rules.yml 2022-08-16 20:17:46 +02:00
Samuel Berthe
250a71e95a
fix(postgresql): remove broken rules 2022-08-01 22:43:30 +02:00
Samuel Berthe
d8f7ecd5b4
adding zpool alert 2022-07-24 01:56:17 +02:00
Samuel Berthe
34081e4f43
fix #292 2022-07-24 00:42:21 +02:00
Samuel Berthe
9bbb65ffe1
Update rules.yml 2022-07-24 00:20:54 +02:00
Samuel Berthe
67266bbca6
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-07-06 12:50:02 +02:00
Samuel Berthe
95af2b4d95
fix: fix quantile query 2022-07-06 12:49:49 +02:00
Pooya
03fdabbfc5
Changed metric names to match new metric names. (#291)
* Changed alert names to match new alert names.

* Added MongodbReplicaMemberHealth to check the health of replica members, which is exposed in the new metrics

Co-authored-by: Pooya Dowlatabadi <pooya.dowlatabadi@arvancloud.com>
Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2022-06-27 17:29:07 +02:00
Samuel Berthe
4201302285
Update rules.yml 2022-06-23 22:29:21 +02:00
Samuel Berthe
9bbe04799f
feat: build and publish into dist/rules 2022-06-15 01:42:18 +02:00
Samuel Berthe
cbc20228e2
fix #226 2022-06-14 22:12:00 +02:00
Samuel Berthe
10b810fd6e
fix #276 2022-06-14 22:03:34 +02:00
Samuel Berthe
23876f8c6b
fix #155 2022-06-14 22:00:00 +02:00
Samuel Berthe
075d85b2d6
fix #236 2022-06-14 21:36:59 +02:00
Samuel Berthe
72a0d78638
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-06-14 21:29:22 +02:00
Samuel Berthe
e82b504e00
fixes #251 2022-06-14 21:29:12 +02:00
Bastien Dronneau
bac2e99aee
docs(postgresql): add auto prefix in order to match query (#288) 2022-06-14 21:19:00 +02:00
Samuel Berthe
b36ea8f45d
data: adding rule "Host CPU high iowait" 2022-06-09 02:04:45 +02:00
Samuel Berthe
0207783284
data: change postgresql exporter name 2022-06-09 01:00:35 +02:00
Samuel Berthe
3faf1332a1
fix: PrometheusAllTargetsMissing (#283) 2022-06-09 00:43:40 +02:00
Samuel Berthe
2323541f2d
data: adding mgob query 2022-06-09 00:23:17 +02:00
Samuel Berthe
08d482f314
doc: add postgresql bloat 2022-06-07 02:32:09 +02:00
Samuel Berthe
4662cd2812
doc: improve pulsar doc 2022-06-07 01:29:31 +02:00
Marcel Körtgen
074e3e6d04
Add pulsar rules (#286)
* Add pulsar rules

* Add webrick, cf.:
- https://github.com/github/pages-gem/issues/752

* Update gems (minitest / ruby 3 issue)

* Add repo info (workaround), cf.
- https://github.com/jekyll/jekyll/issues/4705
2022-06-07 01:21:10 +02:00
Samuel Berthe
4d26719d41
removed some rules 2022-04-19 00:07:31 +02:00
Samuel Berthe
97810b6537
change severity of PostgresqlConfigurationChanged to info 2022-04-18 23:37:17 +02:00
Samuel Berthe
8941f71c6c
chore(ci): adding test with promtool (#281) 2022-04-18 23:30:32 +02:00