samber
9e7d8adf96
Publish
2023-01-30 16:07:23 +00:00
Jan Gosmann
df6d71bad5
Make ElasticsearchNoNewDocuments alert more robust ( #334 )
...
Use `elasticsearch_indices_indexing_index_total` instead of
`elasticsearch_indices_docs` because `elasticsearch_indices_docs` might
not update without an index refresh [1]. Refreshes happen every second
by default, *but* only if there have been search requests within the
last 30 seconds [2]. If there are no search requests for a sufficiently
long duration, the alert based on `elasticsearch_indices_docs` will fire
mistakenly.
Apart from that, `elasticsearch_indices_docs` has the gauge metric type
(while `elasticsearch_indices_indexing_index_total` is of the counter
type) and the `increase` function is not intended to be used with
gauges. Drops in the document count would be treated as a reset to 0,
thus showing an increase by all remaining documents.
[1]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-stats.html#index-stats-api-path-params
[2]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-refresh.html
2023-01-30 17:06:40 +01:00
dependabot[bot]
e3a63394da
build(deps): bump commonmarker from 0.23.6 to 0.23.7 ( #333 )
...
Bumps [commonmarker](https://github.com/gjtorikian/commonmarker ) from 0.23.6 to 0.23.7.
- [Release notes](https://github.com/gjtorikian/commonmarker/releases )
- [Changelog](https://github.com/gjtorikian/commonmarker/blob/main/CHANGELOG.md )
- [Commits](https://github.com/gjtorikian/commonmarker/compare/v0.23.6...v0.23.7 )
---
updated-dependencies:
- dependency-name: commonmarker
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-01-30 12:37:25 +01:00
samber
06ae149603
Publish
2023-01-15 23:38:08 +00:00
Samuel Berthe
5e84329360
Update rules.yml
2023-01-16 00:37:38 +01:00
samber
e98e4d0516
Publish
2023-01-15 23:37:32 +00:00
Sören König
40478c50cc
Add under-utilized HPA alert ( #330 )
...
This alert should inform when HPAs are scaled more than half the time at their minReplicas, which is an indication of possible cost savings.
In addition, it is assumed that a minimum number of replicas should still be running for redundancy.
2023-01-16 00:36:59 +01:00
samber
cdb90443b8
Publish
2023-01-13 17:36:11 +00:00
Samuel Berthe
160d0adcc2
Update rules.yml
2023-01-13 18:35:37 +01:00
Pedro Torres
ec13582bb5
fix typo in instaclustr-cassandra-exporter.yml ( #332 )
...
unvailable ==> unavailable
2023-01-13 18:35:06 +01:00
samber
ae1d84c788
Publish
2023-01-06 10:28:19 +00:00
Panos Rontogiannis
8f48bbfb25
Cert rules issues ( #329 )
...
* add comment for BlackboxSslCertificateExpired rule
* use last_over_time to make certificate rules less prone to flapping
* add lower bound thresholds on BlackboxSslCertificateWillExpireSoon rules to avoid overlap
* changed upper bound threshold for BlackboxSslCertificateWillExpireSoon to 20 days
* make BlackboxSslCertificateWillExpireSoon description clearer
* use days in certificate rules queries to improve notification values
Co-authored-by: Panos Rontogiannis <pronto@admin.grnet.gr>
2023-01-06 11:27:46 +01:00
dependabot[bot]
3c787b342e
build(deps): bump nokogiri from 1.13.9 to 1.13.10 ( #324 )
...
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri ) from 1.13.9 to 1.13.10.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases )
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md )
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.13.9...v1.13.10 )
---
updated-dependencies:
- dependency-name: nokogiri
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-08 11:19:53 +01:00
samber
32a0ce2c0b
Publish
2022-12-06 09:38:04 +00:00
Samuel Berthe
032eb896f5
rearrange
2022-12-06 10:37:09 +01:00
Samuel Berthe
edd513a40a
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-12-06 10:28:02 +01:00
samber
4f908b36fb
Publish
2022-12-06 09:27:25 +00:00
Samuel Berthe
7641c0eeb0
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-12-06 10:27:13 +01:00
michaelact
447bb94c4d
Add under-utilized host and hardware alerts ( #320 )
...
* chore: add under-utilized alerts
* docs: add under-utilized alerts
* chore: add alert consideration times
* chore: delete generated alert rules file
* chore: not using for, instead in rule
2022-12-06 10:26:50 +01:00
Samuel Berthe
076db62de4
layout: add wget script
2022-12-05 00:13:22 +01:00
samber
7e9677ba18
Publish
2022-12-04 22:13:18 +00:00
Samuel Berthe
c00dd87733
fix kube rule
2022-12-04 23:12:35 +01:00
Samuel Berthe
a381fb5e22
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-12-04 23:12:05 +01:00
Samuel Berthe
a0c32093cb
oops
2022-12-04 23:12:00 +01:00
samber
a4735f3512
Publish
2022-12-04 22:08:59 +00:00
MatthieuFin
a5f32a0fab
fix(rule): fixing KubernetesPodNotHealthy ( #215 #253 ) ( #263 )
2022-12-04 23:08:24 +01:00
Samuel Berthe
8a31e4a4be
oops
2022-12-04 23:03:12 +01:00
Samuel Berthe
9c5933a125
oops
2022-12-04 22:50:26 +01:00
Samuel Berthe
c8d711caf2
replace widgets
2022-12-04 22:42:41 +01:00
samber
cbdd246d67
Publish
2022-11-30 11:28:57 +00:00
michaelact
4466a07962
fix: add space for labels KubernetesJobFailed alert rule ( #321 )
...
Co-authored-by: xb4dc0d3
2022-11-30 12:28:23 +01:00
samber
ebab149c50
Publish
2022-11-30 11:27:05 +00:00
Samuel Berthe
1b25cbe568
See #323
2022-11-30 12:26:36 +01:00
samber
7b4de0ce60
Publish
2022-11-30 11:25:55 +00:00
Todd Lyons
85a9cc8e21
Reorder mysql metrics in alerts ( #323 )
...
Was graphing the server ID instead of the values for io running,
sql running, and lag.
Co-authored-by: Todd Lyons <todd.lyons@appetize.com>
2022-11-30 12:25:16 +01:00
samber
abde89f88b
Publish
2022-11-15 08:48:35 +00:00
Samuel Berthe
11948f27d1
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-11-15 09:47:45 +01:00
Samuel Berthe
5956d28148
data: fix haproxy rule #319
2022-11-15 09:47:34 +01:00
samber
be76ab0968
Publish
2022-11-11 13:49:41 +00:00
0xb4dc0d3
d10daaed67
fix: add space for labels KubernetesJobFailed alert rule ( #311 )
2022-11-11 14:49:01 +01:00
samber
91c9877bc6
Publish
2022-11-11 13:47:33 +00:00
Samuel Berthe
f484d30d66
data: fix haproxy rule #319
2022-11-11 14:46:56 +01:00
Valery Voronov
1e46eacbe7
fix: added NodeNetworkUnavailable alerts, rm unused OOD alert ( #318 )
2022-10-31 15:47:27 +01:00
samber
670787ae34
Publish
2022-10-31 09:11:35 +00:00
Nicolai Antiferov
9419e3fe7e
fix: Update elasticsearch_exporter repository ( #317 )
...
Was migrated some time ago to https://github.com/prometheus-community/elasticsearch_exporter
Fix #316
2022-10-31 10:10:46 +01:00
samber
1b7ba0d97c
Publish
2022-10-24 14:56:13 +00:00
Samuel Berthe
cdf4551ab7
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-10-24 16:55:36 +02:00
Samuel Berthe
19c4223ce7
fix(minio): update queries
2022-10-24 16:54:38 +02:00
samber
6c9c521150
Publish
2022-10-24 11:47:50 +00:00
meoww-bot
98d8a7b53b
fix: check inodes space for all mountpoints ( #315 )
2022-10-24 13:47:12 +02:00