Commit graph

562 commits

Author SHA1 Message Date
dependabot[bot]
7e53aa2edd
build(deps): bump activesupport from 6.0.5 to 6.0.6.1 (#335)
Bumps [activesupport](https://github.com/rails/rails) from 6.0.5 to 6.0.6.1.
- [Release notes](https://github.com/rails/rails/releases)
- [Changelog](https://github.com/rails/rails/blob/v7.0.4.2/activesupport/CHANGELOG.md)
- [Commits](https://github.com/rails/rails/compare/v6.0.5...v6.0.6.1)

---
updated-dependencies:
- dependency-name: activesupport
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-02-02 11:18:55 +01:00
samber
9e7d8adf96 Publish 2023-01-30 16:07:23 +00:00
Jan Gosmann
df6d71bad5
Make ElasticsearchNoNewDocuments alert more robust (#334)
Use `elasticsearch_indices_indexing_index_total` instead of
`elasticsearch_indices_docs` because `elasticsearch_indices_docs` might
not update without an index refresh [1]. Refreshes happen every second
by default, *but* only if there have been search requests within the
last 30 seconds [2]. If there are no search requests for a sufficiently
long duration, the alert based on `elasticsearch_indices_docs` will fire
mistakenly.

Apart from that, `elasticsearch_indices_docs` has the gauge metric type
(while `elasticsearch_indices_indexing_index_total` is of the counter
type) and the `increase` function is not intended to be used with
gauges. Drops in the document count would be treated as a reset to 0,
thus showing an increase by all remaining documents.

[1]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-stats.html#index-stats-api-path-params
[2]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-refresh.html
2023-01-30 17:06:40 +01:00
dependabot[bot]
e3a63394da
build(deps): bump commonmarker from 0.23.6 to 0.23.7 (#333)
Bumps [commonmarker](https://github.com/gjtorikian/commonmarker) from 0.23.6 to 0.23.7.
- [Release notes](https://github.com/gjtorikian/commonmarker/releases)
- [Changelog](https://github.com/gjtorikian/commonmarker/blob/main/CHANGELOG.md)
- [Commits](https://github.com/gjtorikian/commonmarker/compare/v0.23.6...v0.23.7)

---
updated-dependencies:
- dependency-name: commonmarker
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-01-30 12:37:25 +01:00
samber
06ae149603 Publish 2023-01-15 23:38:08 +00:00
Samuel Berthe
5e84329360
Update rules.yml 2023-01-16 00:37:38 +01:00
samber
e98e4d0516 Publish 2023-01-15 23:37:32 +00:00
Sören König
40478c50cc
Add under-utilized HPA alert (#330)
This alert should inform when HPAs are scaled more than half the time at their minReplicas, which is an indication of possible cost savings.
In addition, it is assumed that a minimum number of replicas should still be running for redundancy.
2023-01-16 00:36:59 +01:00
samber
cdb90443b8 Publish 2023-01-13 17:36:11 +00:00
Samuel Berthe
160d0adcc2
Update rules.yml 2023-01-13 18:35:37 +01:00
Pedro Torres
ec13582bb5
fix typo in instaclustr-cassandra-exporter.yml (#332)
unvailable ==> unavailable
2023-01-13 18:35:06 +01:00
samber
ae1d84c788 Publish 2023-01-06 10:28:19 +00:00
Panos Rontogiannis
8f48bbfb25
Cert rules issues (#329)
* add comment for BlackboxSslCertificateExpired rule

* use last_over_time to make certificate rules less prone to flapping

* add lower bound thresholds on BlackboxSslCertificateWillExpireSoon rules to avoid overlap

* changed upper bound threshold for BlackboxSslCertificateWillExpireSoon to 20 days

* make BlackboxSslCertificateWillExpireSoon description clearer

* use days in certificate rules queries to improve notification values

Co-authored-by: Panos Rontogiannis <pronto@admin.grnet.gr>
2023-01-06 11:27:46 +01:00
dependabot[bot]
3c787b342e
build(deps): bump nokogiri from 1.13.9 to 1.13.10 (#324)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.13.9 to 1.13.10.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.13.9...v1.13.10)

---
updated-dependencies:
- dependency-name: nokogiri
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-08 11:19:53 +01:00
samber
32a0ce2c0b Publish 2022-12-06 09:38:04 +00:00
Samuel Berthe
032eb896f5
rearrange 2022-12-06 10:37:09 +01:00
Samuel Berthe
edd513a40a
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-12-06 10:28:02 +01:00
samber
4f908b36fb Publish 2022-12-06 09:27:25 +00:00
Samuel Berthe
7641c0eeb0
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-12-06 10:27:13 +01:00
michaelact
447bb94c4d
Add under-utilized host and hardware alerts (#320)
* chore: add under-utilized alerts

* docs: add under-utilized alerts

* chore: add alert consideration times

* chore: delete generated alert rules file

* chore: not using for, instead in rule
2022-12-06 10:26:50 +01:00
Samuel Berthe
076db62de4
layout: add wget script 2022-12-05 00:13:22 +01:00
samber
7e9677ba18 Publish 2022-12-04 22:13:18 +00:00
Samuel Berthe
c00dd87733
fix kube rule 2022-12-04 23:12:35 +01:00
Samuel Berthe
a381fb5e22
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-12-04 23:12:05 +01:00
Samuel Berthe
a0c32093cb
oops 2022-12-04 23:12:00 +01:00
samber
a4735f3512 Publish 2022-12-04 22:08:59 +00:00
MatthieuFin
a5f32a0fab
fix(rule): fixing KubernetesPodNotHealthy (#215 #253) (#263) 2022-12-04 23:08:24 +01:00
Samuel Berthe
8a31e4a4be
oops 2022-12-04 23:03:12 +01:00
Samuel Berthe
9c5933a125
oops 2022-12-04 22:50:26 +01:00
Samuel Berthe
c8d711caf2
replace widgets 2022-12-04 22:42:41 +01:00
samber
cbdd246d67 Publish 2022-11-30 11:28:57 +00:00
michaelact
4466a07962
fix: add space for labels KubernetesJobFailed alert rule (#321)
Co-authored-by: xb4dc0d3
2022-11-30 12:28:23 +01:00
samber
ebab149c50 Publish 2022-11-30 11:27:05 +00:00
Samuel Berthe
1b25cbe568
See #323 2022-11-30 12:26:36 +01:00
samber
7b4de0ce60 Publish 2022-11-30 11:25:55 +00:00
Todd Lyons
85a9cc8e21
Reorder mysql metrics in alerts (#323)
Was graphing the server ID instead of the values for io running,
sql running, and lag.

Co-authored-by: Todd Lyons <todd.lyons@appetize.com>
2022-11-30 12:25:16 +01:00
samber
abde89f88b Publish 2022-11-15 08:48:35 +00:00
Samuel Berthe
11948f27d1
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-11-15 09:47:45 +01:00
Samuel Berthe
5956d28148
data: fix haproxy rule #319 2022-11-15 09:47:34 +01:00
samber
be76ab0968 Publish 2022-11-11 13:49:41 +00:00
0xb4dc0d3
d10daaed67
fix: add space for labels KubernetesJobFailed alert rule (#311) 2022-11-11 14:49:01 +01:00
samber
91c9877bc6 Publish 2022-11-11 13:47:33 +00:00
Samuel Berthe
f484d30d66
data: fix haproxy rule #319 2022-11-11 14:46:56 +01:00
Valery Voronov
1e46eacbe7
fix: added NodeNetworkUnavailable alerts, rm unused OOD alert (#318) 2022-10-31 15:47:27 +01:00
samber
670787ae34 Publish 2022-10-31 09:11:35 +00:00
Nicolai Antiferov
9419e3fe7e
fix: Update elasticsearch_exporter repository (#317)
Was migrated some time ago to https://github.com/prometheus-community/elasticsearch_exporter

Fix #316
2022-10-31 10:10:46 +01:00
samber
1b7ba0d97c Publish 2022-10-24 14:56:13 +00:00
Samuel Berthe
cdf4551ab7
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2022-10-24 16:55:36 +02:00
Samuel Berthe
19c4223ce7
fix(minio): update queries 2022-10-24 16:54:38 +02:00
samber
6c9c521150 Publish 2022-10-24 11:47:50 +00:00