Kratik Jain
aa2988693b
Adding more rules for Thanos Monitoring (#340)
* Adding more rules for Thanos Components Monitoring
* lint
* lint
* lint
---------
Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2023-03-15 18:26:24 +01:00
samber
293aba1437
Publish
2023-02-26 01:34:30 +00:00
Samuel Berthe
59891728e4
Solves #336
2023-02-26 02:33:50 +01:00
Samuel Berthe
d404e69c5a
oops
2023-02-24 14:15:21 +01:00
Samuel Berthe
8ba845a23b
oops
2023-02-24 14:13:42 +01:00
Samuel Berthe
fe3362262a
adding blackbox exporter page
2023-02-24 14:10:34 +01:00
samber
b726b065ed
Publish
2023-02-23 14:20:09 +00:00
Samuel Berthe
60cb26681f
Update rules.yml
2023-02-23 15:19:36 +01:00
samber
ed35cfef1f
Publish
2023-02-17 00:14:49 +00:00
Samuel Berthe
bde83bc9ee
Update rules.yml
2023-02-17 01:14:19 +01:00
alexandrumarian-portal
1e44e348ee
Hashicorp Vault cluster health (#338)
* Hashicorp Vault cluster health
2023-02-17 01:13:41 +01:00
samber
fa56b637a1
Publish
2023-02-14 13:03:11 +00:00
Samuel Berthe
65a0f969be
Update rules.yml
2023-02-14 14:02:35 +01:00
samber
5de0ee850b
Publish
2023-02-14 13:01:25 +00:00
alexandrumarian-portal
18da40f8b4
disk io ops alarm (#337)
* disk io ops alarm
* disk io ops alarm
2023-02-14 14:00:43 +01:00
samber
50b171a491
Publish
2023-02-12 19:01:58 +00:00
Yannick Markus
7aeccf2874
Add APC UPS & ZFS exporter (#331)
* add apcupsd_exporter rules
* add zfs_exporter rules
2023-02-12 20:01:26 +01:00
dependabot[bot]
7e53aa2edd
build(deps): bump activesupport from 6.0.5 to 6.0.6.1 (#335)
Bumps [activesupport](https://github.com/rails/rails) from 6.0.5 to 6.0.6.1.
- [Release notes](https://github.com/rails/rails/releases)
- [Changelog](https://github.com/rails/rails/blob/v7.0.4.2/activesupport/CHANGELOG.md)
- [Commits](https://github.com/rails/rails/compare/v6.0.5...v6.0.6.1)
---
updated-dependencies:
- dependency-name: activesupport
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-02-02 11:18:55 +01:00
samber
9e7d8adf96
Publish
2023-01-30 16:07:23 +00:00
Jan Gosmann
df6d71bad5
Make ElasticsearchNoNewDocuments alert more robust (#334)
Use `elasticsearch_indices_indexing_index_total` instead of
`elasticsearch_indices_docs` because `elasticsearch_indices_docs` might
not update without an index refresh [1]. Refreshes happen every second
by default, *but* only if there have been search requests within the
last 30 seconds [2]. If there are no search requests for a sufficiently
long duration, the alert based on `elasticsearch_indices_docs` will fire
mistakenly.
Apart from that, `elasticsearch_indices_docs` has the gauge metric type
(while `elasticsearch_indices_indexing_index_total` is of the counter
type) and the `increase` function is not intended to be used with
gauges. Drops in the document count would be treated as a reset to 0,
thus showing an increase by all remaining documents.
[1]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-stats.html#index-stats-api-path-params
[2]: https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-refresh.html
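The gauge-versus-counter point in this message can be illustrated with a minimal sketch. It assumes a simplified model of Prometheus's `increase` (it only sums deltas between consecutive samples and treats any drop as a reset to 0, ignoring the extrapolation the real function performs). The helper name `simple_increase` and the sample values are hypothetical and are not taken from the commit.

```python
from typing import Sequence


def simple_increase(samples: Sequence[float]) -> float:
    """Simplified model of a counter-style increase over a window.

    Assumption: any drop between consecutive samples is treated as a
    counter reset to 0, so the whole new value counts as an increase.
    Real Prometheus additionally extrapolates to the window boundaries,
    which this sketch deliberately leaves out.
    """
    if len(samples) < 2:
        return 0.0
    total = 0.0
    prev = samples[0]
    for cur in samples[1:]:
        if cur < prev:
            # Drop detected: interpreted as a reset to 0, then growth to `cur`.
            total += cur
        else:
            total += cur - prev
        prev = cur
    return total


# Hypothetical gauge-like document count: 100 documents are deleted,
# nothing is indexed, yet the drop is read as a reset.
docs_gauge = [1000.0, 1000.0, 900.0, 900.0]

# Hypothetical indexing counter over the same period: no indexing happened.
indexing_counter_idle = [5000.0, 5000.0, 5000.0, 5000.0]

# Hypothetical indexing counter while 25 documents are indexed.
indexing_counter_busy = [5000.0, 5010.0, 5025.0]

print(simple_increase(docs_gauge))             # 900.0
print(simple_increase(indexing_counter_idle))  # 0.0
print(simple_increase(indexing_counter_busy))  # 25.0
```

Under this model, the document-count gauge reports an increase equal to all remaining documents after a deletion, which would mask a real "no new documents" condition, while the indexing counter stays flat when nothing is indexed.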
2023-01-30 17:06:40 +01:00
dependabot[bot]
e3a63394da
build(deps): bump commonmarker from 0.23.6 to 0.23.7 (#333)
Bumps [commonmarker](https://github.com/gjtorikian/commonmarker) from 0.23.6 to 0.23.7.
- [Release notes](https://github.com/gjtorikian/commonmarker/releases)
- [Changelog](https://github.com/gjtorikian/commonmarker/blob/main/CHANGELOG.md)
- [Commits](https://github.com/gjtorikian/commonmarker/compare/v0.23.6...v0.23.7)
---
updated-dependencies:
- dependency-name: commonmarker
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-01-30 12:37:25 +01:00
samber
06ae149603
Publish
2023-01-15 23:38:08 +00:00
Samuel Berthe
5e84329360
Update rules.yml
2023-01-16 00:37:38 +01:00
samber
e98e4d0516
Publish
2023-01-15 23:37:32 +00:00
Sören König
40478c50cc
Add under-utilized HPA alert (#330)
This alert fires when an HPA has been scaled at its minReplicas for more than half the time, which indicates possible cost savings.
In addition, it is assumed that a minimum number of replicas should still be running for redundancy.
2023-01-16 00:36:59 +01:00
samber
cdb90443b8
Publish
2023-01-13 17:36:11 +00:00
Samuel Berthe
160d0adcc2
Update rules.yml
2023-01-13 18:35:37 +01:00
Pedro Torres
ec13582bb5
fix typo in instaclustr-cassandra-exporter.yml (#332)
unvailable ==> unavailable
2023-01-13 18:35:06 +01:00
samber
ae1d84c788
Publish
2023-01-06 10:28:19 +00:00
Panos Rontogiannis
8f48bbfb25
Cert rules issues (#329)
* add comment for BlackboxSslCertificateExpired rule
* use last_over_time to make certificate rules less prone to flapping
* add lower bound thresholds on BlackboxSslCertificateWillExpireSoon rules to avoid overlap
* changed upper bound threshold for BlackboxSslCertificateWillExpireSoon to 20 days
* make BlackboxSslCertificateWillExpireSoon description clearer
* use days in certificate rules queries to improve notification values
Co-authored-by: Panos Rontogiannis <pronto@admin.grnet.gr>
2023-01-06 11:27:46 +01:00
dependabot[bot]
3c787b342e
build(deps): bump nokogiri from 1.13.9 to 1.13.10 (#324)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.13.9 to 1.13.10.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.13.9...v1.13.10)
---
updated-dependencies:
- dependency-name: nokogiri
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-08 11:19:53 +01:00
samber
32a0ce2c0b
Publish
2022-12-06 09:38:04 +00:00
Samuel Berthe
032eb896f5
rearrange
2022-12-06 10:37:09 +01:00
Samuel Berthe
edd513a40a
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-12-06 10:28:02 +01:00
samber
4f908b36fb
Publish
2022-12-06 09:27:25 +00:00
Samuel Berthe
7641c0eeb0
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-12-06 10:27:13 +01:00
michaelact
447bb94c4d
Add under-utilized host and hardware alerts (#320)
* chore: add under-utilized alerts
* docs: add under-utilized alerts
* chore: add alert consideration times
* chore: delete generated alert rules file
* chore: not using for, instead in rule
2022-12-06 10:26:50 +01:00
Samuel Berthe
076db62de4
layout: add wget script
2022-12-05 00:13:22 +01:00
samber
7e9677ba18
Publish
2022-12-04 22:13:18 +00:00
Samuel Berthe
c00dd87733
fix kube rule
2022-12-04 23:12:35 +01:00
Samuel Berthe
a381fb5e22
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2022-12-04 23:12:05 +01:00
Samuel Berthe
a0c32093cb
oops
2022-12-04 23:12:00 +01:00
samber
a4735f3512
Publish
2022-12-04 22:08:59 +00:00
MatthieuFin
a5f32a0fab
fix(rule): fixing KubernetesPodNotHealthy (#215 #253) (#263)
2022-12-04 23:08:24 +01:00
Samuel Berthe
8a31e4a4be
oops
2022-12-04 23:03:12 +01:00
Samuel Berthe
9c5933a125
oops
2022-12-04 22:50:26 +01:00
Samuel Berthe
c8d711caf2
replace widgets
2022-12-04 22:42:41 +01:00
samber
cbdd246d67
Publish
2022-11-30 11:28:57 +00:00
michaelact
4466a07962
fix: add space for labels KubernetesJobFailed alert rule (#321)
Co-authored-by: xb4dc0d3
2022-11-30 12:28:23 +01:00
samber
ebab149c50
Publish
2022-11-30 11:27:05 +00:00