Commit graph

644 commits

Author SHA1 Message Date
Samuel Berthe
053cde27e4
Update rules.yml 2023-08-22 15:51:53 +02:00
samber
4279dedb52 Publish 2023-08-19 22:41:12 +00:00
Pavel Timofeev
6b1685261d
Rework kube-state-metrics alerts (#381)
* Rework kube-state-metrics alerts:
- provide meaningful labels in summary as 'instance' label hardly makes sense in most of them
- rename some alerts to tell more accurate what the problem is
- adjust description trying to follow some kind of the message schema found in other alerts

* move changes to _data/rules.yml

* Update rules.yml

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2023-08-20 00:39:22 +02:00
Samuel Berthe
3e53d29986
fix: larger body 2023-08-15 20:38:58 +02:00
Samuel Berthe
4b818ad86f
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2023-08-15 20:38:02 +02:00
Samuel Berthe
d106e10185
fix: larger body 2023-08-15 20:37:50 +02:00
samber
afddf710ab Publish 2023-08-15 18:28:36 +00:00
Samuel Berthe
c3d78786e8
fix ci 2023-08-15 20:27:13 +02:00
Samuel Berthe
b19b403862
Update README.md 2023-08-15 20:05:13 +02:00
Samuel Berthe
5b6a86fa00
Update README.md 2023-08-15 20:03:06 +02:00
Samuel Berthe
ab7e29cfc0
Update README.md 2023-08-15 20:01:45 +02:00
Roman Pertl
ecd92399d5
feat: adding patroni alert rules (#369) 2023-08-15 19:54:15 +02:00
fzyzcjy
13e90b3aea
Update rules.yml (#371) 2023-08-15 19:42:46 +02:00
Ted Hahn
94b9f3cfbb
Fix for Postgres max connections. Postgres does not limit connections by database, but total over the server. Additionally, alert labels didn't match across the pair. Using a min by on the right side deals with the possibility additional labels are present on your exporter. (#376) 2023-08-15 19:39:41 +02:00
Samuel Berthe
15e3131547
Update rules.yml 2023-08-15 19:36:22 +02:00
Samuel Berthe
eb3220c8d7
Update rules.yml 2023-08-15 19:34:14 +02:00
Pavel Timofeev
c419732e2e
Substract failed jobs from KubernetesJobSlowCompletion (#363) 2023-08-15 19:32:51 +02:00
dependabot[bot]
1b1473eb14
build(deps-dev): bump commonmarker from 0.23.9 to 0.23.10 (#378)
Bumps [commonmarker](https://github.com/gjtorikian/commonmarker) from 0.23.9 to 0.23.10.
- [Release notes](https://github.com/gjtorikian/commonmarker/releases)
- [Changelog](https://github.com/gjtorikian/commonmarker/blob/v0.23.10/CHANGELOG.md)
- [Commits](https://github.com/gjtorikian/commonmarker/compare/v0.23.9...v0.23.10)

---
updated-dependencies:
- dependency-name: commonmarker
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-08-09 09:46:23 +02:00
Ivan Dudin
86e3e38a99
fix typo (#377) 2023-08-07 19:43:10 +02:00
Samuel Berthe
ff76ceccde
Update rules.yml 2023-07-30 22:24:31 +02:00
samber
f72620203f Publish 2023-07-30 20:22:47 +00:00
Moritz
fe5f78171a
update rules.yml (#374) 2023-07-30 22:21:20 +02:00
samber
c0ec625dc6 Publish 2023-07-29 16:22:19 +00:00
Samuel Berthe
8c811045e5
Update rules.yml 2023-07-29 18:20:58 +02:00
Roman Pertl
0eb66a5d2c
fix(ci): split rule groups into seperate files (#368)
- mitigate an issue that command line is too long for
  liquid-cli
2023-07-15 18:05:25 +02:00
Yevhen Tienkaiev
68d45a0856
Fix threshold for IstioLatency99Percentile (#366)
* Fix threshold for IstioLatency99Percentile

IstioLatency99Percentile is in milliseconds so to have 1s we need to set 1000 instead of 1

* Update embedded-exporter.yml
2023-07-12 14:47:53 +02:00
samber
3ad1536226 Publish 2023-07-12 12:34:10 +00:00
Samuel Berthe
32cf16a53d
Update rules.yml 2023-07-12 14:32:43 +02:00
samber
4394de4713 Publish 2023-07-06 11:55:47 +00:00
Samuel Berthe
1bb6c602f7
Update rules.yml 2023-07-06 13:54:31 +02:00
Samuel Berthe
5d254811b4
Update rules.yml 2023-06-27 00:28:31 +02:00
Roman Pertl
71f488d744
feat: improve rule for used connection on redis (#358)
use max allowed connections value instead of a fixed value
2023-06-27 00:27:20 +02:00
samber
7a05f925b4 Publish 2023-06-22 16:42:13 +00:00
Samuel Berthe
47b7748618
Update rules.yml 2023-06-22 18:40:33 +02:00
samber
a4dbefd853 Publish 2023-06-22 16:30:42 +00:00
Samuel Berthe
3d0c5fcafd
Update rules.yml 2023-06-22 18:29:21 +02:00
Pavel Timofeev
a3e951aa15
Update kubestate-exporter.yml (#359)
Use proper label for KubernetesJobFailed
2023-06-22 18:16:52 +02:00
samber
f9c71ab724 Publish 2023-06-22 13:02:23 +00:00
Samuel Berthe
600a759344
Update rules.yml 2023-06-22 15:01:06 +02:00
Samuel Berthe
ee86c2d233
Update rules.yml 2023-06-22 15:00:40 +02:00
Pavel Timofeev
247dabffd8
Rename KubernetesNodeReady alert (#360)
Better wording for NodeReady nodes
2023-06-22 15:00:08 +02:00
samber
ac09fd8a2d Publish 2023-05-21 20:58:38 +00:00
michaelact
7e8bc1a215
Add under-utilized container alerts (#322)
* chore: add container under-utilized allerts

* chore: resolve duplicated query and description
2023-05-21 22:58:04 +02:00
John Losito
80f3970c3b
Update blackbox-exporter.md (#352) 2023-05-03 01:15:04 +02:00
John Losito
a41fe9ede9
Update index.md (#353) 2023-05-03 01:13:46 +02:00
samber
99b101077d Publish 2023-04-28 14:06:51 +00:00
Paul-Élie Testud
c36014f03e
fix(nginx): fix nginx query for histogram_percentile (#351) 2023-04-28 16:06:12 +02:00
samber
7a874b7205 Publish 2023-04-25 08:59:28 +00:00
deimosOmegaChan
b98b2a2777
fix node-exporter nodename regex expression (#349)
nodename should not depends with the prefix "hostname"
2023-04-25 10:58:52 +02:00
Samuel Berthe
9efec14d26
chore: move from "https://awesome-prometheus-alerts.grep.to" to "https://samber.github.io/awesome-prometheus-alerts/" 2023-04-23 23:32:26 +02:00