Commit graph

632 commits

Author SHA1 Message Date
fzyzcjy
13e90b3aea
Update rules.yml (#371) 2023-08-15 19:42:46 +02:00
Ted Hahn
94b9f3cfbb
Fix for Postgres max connections. Postgres does not limit connections by database, but total over the server. Additionally, alert labels didn't match across the pair. Using a min by on the right side deals with the possibility additional labels are present on your exporter. (#376) 2023-08-15 19:39:41 +02:00
Samuel Berthe
15e3131547
Update rules.yml 2023-08-15 19:36:22 +02:00
Samuel Berthe
eb3220c8d7
Update rules.yml 2023-08-15 19:34:14 +02:00
Pavel Timofeev
c419732e2e
Substract failed jobs from KubernetesJobSlowCompletion (#363) 2023-08-15 19:32:51 +02:00
dependabot[bot]
1b1473eb14
build(deps-dev): bump commonmarker from 0.23.9 to 0.23.10 (#378)
Bumps [commonmarker](https://github.com/gjtorikian/commonmarker) from 0.23.9 to 0.23.10.
- [Release notes](https://github.com/gjtorikian/commonmarker/releases)
- [Changelog](https://github.com/gjtorikian/commonmarker/blob/v0.23.10/CHANGELOG.md)
- [Commits](https://github.com/gjtorikian/commonmarker/compare/v0.23.9...v0.23.10)

---
updated-dependencies:
- dependency-name: commonmarker
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-08-09 09:46:23 +02:00
Ivan Dudin
86e3e38a99
fix typo (#377) 2023-08-07 19:43:10 +02:00
Samuel Berthe
ff76ceccde
Update rules.yml 2023-07-30 22:24:31 +02:00
samber
f72620203f Publish 2023-07-30 20:22:47 +00:00
Moritz
fe5f78171a
update rules.yml (#374) 2023-07-30 22:21:20 +02:00
samber
c0ec625dc6 Publish 2023-07-29 16:22:19 +00:00
Samuel Berthe
8c811045e5
Update rules.yml 2023-07-29 18:20:58 +02:00
Roman Pertl
0eb66a5d2c
fix(ci): split rule groups into seperate files (#368)
- mitigate an issue that command line is too long for
  liquid-cli
2023-07-15 18:05:25 +02:00
Yevhen Tienkaiev
68d45a0856
Fix threshold for IstioLatency99Percentile (#366)
* Fix threshold for IstioLatency99Percentile

IstioLatency99Percentile is in milliseconds so to have 1s we need to set 1000 instead of 1

* Update embedded-exporter.yml
2023-07-12 14:47:53 +02:00
samber
3ad1536226 Publish 2023-07-12 12:34:10 +00:00
Samuel Berthe
32cf16a53d
Update rules.yml 2023-07-12 14:32:43 +02:00
samber
4394de4713 Publish 2023-07-06 11:55:47 +00:00
Samuel Berthe
1bb6c602f7
Update rules.yml 2023-07-06 13:54:31 +02:00
Samuel Berthe
5d254811b4
Update rules.yml 2023-06-27 00:28:31 +02:00
Roman Pertl
71f488d744
feat: improve rule for used connection on redis (#358)
use max allowed connections value instead of a fixed value
2023-06-27 00:27:20 +02:00
samber
7a05f925b4 Publish 2023-06-22 16:42:13 +00:00
Samuel Berthe
47b7748618
Update rules.yml 2023-06-22 18:40:33 +02:00
samber
a4dbefd853 Publish 2023-06-22 16:30:42 +00:00
Samuel Berthe
3d0c5fcafd
Update rules.yml 2023-06-22 18:29:21 +02:00
Pavel Timofeev
a3e951aa15
Update kubestate-exporter.yml (#359)
Use proper label for KubernetesJobFailed
2023-06-22 18:16:52 +02:00
samber
f9c71ab724 Publish 2023-06-22 13:02:23 +00:00
Samuel Berthe
600a759344
Update rules.yml 2023-06-22 15:01:06 +02:00
Samuel Berthe
ee86c2d233
Update rules.yml 2023-06-22 15:00:40 +02:00
Pavel Timofeev
247dabffd8
Rename KubernetesNodeReady alert (#360)
Better wording for NodeReady nodes
2023-06-22 15:00:08 +02:00
samber
ac09fd8a2d Publish 2023-05-21 20:58:38 +00:00
michaelact
7e8bc1a215
Add under-utilized container alerts (#322)
* chore: add container under-utilized allerts

* chore: resolve duplicated query and description
2023-05-21 22:58:04 +02:00
John Losito
80f3970c3b
Update blackbox-exporter.md (#352) 2023-05-03 01:15:04 +02:00
John Losito
a41fe9ede9
Update index.md (#353) 2023-05-03 01:13:46 +02:00
samber
99b101077d Publish 2023-04-28 14:06:51 +00:00
Paul-Élie Testud
c36014f03e
fix(nginx): fix nginx query for histogram_percentile (#351) 2023-04-28 16:06:12 +02:00
samber
7a874b7205 Publish 2023-04-25 08:59:28 +00:00
deimosOmegaChan
b98b2a2777
fix node-exporter nodename regex expression (#349)
nodename should not depends with the prefix "hostname"
2023-04-25 10:58:52 +02:00
Samuel Berthe
9efec14d26
chore: move from "https://awesome-prometheus-alerts.grep.to" to "https://samber.github.io/awesome-prometheus-alerts/" 2023-04-23 23:32:26 +02:00
Samuel Berthe
fa740dcb04
Delete CNAME 2023-04-23 22:18:32 +02:00
samber
9d3d52bbfa Publish 2023-04-23 20:16:41 +00:00
Madhu Sudhan
8b9fc8864f
refactor: node-exporter queries to include hostname as label which will be helpful for alerting (#348) 2023-04-23 22:16:08 +02:00
Samuel Berthe
60b4f69606
Update sleep-peacefully.md 2023-04-12 21:10:59 +02:00
dependabot[bot]
4b6fbcaa2f
build(deps): bump nokogiri from 1.13.10 to 1.14.3 (#347)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.13.10 to 1.14.3.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.13.10...v1.14.3)

---
updated-dependencies:
- dependency-name: nokogiri
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-12 21:04:11 +02:00
dependabot[bot]
8c7dcad973
build(deps): bump commonmarker from 0.23.7 to 0.23.9 (#346)
Bumps [commonmarker](https://github.com/gjtorikian/commonmarker) from 0.23.7 to 0.23.9.
- [Release notes](https://github.com/gjtorikian/commonmarker/releases)
- [Changelog](https://github.com/gjtorikian/commonmarker/blob/main/CHANGELOG.md)
- [Commits](https://github.com/gjtorikian/commonmarker/compare/v0.23.7...v0.23.9)

---
updated-dependencies:
- dependency-name: commonmarker
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-12 21:03:52 +02:00
Samuel Berthe
6c93417fea
Update sleep-peacefully.md 2023-04-12 21:03:32 +02:00
samber
603edb2536 Publish 2023-04-06 23:43:00 +00:00
Mikael Lindström
8357165cfb
Update MongoDB replication lag alert to use seconds (#344)
The mongodb_rs_members_optimeDate metric is in milliseconds, the
replication lag query has been updated to reflect this.
2023-04-07 01:42:25 +02:00
samber
ebbfc496cd Publish 2023-04-03 08:03:11 +00:00
Mikael Lindström
2617aa5dab
Fix MongoDB replication headroom query (#342)
The query was changed to use `mongodb_oplog_stats_start` and
`mongodb_oplog_stats_end` in #291 but these metrics does not represent
the start and end of the oplog. The original head and tail metrics are
calculated from the oplog and are consistent with the output of
`db.getReplicationInfo()`.
2023-04-03 10:01:25 +02:00
Samuel Berthe
f9b43cf3bf
Update rules.yml 2023-03-24 14:36:52 +01:00