Commit graph

613 commits

Author SHA1 Message Date
Roman Pertl
71f488d744
feat: improve rule for used connection on redis (#358)
use max allowed connections value instead of a fixed value
2023-06-27 00:27:20 +02:00
samber
7a05f925b4 Publish 2023-06-22 16:42:13 +00:00
Samuel Berthe
47b7748618
Update rules.yml 2023-06-22 18:40:33 +02:00
samber
a4dbefd853 Publish 2023-06-22 16:30:42 +00:00
Samuel Berthe
3d0c5fcafd
Update rules.yml 2023-06-22 18:29:21 +02:00
Pavel Timofeev
a3e951aa15
Update kubestate-exporter.yml (#359)
Use proper label for KubernetesJobFailed
2023-06-22 18:16:52 +02:00
samber
f9c71ab724 Publish 2023-06-22 13:02:23 +00:00
Samuel Berthe
600a759344
Update rules.yml 2023-06-22 15:01:06 +02:00
Samuel Berthe
ee86c2d233
Update rules.yml 2023-06-22 15:00:40 +02:00
Pavel Timofeev
247dabffd8
Rename KubernetesNodeReady alert (#360)
Better wording for NodeReady nodes
2023-06-22 15:00:08 +02:00
samber
ac09fd8a2d Publish 2023-05-21 20:58:38 +00:00
michaelact
7e8bc1a215
Add under-utilized container alerts (#322)
* chore: add container under-utilized allerts

* chore: resolve duplicated query and description
2023-05-21 22:58:04 +02:00
John Losito
80f3970c3b
Update blackbox-exporter.md (#352) 2023-05-03 01:15:04 +02:00
John Losito
a41fe9ede9
Update index.md (#353) 2023-05-03 01:13:46 +02:00
samber
99b101077d Publish 2023-04-28 14:06:51 +00:00
Paul-Élie Testud
c36014f03e
fix(nginx): fix nginx query for histogram_percentile (#351) 2023-04-28 16:06:12 +02:00
samber
7a874b7205 Publish 2023-04-25 08:59:28 +00:00
deimosOmegaChan
b98b2a2777
fix node-exporter nodename regex expression (#349)
nodename should not depends with the prefix "hostname"
2023-04-25 10:58:52 +02:00
Samuel Berthe
9efec14d26
chore: move from "https://awesome-prometheus-alerts.grep.to" to "https://samber.github.io/awesome-prometheus-alerts/" 2023-04-23 23:32:26 +02:00
Samuel Berthe
fa740dcb04
Delete CNAME 2023-04-23 22:18:32 +02:00
samber
9d3d52bbfa Publish 2023-04-23 20:16:41 +00:00
Madhu Sudhan
8b9fc8864f
refactor: node-exporter queries to include hostname as label which will be helpful for alerting (#348) 2023-04-23 22:16:08 +02:00
Samuel Berthe
60b4f69606
Update sleep-peacefully.md 2023-04-12 21:10:59 +02:00
dependabot[bot]
4b6fbcaa2f
build(deps): bump nokogiri from 1.13.10 to 1.14.3 (#347)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.13.10 to 1.14.3.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.13.10...v1.14.3)

---
updated-dependencies:
- dependency-name: nokogiri
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-12 21:04:11 +02:00
dependabot[bot]
8c7dcad973
build(deps): bump commonmarker from 0.23.7 to 0.23.9 (#346)
Bumps [commonmarker](https://github.com/gjtorikian/commonmarker) from 0.23.7 to 0.23.9.
- [Release notes](https://github.com/gjtorikian/commonmarker/releases)
- [Changelog](https://github.com/gjtorikian/commonmarker/blob/main/CHANGELOG.md)
- [Commits](https://github.com/gjtorikian/commonmarker/compare/v0.23.7...v0.23.9)

---
updated-dependencies:
- dependency-name: commonmarker
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-12 21:03:52 +02:00
Samuel Berthe
6c93417fea
Update sleep-peacefully.md 2023-04-12 21:03:32 +02:00
samber
603edb2536 Publish 2023-04-06 23:43:00 +00:00
Mikael Lindström
8357165cfb
Update MongoDB replication lag alert to use seconds (#344)
The mongodb_rs_members_optimeDate metric is in milliseconds, the
replication lag query has been updated to reflect this.
2023-04-07 01:42:25 +02:00
samber
ebbfc496cd Publish 2023-04-03 08:03:11 +00:00
Mikael Lindström
2617aa5dab
Fix MongoDB replication headroom query (#342)
The query was changed to use `mongodb_oplog_stats_start` and
`mongodb_oplog_stats_end` in #291 but these metrics does not represent
the start and end of the oplog. The original head and tail metrics are
calculated from the oplog and are consistent with the output of
`db.getReplicationInfo()`.
2023-04-03 10:01:25 +02:00
Samuel Berthe
f9b43cf3bf
Update rules.yml 2023-03-24 14:36:52 +01:00
Julien Lecomte
baa4f223cd
Ignore temperature from tctl sensors (#341) 2023-03-24 14:36:24 +01:00
Samuel Berthe
1652c24e67
Create .github/FUNDING.yml 2023-03-21 10:06:15 +01:00
samber
2ead3bcbd8 Publish 2023-03-15 17:27:02 +00:00
Kratik Jain
aa2988693b
Adding more rules for Thanos Monitoring (#340)
* Adding more rules for Thanos Components Monitoring

* lint

* lint

* lint

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2023-03-15 18:26:24 +01:00
samber
293aba1437 Publish 2023-02-26 01:34:30 +00:00
Samuel Berthe
59891728e4
Solves #336 2023-02-26 02:33:50 +01:00
Samuel Berthe
d404e69c5a
oops 2023-02-24 14:15:21 +01:00
Samuel Berthe
8ba845a23b
oops 2023-02-24 14:13:42 +01:00
Samuel Berthe
fe3362262a
adding blackbox exporter page 2023-02-24 14:10:34 +01:00
samber
b726b065ed Publish 2023-02-23 14:20:09 +00:00
Samuel Berthe
60cb26681f
Update rules.yml 2023-02-23 15:19:36 +01:00
samber
ed35cfef1f Publish 2023-02-17 00:14:49 +00:00
Samuel Berthe
bde83bc9ee
Update rules.yml 2023-02-17 01:14:19 +01:00
alexandrumarian-portal
1e44e348ee
Hashicorp Vault cluster health (#338)
* Hashicorp Vault cluster health
2023-02-17 01:13:41 +01:00
samber
fa56b637a1 Publish 2023-02-14 13:03:11 +00:00
Samuel Berthe
65a0f969be
Update rules.yml 2023-02-14 14:02:35 +01:00
samber
5de0ee850b Publish 2023-02-14 13:01:25 +00:00
alexandrumarian-portal
18da40f8b4
disk io ops alarm (#337)
* disk io ops alarm

* disk io ops alarm
2023-02-14 14:00:43 +01:00
samber
50b171a491 Publish 2023-02-12 19:01:58 +00:00