Commit graph

851 commits

Author SHA1 Message Date
Samuel Berthe
478d99c6a1
Modify systemd service crash rule description
Updated the description for the systemd service crash rule to include the service name.
2026-03-01 20:13:56 +01:00
Marcin Morawski
206ec8696d Add systemd service name to HostSystemdServiceCrashed summary 2026-02-27 15:55:18 +01:00
samber
0693ed168e Publish 2026-02-21 18:40:35 +00:00
dxrayz
e60601fdcd
tune Targets Missing rules (#497)
* tune Targets Missing rules

* reworked query logic

* Update rules.yml

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2026-02-21 19:40:10 +01:00
dependabot[bot]
9998e22145
build(deps-dev): bump nokogiri from 1.18.9 to 1.19.1 (#498)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.18.9 to 1.19.1.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.18.9...v1.19.1)

---
updated-dependencies:
- dependency-name: nokogiri
  dependency-version: 1.19.1
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-20 01:58:02 +01:00
dependabot[bot]
52cc00fc4c
build(deps-dev): bump faraday from 2.12.0 to 2.14.1 (#496)
Bumps [faraday](https://github.com/lostisland/faraday) from 2.12.0 to 2.14.1.
- [Release notes](https://github.com/lostisland/faraday/releases)
- [Changelog](https://github.com/lostisland/faraday/blob/main/CHANGELOG.md)
- [Commits](https://github.com/lostisland/faraday/compare/v2.12.0...v2.14.1)

---
updated-dependencies:
- dependency-name: faraday
  dependency-version: 2.14.1
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-10 00:26:42 +01:00
samber
dd10c7ef05 Publish 2026-01-30 11:15:52 +00:00
Per Lundberg
51aea96ba7
Adjust OOM kill detected rule (#495)
* Adjust OOM kill detected rule

When a machine runs out of memory, it happens that the node
exporter stops responding for multiple minutes. I've adjusted
the rule now to take this into account: even if it takes 15-20
minutes before the machine becomes responsive again, the
alert should still fire.

* Update rules.yml

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2026-01-30 12:15:27 +01:00
Andreyev Dias de Melo
1d69457017
fix: corrects download URL for rules files (#494) 2026-01-30 01:40:38 +01:00
Samuel Berthe
f0107caf9e
Update README.md 2026-01-15 12:33:35 +01:00
Samuel Berthe
34cc80ffea
Update app.css 2026-01-15 02:48:16 +01:00
Samuel Berthe
a5d1c04955
Update default.html 2026-01-15 02:43:57 +01:00
Samuel Berthe
65551ae19f
Update README.md 2026-01-15 02:42:42 +01:00
Samuel Berthe
570521429e
Update default.html 2026-01-15 02:42:00 +01:00
Samuel Berthe
55f16705eb
Add files via upload 2026-01-15 02:40:58 +01:00
Samuel Berthe
2b5c8b0ec7
Update README.md 2026-01-15 02:39:24 +01:00
samber
81081bdda5 Publish 2026-01-07 12:58:08 +00:00
Samuel Berthe
d400e3e64d
feat(k8s): cronjob rule (#491) 2026-01-07 13:57:42 +01:00
Samuel Berthe
1136aa3a87
remove file 2026-01-07 13:29:12 +01:00
Simon Matic Langford
f810ff531d
Node exporter rules to preserve instance labels (#488)
* Jenkins node offline for clause (#2)

* Convert cpu alert expressions to without() rather than on()

* Remove on() expression from network throughput alerts as labels fully match

---------

Co-authored-by: Simon Matic Langford <simon@longshotsystems.co.uk>
2026-01-06 16:24:18 +01:00
dependabot[bot]
74ba870f05
build(deps-dev): bump uri from 0.13.2 to 0.13.3 (#489)
Bumps [uri](https://github.com/ruby/uri) from 0.13.2 to 0.13.3.
- [Release notes](https://github.com/ruby/uri/releases)
- [Commits](https://github.com/ruby/uri/compare/v0.13.2...v0.13.3)

---
updated-dependencies:
- dependency-name: uri
  dependency-version: 0.13.3
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-06 00:55:03 +01:00
5bentz
ffa260b39d
Update sleep-peacefully.md (#487)
Fix business hours (9:00 to 18:00)
2025-12-08 15:19:11 +01:00
dependabot[bot]
766b224c67
build(deps): bump actions/checkout from 5 to 6 (#485)
Bumps [actions/checkout](https://github.com/actions/checkout) from 5 to 6.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/v5...v6)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-version: '6'
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-01 21:34:15 +01:00
Simon Matic Langford
79f2858037
Improve Jenkins node alerts to better handle servers with multiple nodes (#484) 2025-11-17 14:56:04 +01:00
Samuel Berthe
d6589237e1
Update CONTRIBUTING.md 2025-11-13 16:24:49 +01:00
Samuel Berthe
d0d1b00a7b
Fix typo in OpenTelemetry Collector link 2025-11-05 17:15:10 +01:00
Samuel Berthe
e617c07179
Update README.md 2025-11-05 17:14:47 +01:00
Samuel Berthe
48f2dde80c
feat: use /ref/head/ instead of /master/ for yaml url (#482) 2025-11-05 17:12:50 +01:00
samber
cea78d7fd6 Publish 2025-11-05 16:08:52 +00:00
Arve Knudsen
d58bc324ad
Add OpenTelemetry Collector monitoring alerts (#480)
Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>
2025-11-05 17:08:26 +01:00
samber
4acbddb21a Publish 2025-11-05 16:04:56 +00:00
Samuel Berthe
6e2db98590
feat: add support for exporter-level comments (#481) 2025-11-05 17:04:30 +01:00
samber
ae8cfb0366 Publish 2025-10-13 12:24:59 +00:00
andrii.k
9edef74e73
update kafka alerts (#478) 2025-10-13 14:24:37 +02:00
dependabot[bot]
2f9279d707
build(deps-dev): bump rexml from 3.3.9 to 3.4.2 (#476)
Bumps [rexml](https://github.com/ruby/rexml) from 3.3.9 to 3.4.2.
- [Release notes](https://github.com/ruby/rexml/releases)
- [Changelog](https://github.com/ruby/rexml/blob/master/NEWS.md)
- [Commits](https://github.com/ruby/rexml/compare/v3.3.9...v3.4.2)

---
updated-dependencies:
- dependency-name: rexml
  dependency-version: 3.4.2
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-19 04:17:09 +02:00
samber
606d6fc592 Publish 2025-09-15 13:04:10 +00:00
Riccardo Cannella
7832e01082
haproxy: align v1 and v2 HAProxy backend max active session > 80% alerts (#475)
* haproxy: align v1 and v2 max current session alerts

* fix: remove non-existing label

---------

Co-authored-by: Riccardo Cannella <riccardo.cannella@reevo.it>
2025-09-15 15:03:44 +02:00
samber
b158ebb551 Publish 2025-09-14 17:22:29 +00:00
Samuel Berthe
237e89babc
Update query for unused replication slot rule 2025-09-14 19:22:05 +02:00
dependabot[bot]
264bcb82be
build(deps): bump actions/checkout from 4 to 5 (#473)
Bumps [actions/checkout](https://github.com/actions/checkout) from 4 to 5.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/v4...v5)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-version: '5'
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-02 17:06:00 +02:00
Samuel Berthe
dfac84209d
Update README.md 2025-09-01 15:41:07 +02:00
samber
5fbce5f513 Publish 2025-09-01 13:41:06 +00:00
Sajjad hassanzadeh
a2c31358d1
Add couchdb alerts (#472)
* add : additional essential clickhouse alerts

* Add new ClickHouse alert rules for monitoring

* linting

* add : couchdb roles config in rules.yml

* add : couchdb alerts in rules directory

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2025-09-01 15:40:42 +02:00
Samuel Berthe
edae18b8df
Remove Screeb tag 2025-08-29 15:20:48 +02:00
Samuel Berthe
0a55137e6a
Remove Screeb 2025-08-29 15:20:21 +02:00
samber
3abc7144aa Publish 2025-08-28 21:07:00 +00:00
Sajjad hassanzadeh
7bced89d2d
add : additional essential clickhouse alerts (#471)
* add : additional essential clickhouse alerts

* Add new ClickHouse alert rules for monitoring

* linting

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2025-08-28 23:06:31 +02:00
dependabot[bot]
52e4ba143c
build(deps-dev): bump nokogiri from 1.18.8 to 1.18.9 (#469)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.18.8 to 1.18.9.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.18.8...v1.18.9)

---
updated-dependencies:
- dependency-name: nokogiri
  dependency-version: 1.18.9
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-07-22 16:59:26 +02:00
samber
b04b11ce1d Publish 2025-06-25 11:32:39 +00:00
Samuel Berthe
554850df41
Update rules.yml 2025-06-25 13:32:16 +02:00