Commit graph

95 commits

Author SHA1 Message Date
samber
04886da968 Publish 2024-05-13 10:10:12 +00:00
samber
613401a960 Publish 2024-05-13 09:12:01 +00:00
samber
84b0569c97 Publish 2024-05-13 08:33:30 +00:00
Ali
2547288c13
Added Clickhouse (#412)
* Added Clickhouse

* Update rules.yml

Added reasonable time periods for each query to avoid false positives and, in some cases, give the system a short window to try to resolve the issue.
Also changed the severity level of authentication alerts from critical to info, which seems more appropriate.

* Modified time period for alerts embedded-exporter.yml

I made a few adjustments to the time periods.
See whether they seem reasonable.

* Replication alerts time periods were adjusted

IMHO, replication alerts must be sent right away.
2024-05-13 10:32:18 +02:00
samber
515fca9c10 Publish 2024-05-05 23:33:11 +00:00
samber
5c0963558a Publish 2024-05-02 18:49:56 +00:00
samber
b77cb3467c Publish 2024-04-29 20:36:49 +00:00
samber
6b05a59ad9 Publish 2024-03-26 15:57:31 +00:00
Rastislav Pôbiš
2494ccdf31
Added prepared statements mysqld-exporter alert (#407) 2024-03-26 16:56:15 +01:00
samber
693c9e51b2 Publish 2024-03-11 22:29:17 +00:00
samber
7b3cef8bf9 Publish 2024-03-11 21:56:16 +00:00
samber
e2d3dadbc5 Publish 2024-02-12 08:42:15 +00:00
samber
c3258de6c7 Publish 2024-02-10 22:25:26 +00:00
samber
284db65e46 Publish 2024-02-10 19:02:28 +00:00
samber
0dba950ccc Publish 2024-02-09 19:25:17 +00:00
Brett Beutell
56a7e0d03a
Update rule for host memory underutilization to use avg_over_time instead of rate, since node_memory_MemAvailable_bytes is a gauge (#400) 2024-01-26 04:09:35 +01:00
samber
df4016bf6a Publish 2024-01-20 19:34:37 +00:00
josedev-union
c6ff5a59dc
feat: Add rules for Graph Node (#387)
Co-authored-by: josedev-union <josedev-union@users.noreply.github.com>
2024-01-20 20:33:26 +01:00
samber
6ee065c636 Publish 2023-12-01 17:26:16 +00:00
michaelact
7fa11bf6cc
Add simple and meaningful kube-state-metrics alert summary (#394)
* feat: add 'summary' to be overridden from rules.yml

* chore: add simple and meaningful summary for kubernetes alerts
2023-12-01 18:25:11 +01:00
samber
7d05d142d5 Publish 2023-11-26 01:19:24 +00:00
samber
308b3c52dd Publish 2023-10-24 13:05:40 +00:00
samber
97da7f97b6 Publish 2023-10-13 15:10:33 +00:00
samber
82f2798620 Publish 2023-10-06 16:50:22 +00:00
Vicky Wilson Jacob
7a8f883df6
feat: adding hadoop jmx exporter (#391)
* adding hadoop exporter

* added hadoop rules with jmx exporter

* added hadoop rules with jmx exporter

* Update rules.yml

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2023-10-06 18:48:54 +02:00
samber
ccdfd22a41 Publish 2023-09-18 18:16:22 +00:00
samber
93a62d4271 Publish 2023-08-22 13:53:16 +00:00
samber
4279dedb52 Publish 2023-08-19 22:41:12 +00:00
Pavel Timofeev
6b1685261d
Rework kube-state-metrics alerts (#381)
* Rework kube-state-metrics alerts:
- provide meaningful labels in summary as 'instance' label hardly makes sense in most of them
- rename some alerts to tell more accurately what the problem is
- adjust descriptions to follow the message schema found in other alerts

* move changes to _data/rules.yml

* Update rules.yml

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2023-08-20 00:39:22 +02:00
samber
afddf710ab Publish 2023-08-15 18:28:36 +00:00
Ted Hahn
94b9f3cfbb
Fix for Postgres max connections. Postgres does not limit connections per database, but in total across the server. Additionally, alert labels didn't match across the pair. Using a min by on the right side handles the possibility that additional labels are present on your exporter. (#376) 2023-08-15 19:39:41 +02:00
Pavel Timofeev
c419732e2e
Subtract failed jobs from KubernetesJobSlowCompletion (#363) 2023-08-15 19:32:51 +02:00
samber
f72620203f Publish 2023-07-30 20:22:47 +00:00
samber
c0ec625dc6 Publish 2023-07-29 16:22:19 +00:00
samber
3ad1536226 Publish 2023-07-12 12:34:10 +00:00
samber
4394de4713 Publish 2023-07-06 11:55:47 +00:00
Roman Pertl
71f488d744
feat: improve rule for used connection on redis (#358)
use max allowed connections value instead of a fixed value
2023-06-27 00:27:20 +02:00
samber
7a05f925b4 Publish 2023-06-22 16:42:13 +00:00
samber
a4dbefd853 Publish 2023-06-22 16:30:42 +00:00
samber
f9c71ab724 Publish 2023-06-22 13:02:23 +00:00
Pavel Timofeev
247dabffd8
Rename KubernetesNodeReady alert (#360)
Better wording for NodeReady nodes
2023-06-22 15:00:08 +02:00
samber
ac09fd8a2d Publish 2023-05-21 20:58:38 +00:00
samber
99b101077d Publish 2023-04-28 14:06:51 +00:00
samber
7a874b7205 Publish 2023-04-25 08:59:28 +00:00
samber
9d3d52bbfa Publish 2023-04-23 20:16:41 +00:00
samber
603edb2536 Publish 2023-04-06 23:43:00 +00:00
samber
ebbfc496cd Publish 2023-04-03 08:03:11 +00:00
Julien Lecomte
baa4f223cd
Ignore temperature from tctl sensors (#341) 2023-03-24 14:36:24 +01:00
samber
2ead3bcbd8 Publish 2023-03-15 17:27:02 +00:00
samber
293aba1437 Publish 2023-02-26 01:34:30 +00:00