guruevi
0c2876e37d
Merge branch 'samber:master' into master
2024-04-11 17:37:20 -04:00
Evi Vanoost
c823aca7c2
PostgreSQL now has optimized autovacuum behavior
2024-04-11 17:35:33 -04:00
Rastislav Pôbiš
2494ccdf31
Added prepared statements mysqld-exporter alert ( #407 )
2024-03-26 16:56:15 +01:00
Evi Vanoost
351e45cb13
Merge remote-tracking branch 'guruevi/master'
2024-03-13 15:54:01 -04:00
Evi Vanoost
a68beee3c4
PostgreSQL now has optimized autovacuum behavior
2024-03-13 15:53:56 -04:00
guruevi
bfd04e6f36
Merge branch 'samber:master' into master
2024-03-13 14:57:53 -04:00
Samuel Berthe
1eb5c5834f
Update rules.yml
2024-03-11 23:28:06 +01:00
Samuel Berthe
90706282ad
Update rules.yml
2024-03-11 22:55:05 +01:00
Evi Vanoost
224e6d00a9
Refined some more queries
2024-03-06 11:13:48 -05:00
Evi Vanoost
46043360de
Removed queries that throw errors when systems are upgraded. Also fixed and simplified a few Postgres queries.
2024-02-25 14:53:30 -05:00
Evi Vanoost
87ee1292e7
Minor bug fixes
2024-02-24 19:30:22 -05:00
Evi Vanoost
d6ef8e7449
Clean up some more metrics
2024-02-24 19:06:50 -05:00
Evi Vanoost
8d3ae0436b
smartctl_exporter publishes both drive_trip and current drive temperatures. Since most of the alerts are going to be permanent, it does not make sense to wait for the alert to be on for a certain time. Temperature sensors likewise vary, using the last sample is not sufficient to alert on potential issues.
2024-02-24 13:49:42 -05:00
Samuel Berthe
05c4716c2b
Fix KubernetesAPIserverlatency
2024-02-12 09:41:03 +01:00
Samuel Berthe
f5f6b338a3
fix: high/low cpu alert
2024-02-10 23:24:10 +01:00
Samuel Berthe
937cd35df7
💄
2024-02-10 20:04:17 +01:00
Samuel Berthe
5f57f09db0
fix(HostOutOfInodes): exclude msdosfs FS
...
See #398
2024-02-10 20:01:19 +01:00
Marek Červenka
4eb0e910e7
SMART monitoring ( #402 )
...
* SMART monitoring
* query regex fix
---------
Co-authored-by: Marek Cervenka <cervenka@ipex.cz>
2024-02-09 20:23:30 +01:00
Samuel Berthe
0727f2ef2e
Update rules.yml
2024-01-26 04:10:22 +01:00
josedev-union
c6ff5a59dc
feat: Add rules for Graph Node ( #387 )
...
Co-authored-by: josedev-union <josedev-union@users.noreply.github.com>
2024-01-20 20:33:26 +01:00
michaelact
7fa11bf6cc
Add simple and meaningful kube-state-metrics alert summary ( #394 )
...
* feat: add 'summary' to be overriden from rules.yml
* chore: add simple and meaningful summary for kubernetes alerts
2023-12-01 18:25:11 +01:00
Samuel Berthe
a4de5323ad
Update rules.yml
2023-11-26 02:18:16 +01:00
Samuel Berthe
76de11d71b
Update rules.yml
2023-10-24 15:03:51 +02:00
Pierre Riteau
cbf7046afa
Fix capitalisation of RabbitMQ ( #392 )
2023-10-13 17:09:10 +02:00
Vicky Wilson Jacob
7a8f883df6
feat: adding hadoop jmx exporter ( #391 )
...
* adding hadoop exporter
* added hadoop rules with jmx exporter
* added hadoop rules with jmx exporter
* Update rules.yml
---------
Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2023-10-06 18:48:54 +02:00
Samuel Berthe
bacb433089
Update rules.yml
2023-09-18 20:14:57 +02:00
Samuel Berthe
053cde27e4
Update rules.yml
2023-08-22 15:51:53 +02:00
Pavel Timofeev
6b1685261d
Rework kube-state-metrics alerts ( #381 )
...
* Rework kube-state-metrics alerts:
- provide meaningful labels in summary as 'instance' label hardly makes sense in most of them
- rename some alerts to tell more accurate what the problem is
- adjust description trying to follow some kind of the message schema found in other alerts
* move changes to _data/rules.yml
* Update rules.yml
---------
Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2023-08-20 00:39:22 +02:00
Samuel Berthe
c3d78786e8
fix ci
2023-08-15 20:27:13 +02:00
Roman Pertl
ecd92399d5
feat: adding patroni alert rules ( #369 )
2023-08-15 19:54:15 +02:00
fzyzcjy
13e90b3aea
Update rules.yml ( #371 )
2023-08-15 19:42:46 +02:00
Ted Hahn
94b9f3cfbb
Fix for Postgres max connections. Postgres does not limit connections by database, but total over the server. Additionally, alert labels didn't match across the pair. Using a min by on the right side deals with the possibility additional labels are present on your exporter. ( #376 )
2023-08-15 19:39:41 +02:00
Samuel Berthe
15e3131547
Update rules.yml
2023-08-15 19:36:22 +02:00
Samuel Berthe
eb3220c8d7
Update rules.yml
2023-08-15 19:34:14 +02:00
Ivan Dudin
86e3e38a99
fix typo ( #377 )
2023-08-07 19:43:10 +02:00
Samuel Berthe
ff76ceccde
Update rules.yml
2023-07-30 22:24:31 +02:00
Moritz
fe5f78171a
update rules.yml ( #374 )
2023-07-30 22:21:20 +02:00
Samuel Berthe
8c811045e5
Update rules.yml
2023-07-29 18:20:58 +02:00
Samuel Berthe
32cf16a53d
Update rules.yml
2023-07-12 14:32:43 +02:00
Samuel Berthe
1bb6c602f7
Update rules.yml
2023-07-06 13:54:31 +02:00
Samuel Berthe
5d254811b4
Update rules.yml
2023-06-27 00:28:31 +02:00
Samuel Berthe
47b7748618
Update rules.yml
2023-06-22 18:40:33 +02:00
Samuel Berthe
3d0c5fcafd
Update rules.yml
2023-06-22 18:29:21 +02:00
Samuel Berthe
600a759344
Update rules.yml
2023-06-22 15:01:06 +02:00
Samuel Berthe
ee86c2d233
Update rules.yml
2023-06-22 15:00:40 +02:00
michaelact
7e8bc1a215
Add under-utilized container alerts ( #322 )
...
* chore: add container under-utilized allerts
* chore: resolve duplicated query and description
2023-05-21 22:58:04 +02:00
Paul-Élie Testud
c36014f03e
fix(nginx): fix nginx query for histogram_percentile ( #351 )
2023-04-28 16:06:12 +02:00
deimosOmegaChan
b98b2a2777
fix node-exporter nodename regex expression ( #349 )
...
nodename should not depends with the prefix "hostname"
2023-04-25 10:58:52 +02:00
Samuel Berthe
9efec14d26
chore: move from " https://awesome-prometheus-alerts.grep.to " to " https://samber.github.io/awesome-prometheus-alerts/ "
2023-04-23 23:32:26 +02:00
Madhu Sudhan
8b9fc8864f
refactor: node-exporter queries to include hostname as label which will be helpful for alerting ( #348 )
2023-04-23 22:16:08 +02:00