Samuel Berthe
2186841f29
Merge pull request #140 from yasharne/percona_mongodb
2020-11-15 18:12:20 +01:00
Vincent Fiset
6ed4358452
remove replset_oplog based alerts
2020-11-09 11:14:01 -05:00
Samuel Berthe
3ccfaa47ea
remove useless brackets
2020-11-07 18:08:02 +01:00
Samuel Berthe
9f144acb30
haproxy: fix description of request errors
2020-11-07 18:07:20 +01:00
Samuel Berthe
be20363602
rate is better than irate for alerting
2020-11-07 17:46:18 +01:00
Liudmyla Derkach
e6113ff2db
feat: adding few useful rabbitmq alerts
2020-10-30 19:10:52 +02:00
Yashar Nesabian
2a2ecf8a8c
change alert rules which were using avg to show more accurate value based on the replica set
2020-10-24 22:03:42 +03:30
Felix Breidenstein
1b6cd55200
Adapt rules for windows to new exporter
2020-10-20 14:52:36 +02:00
Nabil BENDAFI
e024c542ed
feat(kubernetes): add Out of capacity
2020-10-16 12:15:56 +02:00
Samuel Berthe
ead7db708e
alert on containers CPU: add a comment to exclude cAdvisor
2020-10-11 21:38:48 +02:00
Samuel Berthe
50b4c499fa
rules: adding a few cassandra alerts
2020-10-11 19:55:18 +02:00
Samuel Berthe
0cf82fd3e7
Merge branch 'master' into NetworkSpeed
2020-10-11 19:39:59 +02:00
Samuel Berthe
06205cd91c
Update rules.yml
2020-10-11 19:39:17 +02:00
Samuel Berthe
89252f999f
Merge branch 'master' into master
2020-10-11 19:26:04 +02:00
Samuel Berthe
66e6581b07
Merge pull request #121 from osterik/master
check free space for all mountpoints
2020-10-11 19:22:27 +02:00
Samuel Berthe
ea7e6d6aa9
Merge pull request #125 from mcanevet/patch-1
Fix HAProxy rules
2020-10-11 18:21:41 +02:00
Samuel Berthe
8616b0241c
Merge pull request #130 from nabilbendafi/feature/traefik_rules
2020-10-11 18:10:06 +02:00
Samuel Berthe
e8572f618b
Merge pull request #133 from tux-00/master
2020-10-11 18:07:11 +02:00
Samuel Berthe
2f6b9832fa
Update rules.yml
2020-10-11 18:06:06 +02:00
Samuel Berthe
8af9ca4ba8
Merge pull request #134 from nanorobocop/fix-prometheus-job-missing-alert
Fix PrometheusJobMissing alert
2020-10-11 17:48:42 +02:00
Samuel Berthe
2e6e46da45
Merge branch 'master' into master
2020-10-11 17:42:51 +02:00
Samuel Berthe
c469d26c4d
Merge pull request #137 from Ozarklake/sql_server_rules
2020-10-11 17:37:40 +02:00
Samuel Berthe
bafcd1e922
Update rules.yml
2020-10-11 17:35:46 +02:00
Samuel Berthe
e60fc805f6
Merge pull request #138 from nirav-chotai/nchotai/fix-hpa-alerts
[PLEASE_MERGE] Fix HPA alerts
2020-10-11 17:24:13 +02:00
Samuel Berthe
45103f0a0d
Merge branch 'master' into master
2020-10-11 17:10:20 +02:00
Samuel Berthe
7a609adf18
adding comment to container OOM killer warning
2020-10-11 16:11:44 +02:00
Samuel Berthe
cf70272309
fix(container memory limit): filter by containers having max memory setting
2020-10-11 16:08:54 +02:00
Samuel Berthe
4128004475
Merge pull request #119 from fernandocarletti/patch-1
fix: container ContainerMemoryUsage alert
2020-10-11 16:06:33 +02:00
Samuel Berthe
f67162bf57
Merge pull request #148 from fsschmitt/fix/disk-latency-unit
Fix time unit on disk read/write latency rule
2020-10-11 15:49:15 +02:00
fsschmitt
4266b4d326
Fix time unit on disk read/write latency rule
2020-10-06 14:36:22 +01:00
fsschmitt
5288c9a2f5
Fix node_md_disks state from fail to failed
2020-10-06 13:33:50 +01:00
Daniel Andrzejewski
fc4797db9e
small fix
2020-09-17 15:19:14 +02:00
Daniel Andrzejewski
6c5f708179
node_disk_write_time_seconds_total is in seconds, not in milliseconds. node_disk_write_time_seconds_total should be greater than 0, otherwise you get a +Inf result.
2020-09-17 15:13:42 +02:00
Yashar Nesabian
d6b39a7f3f
More accurate alerts
added `mongodb instance down` alert and changed the `too many
connections` alert to fire when the connections are more than 80% of the
available connections.
removed `mongodb_replset_member_state` based alerts as I don't have
enough information on them
2020-08-09 10:35:39 +04:30
Yashar Nesabian
3ce1084f5b
Added percona mongodb alert rules
2020-08-03 10:45:32 +04:30
Nirav Chotai
8fb5da83de
Fix HPA alerts
- Fixing KubernetesHpaMetricAvailability
- Fixing KubernetesHpaScalingAbility
2020-07-24 13:32:44 +08:00
Ozarklake
88e812c78e
add sql server rules
2020-07-17 15:02:41 +08:00
Ozarklake
4e66d17d01
add sql server rules
2020-07-17 14:58:26 +08:00
Ozarklake
e009c5d8b5
Optimizing mysql slow query alert rules
2020-07-14 12:55:17 +08:00
Mansur Marvanov
05e521c0a8
Fix PrometheusJobMissing alert
2020-07-09 16:36:45 +09:00
tux
add6d9c2f3
Add official rabbitmq exporter rules
2020-06-30 15:48:42 +02:00
Nabil BENDAFI
b324c6f32f
feat(traefik): add rules for Traefik v2
Fixes #7
2020-06-23 13:40:01 +02:00
Mickaël Canévet
24f7095cd5
Fix HAProxy rules
2020-05-29 10:11:54 +02:00
Ilya Kisleyko
663b0e94da
check free space for all mountpoints
2020-05-20 20:04:32 +03:00
Anton Smolkov
bbbe14f2bd
Update rules.yml
WMI memory alert had opposite meaning, triggered on 90% free instead of 90% used
2020-05-19 11:07:11 +03:00
Fernando Carletti
e6de413146
fix: container ContainerMemoryUsage alert
2020-05-18 17:38:05 -05:00
Rob Brown
5050fd64d5
Correct "device" to "interface"
2020-05-14 16:57:19 +01:00
Samuel Berthe
da1e4f6301
💄 replacing "error" severity by "critical", repo wide
2020-05-14 17:20:19 +02:00
Rob Brown
5d3e812fd7
Add HostNetworkNot1GbSpeed rule
2020-05-14 15:00:24 +01:00
Samuel Berthe
7293bca720
Merge pull request #107 from robert-will-brown/NetworkTransmitErrors
2020-05-09 21:32:40 +02:00