Commit graph

312 commits

Author SHA1 Message Date
Per Lundberg
f673fe72c3
Update rules.yml
Fixes a bug in the previous commit. `or` has lower precedence than `<` in PromQL, hence the need for grouping with parentheses.
2020-11-27 11:08:46 +02:00
Per Lundberg
00dd58eace
Fix Redis missing master query
The previous approach fails because of the "missing data" semantics in Prometheus. If the Redis server is down, PromQL will typically return "no data" instead of 0 for a `count()`; this is by design in Prometheus.

This suggestion, from @slovdahl, works around this by returning a vector with a single `0` entry in this case, making the query work as intended.
2020-11-25 16:06:05 +02:00
Samuel Berthe
2186841f29
Merge pull request #140 from yasharne/percona_mongodb 2020-11-15 18:12:20 +01:00
Samuel Berthe
6e99bd59cf
Merge pull request #163 from vfiset/remove-mongodb-oplog
remove replset_oplog based alerts
2020-11-09 17:38:12 +01:00
Vincent Fiset
6ed4358452 remove replset_oplog based alerts 2020-11-09 11:14:01 -05:00
Samuel Berthe
3ccfaa47ea
remove useless brackets 2020-11-07 18:08:02 +01:00
Samuel Berthe
9f144acb30
haproxy: fix description of request errors 2020-11-07 18:07:20 +01:00
Samuel Berthe
be20363602
rate is better than irate for alerting 2020-11-07 17:46:18 +01:00
Samuel Berthe
59fd40b113
Merge pull request #161 from MacPaw/feature/rabbitmq-rules
feat: adding few useful RabbitMQ alerts
2020-10-31 23:30:19 +01:00
Liudmyla Derkach
e6113ff2db feat: adding few useful rabbitmq alerts 2020-10-30 19:10:52 +02:00
Yashar Nesabian
2a2ecf8a8c change alert rules which were using avg to show more accurate value based on the replica set 2020-10-24 22:03:42 +03:30
Samuel Berthe
46b5c7bb43
Merge pull request #160 from samber/fix-rule-template-quotes
Fix rule template: removing quotes
2020-10-23 10:55:38 +02:00
Samuel Berthe
a400f53195
fix rule template: removing simple quotes, fixing #159 2020-10-23 10:53:04 +02:00
Samuel Berthe
2734bb43b5
Merge pull request #158 from codemonauts/fix_windows_exporter
Adapt rules for windows to new exporter
2020-10-21 23:39:44 +02:00
Felix Breidenstein
1b6cd55200 Adapt rules for windows to new exporter 2020-10-20 14:52:36 +02:00
Samuel Berthe
5f6127f35a
Merge pull request #152 from nabilbendafi/master
feat(kubernetes): add Out of capacity
2020-10-16 18:19:37 +02:00
Nabil BENDAFI
e024c542ed feat(kubernetes): add Out of capacity 2020-10-16 12:15:56 +02:00
Samuel Berthe
57b7007de4
Merge pull request #151 from samber/comment-cadvisor
Alert on containers CPU: add a comment to exclude cAdvisor
2020-10-11 21:44:15 +02:00
Samuel Berthe
ead7db708e
alert on containers CPU: add a comment to exclude cAdvisor 2020-10-11 21:38:48 +02:00
Samuel Berthe
7fd2841d6d
Merge pull request #150 from samber/alerts-cassadnra
Adding Cassandra alerts
2020-10-11 19:57:57 +02:00
Samuel Berthe
50b4c499fa
rules: adding a few cassandra alerts 2020-10-11 19:55:18 +02:00
Samuel Berthe
ca822ec96c
Merge pull request #116 from robert-will-brown/NetworkSpeed 2020-10-11 19:40:09 +02:00
Samuel Berthe
0cf82fd3e7
Merge branch 'master' into NetworkSpeed 2020-10-11 19:39:59 +02:00
Samuel Berthe
06205cd91c
Update rules.yml 2020-10-11 19:39:17 +02:00
Samuel Berthe
9611847a6e
Merge pull request #120 from AntonSmolkov/master 2020-10-11 19:26:28 +02:00
Samuel Berthe
89252f999f
Merge branch 'master' into master 2020-10-11 19:26:04 +02:00
Samuel Berthe
66e6581b07
Merge pull request #121 from osterik/master
check free space for all mountpoints
2020-10-11 19:22:27 +02:00
Samuel Berthe
2e1db22b9b
Merge pull request #123 from samber/dependabot/bundler/activesupport-6.0.3.1
Bump activesupport from 6.0.2.1 to 6.0.3.1
2020-10-11 18:22:13 +02:00
Samuel Berthe
ea7e6d6aa9
Merge pull request #125 from mcanevet/patch-1
Fix HAProxy rules
2020-10-11 18:21:41 +02:00
Samuel Berthe
5c4e0ec7a3
Merge pull request #128 from nabilbendafi/master 2020-10-11 18:18:08 +02:00
Samuel Berthe
1c2a662d22
Merge branch 'master' into master 2020-10-11 18:15:51 +02:00
Samuel Berthe
94c5f259d2
Merge pull request #131 from nabilbendafi/fix/binary_test
fix: remove unnecessary test
2020-10-11 18:12:52 +02:00
Samuel Berthe
8616b0241c
Merge pull request #130 from nabilbendafi/feature/traefik_rules 2020-10-11 18:10:06 +02:00
Samuel Berthe
e8572f618b
Merge pull request #133 from tux-00/master 2020-10-11 18:07:11 +02:00
Samuel Berthe
2f6b9832fa
Update rules.yml 2020-10-11 18:06:06 +02:00
Samuel Berthe
8af9ca4ba8
Merge pull request #134 from nanorobocop/fix-prometheus-job-missing-alert
Fix PrometheusJobMissing alert
2020-10-11 17:48:42 +02:00
Samuel Berthe
f38f3ef098
Merge pull request #136 from Ozarklake/master 2020-10-11 17:45:41 +02:00
Samuel Berthe
2e6e46da45
Merge branch 'master' into master 2020-10-11 17:42:51 +02:00
Samuel Berthe
c469d26c4d
Merge pull request #137 from Ozarklake/sql_server_rules 2020-10-11 17:37:40 +02:00
Samuel Berthe
bafcd1e922
Update rules.yml 2020-10-11 17:35:46 +02:00
Samuel Berthe
e60fc805f6
Merge pull request #138 from nirav-chotai/nchotai/fix-hpa-alerts
[PLEASE_MERGE] Fix HPA alerts
2020-10-11 17:24:13 +02:00
Samuel Berthe
6defbb18ca
Merge pull request #146 from sys-ops/master 2020-10-11 17:12:22 +02:00
Samuel Berthe
45103f0a0d
Merge branch 'master' into master 2020-10-11 17:10:20 +02:00
Samuel Berthe
7a609adf18
adding comment to container OOM killer warning 2020-10-11 16:11:44 +02:00
Samuel Berthe
7cf383234a
Merge pull request #149 from samber/fix-container-memory-usage-filter-limited
Fix container memory limit: filter containers having memory limit
2020-10-11 16:10:08 +02:00
Samuel Berthe
cf70272309
fix(container memory limit): filter by containers having max memory setting 2020-10-11 16:08:54 +02:00
Samuel Berthe
4128004475
Merge pull request #119 from fernandocarletti/patch-1
fix: container ContainerMemoryUsage alert
2020-10-11 16:06:33 +02:00
Samuel Berthe
f67162bf57
Merge pull request #148 from fsschmitt/fix/disk-latency-unit
Fix time unit on disk read/write latency rule
2020-10-11 15:49:15 +02:00
Samuel Berthe
ccc485f86d
Merge pull request #147 from fsschmitt/master
Fix node_md_disks state from fail to failed
2020-10-11 15:47:19 +02:00
fsschmitt
4266b4d326 Fix time unit on disk read/write latency rule 2020-10-06 14:36:22 +01:00
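
Note on the two most recent commits (f673fe72c3 and 00dd58eace): the fix they describe can be sketched in PromQL roughly as follows. This is an illustration only, not the actual content of rules.yml. The metric name `redis_instance_info` and the `role="master"` label are assumptions; only `count()`, `vector()`, `or`, and `<` are standard PromQL.

```promql
# Assumed metric and label names, for illustration only.

# Returns no data (so an alert never fires) when no matching series exists:
count(redis_instance_info{role="master"}) < 1

# Without parentheses, `<` binds tighter than `or`, so this parses as
# count(...) or (vector(0) < 1), which yields a result (and would fire)
# even when a master exists:
count(redis_instance_info{role="master"}) or vector(0) < 1

# Grouped form: falls back to a single 0 sample when count() is empty,
# then applies the comparison to the whole result:
(count(redis_instance_info{role="master"}) or vector(0)) < 1
```

In the grouped form, the expression yields a result only when the count is genuinely below 1, including the case where the underlying series is absent.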