Samuel Berthe
fe00569998
Merge pull request #172 from bdossantos/chore/haproxy2
...
chore: add Prometheus alerts for HAProxy v2
2021-01-06 15:37:19 +01:00
Gert Vilain
de8e2f6cd9
Remove duplicate kubernetes job failed
2021-01-05 20:49:25 +01:00
Benjamin Dos Santos
1b7c36666c
chore: add Prometheus alerts for HAProxy v2
...
ref #87
2021-01-05 16:45:52 +01:00
Samuel Berthe
209fdf86e8
reduce p99 quantile aggregation duration
2021-01-05 12:30:32 +01:00
Samuel Berthe
5d7d99a658
Merge pull request #171 from tosin-ogunrinde/master
2021-01-03 21:45:45 +01:00
Tosin Ogunrinde
21817c3551
Improve JVM "JVM memory filling up" alert by summing up all the heap areas which include a separate entry for the Eden Space, Survivor Space and Tenured Gen.
2020-12-31 09:16:09 +00:00
Tosin Ogunrinde
ebf402aa7d
Improve JVM "JVM memory filling up" alert by summing up all the heap areas which include a separate entry for the Eden Space, Survivor Space and Tenured Gen.
2020-12-31 09:06:36 +00:00
Samuel Berthe
97345d3b6f
mysql restart alert: severity=info
2020-12-31 00:47:14 +01:00
Samuel Berthe
971bbe03ec
Add FOR clause to alerting rules (when necessary)
2020-12-31 00:27:12 +01:00
Samuel Berthe
3a352d08dc
fix k8s rule: longer alert check time
2020-12-30 19:13:02 +01:00
Samuel Berthe
d3ecfaaad3
Merge pull request #139 from xkfen/istio
2020-12-30 18:47:28 +01:00
Samuel Berthe
2f6d4921c6
fix initial istio alerts
2020-12-30 18:46:50 +01:00
Samuel Berthe
fa4325218f
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts
2020-12-30 17:46:58 +01:00
Samuel Berthe
ed62bdc567
alerts node_exporter: improve network and disk rules
2020-12-30 17:45:30 +01:00
Tosin Ogunrinde
0add93363f
Fix JVM "JVM memory filling up" alert
2020-12-30 00:30:08 +00:00
Samuel Berthe
f686698f68
Merge pull request #166 from cityofships/fix_es
...
Fix Elasticsearch "No new documents" alert
2020-12-28 16:50:47 +01:00
Samuel Berthe
965fefab89
fix alert description
2020-12-28 16:40:11 +01:00
Carl Düvel
a7c5155002
Add cpu steal alert
2020-12-21 19:06:45 +01:00
Piotr Parczewski
f7d08e364b
Fix Elasticsearch "No new documents" alert.
...
Prometheus rate() function calculates the per-second average rate
of increase. This means the alert gets triggered whenever during
last 10 minutes there were less than 1 document ingested *per second*
(60 documents per minute).
Signed-off-by: Piotr Parczewski <piotr@stackhpc.com>
2020-12-17 15:00:01 +01:00
Per Lundberg
f673fe72c3
Update rules.yml
...
Fixes bug in previous commit. `or` has lower precedence than `<` in PromQL so hence the need for the grouping using parentheses.
2020-11-27 11:08:46 +02:00
Per Lundberg
00dd58eace
Fix Redis missing master query
...
The previous approach fails because of the "missing data" semantics in Prometheus. If the Redis server is down, PromQL will typically return "no data" instead of 0 for a `count()`; this is by design in Prometheus.
This suggestion as given by @slovdahl works around this by returning an vector with a single `0` entry in this case, making the query work as intended.
2020-11-25 16:06:05 +02:00
Samuel Berthe
2186841f29
Merge pull request #140 from yasharne/percona_mongodb
2020-11-15 18:12:20 +01:00
Vincent Fiset
6ed4358452
remove replset_oplog based alerts
2020-11-09 11:14:01 -05:00
Samuel Berthe
3ccfaa47ea
remove useless brackets
2020-11-07 18:08:02 +01:00
Samuel Berthe
9f144acb30
haproxy: fix description of request errors
2020-11-07 18:07:20 +01:00
Samuel Berthe
be20363602
rate is better than irate for alerting
2020-11-07 17:46:18 +01:00
Liudmyla Derkach
e6113ff2db
feat: adding few useful rabbitmq alerts
2020-10-30 19:10:52 +02:00
Yashar Nesabian
2a2ecf8a8c
change alert rules which were using avg to show more accurate value based on the replica set
2020-10-24 22:03:42 +03:30
Felix Breidenstein
1b6cd55200
Adapt rules for windows to new exporter
2020-10-20 14:52:36 +02:00
Nabil BENDAFI
e024c542ed
feat(kubernetes): add Out of capacity
2020-10-16 12:15:56 +02:00
Samuel Berthe
ead7db708e
alert on containers CPU: add a comment to exclude cAdvisor
2020-10-11 21:38:48 +02:00
Samuel Berthe
50b4c499fa
rules: adding a few cassandra alerts
2020-10-11 19:55:18 +02:00
Samuel Berthe
0cf82fd3e7
Merge branch 'master' into NetworkSpeed
2020-10-11 19:39:59 +02:00
Samuel Berthe
06205cd91c
Update rules.yml
2020-10-11 19:39:17 +02:00
Samuel Berthe
89252f999f
Merge branch 'master' into master
2020-10-11 19:26:04 +02:00
Samuel Berthe
66e6581b07
Merge pull request #121 from osterik/master
...
check free space for all mountpoints
2020-10-11 19:22:27 +02:00
Samuel Berthe
ea7e6d6aa9
Merge pull request #125 from mcanevet/patch-1
...
Fix HAProxy rules
2020-10-11 18:21:41 +02:00
Samuel Berthe
8616b0241c
Merge pull request #130 from nabilbendafi/feature/traefik_rules
2020-10-11 18:10:06 +02:00
Samuel Berthe
e8572f618b
Merge pull request #133 from tux-00/master
2020-10-11 18:07:11 +02:00
Samuel Berthe
2f6b9832fa
Update rules.yml
2020-10-11 18:06:06 +02:00
Samuel Berthe
8af9ca4ba8
Merge pull request #134 from nanorobocop/fix-prometheus-job-missing-alert
...
Fix PrometheusJobMissing alert
2020-10-11 17:48:42 +02:00
Samuel Berthe
2e6e46da45
Merge branch 'master' into master
2020-10-11 17:42:51 +02:00
Samuel Berthe
c469d26c4d
Merge pull request #137 from Ozarklake/sql_server_rules
2020-10-11 17:37:40 +02:00
Samuel Berthe
bafcd1e922
Update rules.yml
2020-10-11 17:35:46 +02:00
Samuel Berthe
e60fc805f6
Merge pull request #138 from nirav-chotai/nchotai/fix-hpa-alerts
...
[PLEASE_MERGE] Fix HPA alerts
2020-10-11 17:24:13 +02:00
Samuel Berthe
45103f0a0d
Merge branch 'master' into master
2020-10-11 17:10:20 +02:00
Samuel Berthe
7a609adf18
adding comment to container OOM killer warning
2020-10-11 16:11:44 +02:00
Samuel Berthe
cf70272309
fix(container memory limit): filter by containers having max memory setting
2020-10-11 16:08:54 +02:00
Samuel Berthe
4128004475
Merge pull request #119 from fernandocarletti/patch-1
...
fix: container ContainerMemoryUsage alert
2020-10-11 16:06:33 +02:00
Samuel Berthe
f67162bf57
Merge pull request #148 from fsschmitt/fix/disk-latency-unit
...
Fix time unit on disk read/write latency rule
2020-10-11 15:49:15 +02:00