mirror of
https://github.com/samber/awesome-prometheus-alerts.git
synced 2026-06-26 19:37:27 +08:00
doc: more explicit "ContainerAbsent" and "ContainerKilled" rules
This commit is contained in:
parent
251a929db0
commit
603aacd0e2
1 changed files with 9 additions and 0 deletions
|
|
@ -291,6 +291,15 @@ groups:
|
||||||
description: A container has disappeared
|
description: A container has disappeared
|
||||||
query: 'time() - container_last_seen > 60'
|
query: 'time() - container_last_seen > 60'
|
||||||
severity: warning
|
severity: warning
|
||||||
|
comments: |
|
||||||
|
This rule can be very noisy in dynamic infra with legitimate container start/stop/deployment.
|
||||||
|
- name: Container absent
|
||||||
|
description: A container is absent for 5 min
|
||||||
|
query: 'absent(container_last_seen)'
|
||||||
|
severity: warning
|
||||||
|
for: 5m
|
||||||
|
comments: |
|
||||||
|
This rule can be very noisy in dynamic infra with legitimate container start/stop/deployment.
|
||||||
- name: Container CPU usage
|
- name: Container CPU usage
|
||||||
description: Container CPU usage is above 80%
|
description: Container CPU usage is above 80%
|
||||||
query: '(sum(rate(container_cpu_usage_seconds_total[3m])) BY (instance, name) * 100) > 80'
|
query: '(sum(rate(container_cpu_usage_seconds_total[3m])) BY (instance, name) * 100) > 80'
|
||||||
|
|
|
||||||
Loading…
Reference in a new issue