Mirror of https://github.com/samber/awesome-prometheus-alerts.git, synced 2026-06-23 09:58:16 +08:00
Merge pull request #84 from samber/doc-postgresql-replication-lag
Adding a comment to PostgresqlReplicationLag alert
commit 1a56c3032f
1 changed file with 6 additions and 1 deletion
@@ -207,10 +207,15 @@ services:
       description: PostgreSQL instance is down
       query: "pg_up == 0"
       severity: error
-    - name: Replication lag
+    - name: Postgresql replication lag
       description: PostgreSQL replication lag is going up (> 10s)
       query: "pg_replication_lag > 10"
       severity: warning
+      comments: |
+        A label excluding master nodes should be added to this query,
+        in order to monitor lag on standby servers only.
+        Exporter does not guarantee a NaN value for pg_replication_log on promoted master nodes.
+        See https://github.com/samber/awesome-prometheus-alerts/issues/74
     - name: Table not vaccumed
       description: Table has not been vaccum for 24 hours
       query: "time() - pg_stat_user_tables_last_autovacuum > 60 * 60 * 24"
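
The comment added in this commit asks for a label that excludes master nodes, but does not say which one. As a rough sketch only, the rule could look like the following once such a label exists. The label name and value (`role="standby"`) are hypothetical and are not provided by the exporter or this repository; they would have to be attached by your own scrape or relabeling configuration.

```yaml
- name: Postgresql replication lag
  description: PostgreSQL replication lag is going up (> 10s)
  # Assumption: standby targets carry a hypothetical label role="standby",
  # attached by your own scrape or relabeling configuration.
  # Adjust the matcher to whatever label distinguishes standbys in your setup.
  query: 'pg_replication_lag{role="standby"} > 10'
  severity: warning
```

Matching on standbys only keeps promoted master nodes from firing the alert when the exporter reports a stale or non-NaN lag value for them.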