Commit graph

195 commits

Author SHA1 Message Date
Samuel Berthe
c653b37e15
adding rules to prometheus self monitoring 2020-03-17 20:56:49 +01:00
Samuel Berthe
fc3e72041c
Merge branch 'master' of github.com:samber/awesome-prometheus-alerts 2020-03-17 19:05:57 +01:00
Samuel Berthe
778e101030
adding alerts for Ceph 2020-03-17 18:50:36 +01:00
Samuel Berthe
5125c683c5
adding alerts for Ceph 2020-03-17 18:50:08 +01:00
Samuel Berthe
fb07c2bcd4
Merge pull request #91 from obitech/fix_slow_rule_eval_rule
Fix PrometheusRuleEvaluationSlow
2020-03-17 17:46:07 +01:00
Alexander Knipping
c82df5d005 Fix PrometheusRuleEvaluationSlow
Fixes the rule PrometheusRuleEvaluationSlow as it should fire if
prometheus_rule_group_last_duration_seconds takes longer than
prometheus_rule_group_interval_seconds.

prometheus_rule_group_last_duration_seconds: The duration of the last rule group evaluation.
prometheus_rule_group_interval_seconds: The interval of a rule group.
2020-03-17 15:14:40 +01:00
Samuel Berthe
f5bcac33fe
better contributing guidelines 2020-03-10 10:01:08 +01:00
Samuel Berthe
5b457b0e52
adding github buttons to layout 2020-03-09 23:31:27 +01:00
Samuel Berthe
f554b72671
Add alert for kubernetes api latency 2020-03-09 21:55:17 +01:00
Samuel Berthe
0b89a764ee
Adding exporters: sidekiq, pgbouncer and thanos.
Adding rules to: prometheus, kubernetes, redis, docker and postgresql.
Arranging exporters into categories.
Showing number of rules.
Thanks to Gitlab for opensourcing alerting rules!
2020-03-09 21:18:56 +01:00
Samuel Berthe
affacde49b
adding prometheus internal alerts 2020-03-09 00:16:17 +01:00
Samuel Berthe
189a3129c3
moving prom config to alertmanager page 2020-03-08 23:06:33 +01:00
Samuel Berthe
6408af5ba3
don't ask french people to write in english without error 2020-03-08 23:00:01 +01:00
Samuel Berthe
3ad9015293
don't ask french people to write in english without error 2020-03-08 22:53:49 +01:00
Samuel Berthe
99e3e64252
Insert Commit Message Here 2020-03-08 22:21:30 +01:00
Samuel Berthe
77eccab0e9
some random changes on rules 2020-03-08 20:30:22 +01:00
Samuel Berthe
8f515ceae2
Improves repo intro 2020-03-08 19:23:28 +01:00
Samuel Berthe
542adc3ca7
Adding minio rules 2020-03-08 18:55:53 +01:00
Samuel Berthe
b5469f2a59
Doc: organizing sections 2020-03-08 17:39:49 +01:00
Samuel Berthe
5bace11107
data: ensure alert name prefix 2020-03-08 17:24:39 +01:00
Samuel Berthe
953878df03
HAProxy 1.*: adding rules 2020-03-08 17:17:06 +01:00
Samuel Berthe
7dbbbb0e09
Doc: organizing lb and reverse proxy 2020-03-08 16:10:33 +01:00
Samuel Berthe
c4d35090eb
Improves readme and contributing guidelines 2020-03-08 15:19:48 +01:00
Samuel Berthe
90a9a08b7c
Improves readme and contributing guidelines 2020-03-08 15:17:55 +01:00
Samuel Berthe
718a039313
Adding an alert for prometheus internals: rule evaluation slowing down 2020-03-08 15:08:11 +01:00
Samuel Berthe
072a435f32
Fixing @jpds queries ;) 🚀 2020-03-08 14:41:36 +01:00
Samuel Berthe
f620fe31ee
Merge pull request #36 from jpds/prom-errors
_data/rules.yml: Added Prometheus error alerts.
2020-03-08 14:29:18 +01:00
Samuel Berthe
de778a9aec
don't ask french people to write in english without error 2020-03-07 20:12:03 +01:00
Samuel Berthe
d19171f5c6
doc: adding disclamer about alert thresholds 2020-03-07 20:06:11 +01:00
Samuel Berthe
1a56c3032f
Merge pull request #84 from samber/doc-postgresql-replication-lag
Adding a comment to PostgresqlReplicationLag alert
2020-03-07 19:34:25 +01:00
Samuel Berthe
6ba051d747
doc: adding a comment to PostgresqlReplicationLag alert 2020-03-07 19:30:58 +01:00
Samuel Berthe
05a2c9604b
Renaming some alert categories 2020-03-07 19:06:54 +01:00
Samuel Berthe
6edcdc75af
my brain is out for vacation, please forgive me 2020-03-07 18:57:09 +01:00
Samuel Berthe
ab126b1de6
Merge pull request #83 from samber/feat-cassandra-criteo
Adding alerts for criteo/cassandra_exporter
2020-03-07 18:54:43 +01:00
Samuel Berthe
b97ece8c69
Adding alerts for criteo/cassandra_exporter 2020-03-07 18:51:34 +01:00
Samuel Berthe
75a17a79be
please contribute 🙏 2020-03-07 18:11:09 +01:00
Samuel Berthe
a2d92e25c5
please contribute 🙏 2020-03-07 18:09:41 +01:00
Samuel Berthe
cde4e243ae
no quotes no cry 2020-03-07 17:59:42 +01:00
Samuel Berthe
0add8466c6
Merge pull request #82 from samber/feat-nodeexporter-raid
Added RAID alerts (node-exporter)
2020-03-07 17:51:39 +01:00
Samuel Berthe
ab477bb21e
Added RAID alerts 2020-03-07 17:50:41 +01:00
Samuel Berthe
9cb6dc1bd0
build: upgrade dependencies 2020-03-07 17:30:52 +01:00
Samuel Berthe
e0b556a623
Merge pull request #80 from danilomagalhaes/patch-2
Update rules.yml
2020-03-07 17:05:08 +01:00
Danilo Magalhães
5bd2e03c51
Update rules.yml
Group by instance and name instead of only instance.  
Change from container_spec_memory_limit_bytes to correct max memory metric container_spec_memory_limit_bytes.
2020-02-27 11:08:09 +00:00
Samuel Berthe
a9c9629cb5 oops 2020-01-25 00:16:49 +01:00
Samuel Berthe
134264026a
Does not alert on tmpfs volume filling-up. Closing #77 2020-01-25 00:13:01 +01:00
Samuel Berthe
67b322ae5b
fix check free disk space (#75)
fix check free disk space
2020-01-15 14:28:23 +01:00
iamdenchik
29b66f9b3e fix check free disk space 2020-01-15 12:40:19 +05:00
Samuel Berthe
d699a0d924
oops 2020-01-14 17:18:03 +01:00
Samuel Berthe
b8685adee4
Update GA 2020-01-14 17:15:57 +01:00
Samuel Berthe
2ec17b215f
Merge pull request #73 from Behoston/patch-1
Fix Etcd rule: Insufficient Members
2020-01-03 15:50:30 +01:00