150 commits

Author SHA1 Message Date
Samuel Berthe
ab477bb21e
Added RAID alerts 2020-03-07 17:50:41 +01:00
Samuel Berthe
67b322ae5b
fix check free disk space (#75)
fix check free disk space
2020-01-15 14:28:23 +01:00
iamdenchik
29b66f9b3e fix check free disk space 2020-01-15 12:40:19 +05:00
Samuel Berthe
d699a0d924
oops 2020-01-14 17:18:03 +01:00
Samuel Berthe
b8685adee4
Update GA 2020-01-14 17:15:57 +01:00
Samuel Berthe
2ec17b215f
Merge pull request #73 from Behoston/patch-1
Fix Etcd rule: Insufficient Members
2020-01-03 15:50:30 +01:00
Mateusz Legięcki
a72feb4ff6
Fix Etcd rule: Insufficient Members 2020-01-03 12:58:25 +01:00
Samuel Berthe
97225efc72
Merge pull request #72 from dieswaytoofast/fix_instance_usage
Replace 'ip' by 'instance' in some rules
2019-12-27 15:53:00 +01:00
Mahesh Paolini-Subramanya
88b55f1dee Replace 'ip' by 'instance' in some rules
The metrics return 'instance', not 'ip'
This PR fixes the rules to use 'instance'
2019-12-27 09:18:16 -05:00
Samuel Berthe
580366554d
Merge pull request #71 from robert-will-brown/prometheus-alerts
Prometheus alerts
2019-12-19 20:39:06 +01:00
Rob Brown
ce51db2a6f Added Prometheus Not connected to alertmanager alert 2019-12-18 15:38:23 +00:00
Rob Brown
97ecdab26c Added "Disk will fill in 4 hours" alert 2019-12-18 15:32:52 +00:00
Samuel Berthe
6aeb60cb02
Merge pull request #69 from robert-will-brown/master
Added hardware temperature alerts
2019-12-13 16:31:24 +01:00
Rob Brown
58f843dbc6 Added hardware temperature alerts 2019-12-12 17:29:23 +00:00
Samuel Berthe
c2c9a58959
Merge pull request #68 from pepakriz/patch-1
Fixed `rabbitmq cluster down` rule
2019-12-02 19:29:31 +01:00
Josef Kříž
d10e30aed0
Fixed rabbitmq cluster down rule 2019-12-02 13:12:02 +01:00
Samuel Berthe
febd5f93e0
Merge pull request #67 from mattiasr/patch-1
Fixed typo in alertmanager.md
2019-11-19 21:38:24 +01:00
Mattias Ryrlén
6d33f32b43
Fixed typo in alertmanager.md
Assumed betch should be batch
2019-11-19 17:04:47 +01:00
Samuel Berthe
7eb68b1c4b
Merge pull request #65 from maxbrunet/patch-1
elasticsearch: Alert for no new docs on data nodes only
2019-11-07 15:20:37 +01:00
Maxime Brunet
1e2a35e058
elasticsearch: Alert for no new docs on data nodes only
We can have nodes that are not masters, but do not hold any data. For example the client/coordinating nodes set up by the `stable/elasticsearch` helm chart:
https://github.com/helm/charts/tree/master/stable/elasticsearch#client-and-coordinating-nodes

And we can also have nodes being data and master nodes simultaneously.
So I think, this alert has to look for `es_data_node="true"` to be correct.
2019-11-06 15:23:26 -05:00
Samuel Berthe
9306d8947f
PG: Alert in case of high rollback ratio (#64)
PG: Alert in case of high rollback ratio
2019-10-31 12:02:03 +01:00
Samuel Berthe
0c9a24a4e7 feat(pg): alert in case of high rollback ratio 2019-10-31 12:00:53 +01:00
Samuel Berthe
cca2872ade
typo 2019-10-31 11:47:57 +01:00
Samuel Berthe
768fac56ae
Merge pull request #62 from jdorel/patch-1
SllCertificateExpired synthax
2019-10-29 12:15:15 +01:00
Samuel Berthe
20744c3d3d
Update rules.yml 2019-10-29 12:12:43 +01:00
Samuel Berthe
f3be238ce0
Merge pull request #63 from jdorel/master
Add Kubernetes alerts from kube-state-metric exporter
2019-10-29 12:09:46 +01:00
Jonas DOREL
80aebe84e9 Add Kubernetes alerts from kube-state-metric exporter 2019-10-29 11:59:14 +01:00
Jonas DOREL
267a064d26
SllCertificateExpired synthax
Match other alert names, without the `has` part.
2019-10-29 11:39:01 +01:00
Samuel Berthe
82cf3ac1ef adding cassandra 2019-10-26 17:48:22 +02:00
Samuel Berthe
4f9e88bad4 improving blackbox alerts 2019-10-26 17:43:18 +02:00
Samuel Berthe
dfa5446cd5 adding comments in data structure 2019-10-26 17:25:35 +02:00
Samuel Berthe
c70a5089c7
Adding copy buttons (#61)
Adding copy buttons
2019-10-26 16:42:33 +02:00
Samuel Berthe
0139c3728f feat(ui): adding copy buttons 2019-10-26 16:41:11 +02:00
Samuel Berthe
3040fe56d5
Update README.md 2019-09-25 16:36:37 +02:00
Samuel Berthe
8f6c85774a
Clean data file 2019-09-25 16:36:10 +02:00
Samuel Berthe
018bd44db2
Merge pull request #56 from orange-cloudfoundry/master
Add OpenEBS and Minio alert
2019-09-25 16:31:08 +02:00
olivier beyler
e3628c5ba8 Add OpenEBS and Minio alert
Signed-off-by: olivier beyler <olivier.beyler@orange.com>
2019-09-25 16:13:44 +02:00
Samuel Berthe
1f4a1f8052
Updating Traefik -> Traefik v1.* 2019-09-25 14:23:16 +02:00
Samuel Berthe
6090463cda
Merge pull request #53 from Tri0L/patch-1
Fix typo in query of PG DeadLocks
2019-09-25 09:17:06 +02:00
Andrey Dudin
6d9866cefb
Fix typo in query of PG DeadLocks 2019-09-25 02:42:44 +03:00
Samuel Berthe
f7f94ed81e
Fixed time interval (10min->10m) 2019-09-13 18:08:04 +02:00
Samuel Berthe
d39acd0a86
Merge pull request #50 from timfeirg/master
free memory should include node_memory_Slab_bytes
2019-09-03 12:06:12 +02:00
timfeirg
37ef9a6f5c
free memory should include node_memory_Slab_bytes 2019-09-03 15:47:17 +08:00
Samuel Berthe
51e7231b3d fix(blackbox exporter): alert when http >= 400 instead of 300 2019-08-29 19:03:54 +02:00
Samuel Berthe
1d3e31669d
Merge pull request #48 from samber/dependabot/bundler/nokogiri-1.10.4
Bump nokogiri from 1.8.5 to 1.10.4
2019-08-23 13:26:02 +02:00
dependabot[bot]
5b7ecd2ce0
Bump nokogiri from 1.8.5 to 1.10.4
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.8.5 to 1.10.4.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/master/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.8.5...v1.10.4)

Signed-off-by: dependabot[bot] <support@github.com>
2019-08-23 11:23:49 +00:00
Samuel Berthe
4e2346e6bd
Add CollectorError alert for WMI exporter (#47)
Add CollectorError alert for WMI exporter
2019-08-23 13:18:40 +02:00
Jonas Kongslund
9bd8b3698f Add CollectorError alert for WMI exporter 2019-08-22 13:52:15 +04:00
Samuel Berthe
8e5d0ab71d
Merge pull request #45 from 0x46616c6b/traefik-alerts
add alerts for traefik
2019-08-08 16:51:22 +02:00
louis
e9f247783b add alerts for traefik 2019-08-08 14:32:47 +02:00