Commit graph

983 commits

Author SHA1 Message Date
Samuel Berthe
640f06588d
Delete FUNDING.json 2024-10-05 18:21:35 +02:00
Samuel Berthe
cd5b39a1f0
Create FUNDING.json 2024-10-05 18:06:22 +02:00
dependabot[bot]
35596c866f
build(deps): bump webrick from 1.7.0 to 1.8.2 (#435)
Bumps [webrick](https://github.com/ruby/webrick) from 1.7.0 to 1.8.2.
- [Release notes](https://github.com/ruby/webrick/releases)
- [Commits](https://github.com/ruby/webrick/compare/v1.7.0...v1.8.2)

---
updated-dependencies:
- dependency-name: webrick
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-27 22:24:21 +02:00
Samuel Berthe
d6d6ae4ef8
fix: Gemfile to reduce vulnerabilities (#434)
The following vulnerabilities are fixed with an upgrade:
- https://snyk.io/vuln/SNYK-RUBY-WEBRICK-8068535

Co-authored-by: snyk-bot <snyk-bot@snyk.io>
2024-09-26 11:31:21 +02:00
dependabot[bot]
65a5f586cb
build(deps-dev): bump rexml from 3.3.3 to 3.3.6 (#431)
Bumps [rexml](https://github.com/ruby/rexml) from 3.3.3 to 3.3.6.
- [Release notes](https://github.com/ruby/rexml/releases)
- [Changelog](https://github.com/ruby/rexml/blob/master/NEWS.md)
- [Commits](https://github.com/ruby/rexml/compare/v3.3.3...v3.3.6)

---
updated-dependencies:
- dependency-name: rexml
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-09 20:09:20 +02:00
samber
4aa45dee05 Publish 2024-08-28 06:49:52 +00:00
Samuel Berthe
f08e8df514
oops 2024-08-28 08:48:42 +02:00
Samuel Berthe
995ab4d27a
Update rules.yml 2024-08-28 08:46:41 +02:00
Samuel Berthe
3bf8d6d824
fix: Gemfile to reduce vulnerabilities (#432)
The following vulnerabilities are fixed with an upgrade:
- https://snyk.io/vuln/SNYK-RUBY-REXML-7814166

Co-authored-by: snyk-bot <snyk-bot@snyk.io>
2024-08-24 10:42:21 +02:00
Somrat Dutta
8c0bdc2b24
feat: Add NATS and JetStream Prometheus alert rules (#430)
* feat: Add comprehensive NATS and JetStream Prometheus alert rules

- Added multiple Prometheus alert rules for monitoring NATS server and JetStream metrics.
- Included alerts for:
  - High connection count
  - High pending bytes
  - High subscriptions count
  - High routes count
  - High memory usage
  - Slow consumers
  - NATS server downtime
  - High CPU usage
  - High number of active connections
  - High JetStream store and memory usage
  - Subscription limits exceeded
  - High pending messages
  - Authentication timeouts
  - Errors in NATS (JetStream API errors)
  - JetStream consumers limit exceeded
  - Exceeding max payload size
  - Leaf node connection issues
  - Ping operations limit exceeded
  - Write deadline exceeded
- Ensured consistency between `exporter.yml` and `rules.yml` files.
- Improved overall NATS and JetStream monitoring to prevent performance degradation and ensure system reliability.

This commit enhances the visibility of NATS and JetStream operations by providing key metrics to alert on potential issues and optimize system performance.

* Update rules.yml

* - minor changes, rollback rules.yml
- address comment changes
- revert to old rules.yml as they are generated

* - minor changes, rollback rules.yml
- address comment changes
- revert to old rules.yml as they are generated

* fix indentation

---------

Co-authored-by: somratdutta <duttasomratand.com>
Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
Co-authored-by: somrat.dutta <somrat.dutta@nutanix.com>
2024-08-20 20:37:03 +02:00
samber
02687db33d Publish 2024-08-20 16:32:36 +00:00
Samuel Berthe
d1715de751
fix PostgresqlInvalidIndex rule 2024-08-20 18:31:18 +02:00
dependabot[bot]
61da73d517
build(deps-dev): bump rexml from 3.3.2 to 3.3.3 (#428)
Bumps [rexml](https://github.com/ruby/rexml) from 3.3.2 to 3.3.3.
- [Release notes](https://github.com/ruby/rexml/releases)
- [Changelog](https://github.com/ruby/rexml/blob/master/NEWS.md)
- [Commits](https://github.com/ruby/rexml/compare/v3.3.2...v3.3.3)

---
updated-dependencies:
- dependency-name: rexml
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-08-02 14:14:26 +02:00
dependabot[bot]
225607cf7f
build(deps-dev): bump nokogiri from 1.15.6 to 1.16.5 (#427)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.15.6 to 1.16.5.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.15.6...v1.16.5)

---
updated-dependencies:
- dependency-name: nokogiri
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-07-30 17:25:23 +02:00
Samuel Berthe
2c764df932
fix: Gemfile & Gemfile.lock to reduce vulnerabilities (#426)
The following vulnerabilities are fixed with an upgrade:
- https://snyk.io/vuln/SNYK-RUBY-REXML-7462086

Co-authored-by: snyk-bot <snyk-bot@snyk.io>
2024-07-18 10:14:45 +02:00
samber
58ade95b8b Publish 2024-07-02 07:34:59 +00:00
Samuel Berthe
47e74f65e0
Update rules.yml 2024-07-02 09:33:51 +02:00
Greg
9557d4b50e
feat(meilisearch): add basic set of rules (#425)
* feat(meilisearch): add basic meilisearch rules

* fix(query): use == instead of =

* fix(data): set correct name and use ==

* chore(meilisearch): remove index filter
2024-07-02 09:33:08 +02:00
Samuel Berthe
b6a6c2e313
Update README.md 2024-07-02 09:33:01 +02:00
samber
60c235975c Publish 2024-06-14 18:16:53 +00:00
Samuel Berthe
ca4fb01c6d
Update rules.yml 2024-06-14 20:15:44 +02:00
samber
1ee046b739 Publish 2024-06-06 20:54:49 +00:00
Samuel Berthe
1e4ea0b3e7
Update rules.yml 2024-06-06 22:53:29 +02:00
samber
8759c50440 Publish 2024-05-23 12:45:56 +00:00
Samuel Berthe
9b0ac7d230
Update rules.yml 2024-05-23 14:44:45 +02:00
dependabot[bot]
61a40270d9
build(deps-dev): bump rexml from 3.2.5 to 3.2.8 (#420)
Bumps [rexml](https://github.com/ruby/rexml) from 3.2.5 to 3.2.8.
- [Release notes](https://github.com/ruby/rexml/releases)
- [Changelog](https://github.com/ruby/rexml/blob/master/NEWS.md)
- [Commits](https://github.com/ruby/rexml/compare/v3.2.5...v3.2.8)

---
updated-dependencies:
- dependency-name: rexml
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-16 23:28:17 +02:00
samber
7dd767c4b4 Publish 2024-05-15 06:10:06 +00:00
Samuel Berthe
1adecd9ee7
Update rules.yml 2024-05-15 08:08:58 +02:00
Enes Yalınkaya
9877561b6c
fix elasticsearch rate rules (#418)
* fix elasticsearch rate rules

* fix

* fix

* fix
2024-05-15 08:07:55 +02:00
samber
826be5877f Publish 2024-05-14 18:44:11 +00:00
R.Sicart
262e451625
kube hpa lint and improvement (#417)
* fix: hpa alerts are using  label but the queries remove it

Signed-off-by: R.Sicart <roger.sicart@gmail.com>

* fix: hpa alert is using  label but the query removes it

Signed-off-by: R.Sicart <roger.sicart@gmail.com>

* feat: hpa scale max should not alert when min and max are the same

Signed-off-by: R.Sicart <roger.sicart@gmail.com>

---------

Signed-off-by: R.Sicart <roger.sicart@gmail.com>
2024-05-14 20:43:00 +02:00
samber
81079a2a7e Publish 2024-05-14 18:35:54 +00:00
R.Sicart
8460f9008e
fix: some kube api alert lint (#416)
* fix: apiserver regexp matchers are automatically fully anchored

Signed-off-by: R.Sicart <roger.sicart@gmail.com>

* fix: apiserver errors alert is using  label but the query removes it

Signed-off-by: R.Sicart <roger.sicart@gmail.com>

* fix: apiserver latency alert is using  label but the query removes it

Signed-off-by: R.Sicart <roger.sicart@gmail.com>

---------

Signed-off-by: R.Sicart <roger.sicart@gmail.com>
2024-05-14 20:34:43 +02:00
dependabot[bot]
4963331101
build(deps-dev): bump nokogiri from 1.16.2 to 1.16.5 (#415)
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.16.2 to 1.16.5.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.16.2...v1.16.5)

---
updated-dependencies:
- dependency-name: nokogiri
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-14 01:41:57 +02:00
samber
04886da968 Publish 2024-05-13 10:10:12 +00:00
Florian Schlichting
396083a2a1
Fix HaproxyBackendMaxActiveSession: look at current / limit (#413)
haproxy_backend_max_sessions is the maximum number of sessions ever encountered during the lifetime of the HAProxy process. That is, it will never go down until HAProxy is restarted, so the alert continues to fire even though the situation has cleared!

This doesn't make sense. Look at the currently active sessions instead.
2024-05-13 12:09:04 +02:00
samber
613401a960 Publish 2024-05-13 09:12:01 +00:00
Vijay Dharap
870bbd47d2
Fixed HPA rule to use more correct condition (#408)
* Fixed HPA rule to use more correct condition

* Update rules.yml

---------

Co-authored-by: Samuel Berthe <dev@samuel-berthe.fr>
2024-05-13 11:10:55 +02:00
Samuel Berthe
847143ecc9
Update README.md 2024-05-13 10:42:04 +02:00
samber
84b0569c97 Publish 2024-05-13 08:33:30 +00:00
Ali
2547288c13
Added Clickhouse (#412)
* Added Clickhouse

* Update rules.yml

Added reasonable time periods for each query to avoid false positives and in some cased give the system a short window to try to solve the issue.
Also changed the severity level of authentication alerts from critical to info which seems more appropriate

* Modified time period for alerts embedded-exporter.yml

I made a few adjustments in time periods.
See if they seem reasonable or not

* Replication alerts time periods were adjusted

IMHO, replication alerts must be sent right away.
2024-05-13 10:32:18 +02:00
samber
515fca9c10 Publish 2024-05-05 23:33:11 +00:00
enesyalinkaya
59e6a9165d
add new alerts for elasticsearch rules.yml (#411)
This commit adds new Prometheus alert definitions to monitor indexing and query metrics in Elasticsearch clusters. These alerts are essential for detecting performance issues related to indexing and querying activities.
2024-05-06 01:32:00 +02:00
samber
5c0963558a Publish 2024-05-02 18:49:56 +00:00
Sergey Shtoltz
aad1c4cd95
RedisOutOfConfiguredMaxmemory: checking if memory limit is set (#410) 2024-05-02 20:48:46 +02:00
samber
b77cb3467c Publish 2024-04-29 20:36:49 +00:00
Samuel Berthe
267c3e8e70
Update rules.yml 2024-04-29 22:35:43 +02:00
samber
6b05a59ad9 Publish 2024-03-26 15:57:31 +00:00
Rastislav Pôbiš
2494ccdf31
Added prepared statements mysqld-exporter alert (#407) 2024-03-26 16:56:15 +01:00
Samuel Berthe
85b102df08
Welcome @betterstack-community ✌️ 2024-03-21 16:25:24 +01:00