This thread contains a patchset. You're looking at the original emails,
but you may wish to use the patch review UI.
Review patch
6
2
[PATCH metrics.sr.ht] Add an alert for high rate of server errors
---
service_rules.yml | 11 +++++++++++
1 file changed, 11 insertions(+)
create mode 100644 service_rules.yml
diff --git a/service_rules.yml b/service_rules.yml
new file mode 100644
index 0000000..26fedf0
--- /dev/null
+++ b/service_rules.yml
@@ -0,0 +1,11 @@
+# vim: tw=2 sw=2 :
+groups:
+- name: service
+ rules:
+ - alert: High rate of 500 errors
+ expr: rate(http_requests_total{status="500"}[10m]) > 5 / 60
+ for: 2m
+ labels:
+ severity: important
+ annotations:
+ summary: "{{ $labels.instance }} has a high rate of 500 errors"
--
2.17.1
-- Email domain proudly hosted at https://migadu.com
Thanks! Can you bump this to "urgent" severity?
And increase the interval to 5m as well.
[PATCH metrics.sr.ht v2] Add an alert for high rate of server errors
---
service_rules.yml | 11 +++++++++++
1 file changed, 11 insertions(+)
create mode 100644 service_rules.yml
diff --git a/service_rules.yml b/service_rules.yml
new file mode 100644
index 0000000..2a71244
--- /dev/null
+++ b/service_rules.yml
@@ -0,0 +1,11 @@
+# vim: tw=2 sw=2 :
+groups:
+- name: service
+ rules:
+ - alert: High rate of 500 errors
+ expr: rate(http_requests_total{status="500"}[10m]) > 5 / 60
+ for: 5m
+ labels:
+ severity: urgent
+ annotations:
+ summary: "{{ $labels.instance }} has a high rate of 500 errors"
--
2.17.1
-- Email domain proudly hosted at https://migadu.com
Re: [PATCH metrics.sr.ht v2] Add an alert for high rate of server errors
Thanks!
To git@git.sr.ht:~sircmpwn/metrics.sr.ht
9c1389a..3435b2c master -> master
Re: [PATCH metrics.sr.ht v2] Add an alert for high rate of server errors
Re: [PATCH metrics.sr.ht v2] Add an alert for high rate of server errors
Nah, new rules just need manual intervention to enable (they have to be
added to prometheus.yml). I just took care of it.