Questions tagged [alertmanager]

5 questions
1
vote
0 answers

Prometheus alert rule test fails with long intervals

I've got an alert rule: groups: - name: somename rules: - alert: CertificateExpiry expr: certificate_expires_in_days < 20 for: 1h labels: …
badbishop
  • 898
  • 3
  • 11
  • 21
0
votes
1 answer

Disabling email for Prometheus Alertmanager based on severity

We are monitoring a HPC computing cluster using a combination of Prometheus, Alertmanager and Grafana. On our machines, things like SWAP memory filling up to essentially the limit happen frequently, and while it is useful to see the corresponding…
ml4294
  • 101
  • 2
0
votes
1 answer

Alertmanager telegram config chat_id and cannot unmarshal errror

I am trying to configure alertmanager to send alerts to my telegram group. Following the configuration I have: global: resolve_timeout: 5m route: group_by: - job group_interval: 5m group_wait: 30s receiver: "telegram" repeat_interval:…
Jose
  • 11
  • 2
0
votes
1 answer

webhook MS Teams integration with Prometheus - request failed

I'm struggling with Microsoft Teams/Prometheus integration on K8s cluster. I used helm to start all components. I have correctly working Prometheus and Alertmanager. It seems that all works fine. Prometheus communicate with Alertmanager. Then…
RedBluff
  • 1
  • 2
0
votes
0 answers

Prometheus Alertmanager - Generating alerts

When alerts are based on metric data, like CPU or memory utilization, Prometheus is the obvious tool for pushing alerts into Alertmanager. There are other examples where the required alert is based on Boolean conditions, like "is DNS working". In…
Steve Crook
  • 101
  • 1