Questions tagged [prometheus]

The Prometheus monitoring system.

117 questions
1
vote
0 answers

Prometheus: how to stop alerting on a dead system

I've got my backup server reporting the age of the last successful backup to Prometheus, and I alert when that gets too old. I've just permanently shut down a server, and of course Prometheus is now alerting. What's the correct way to deal with…
Michael Kohne
  • 2,284
  • 1
  • 16
  • 29
1
vote
2 answers

How to visualize current CPU usage of a pod with Grafana

Is there a way to visualize current CPU usage of a pod in a K8S cluster? I tried different Prometheus metrics like namespace_pod_name_container_name:container_cpu_usage_seconds_total:sum_rate and other similar ones, but I always get average value…
1
vote
2 answers

Kubernetes: horizontal auto-scaling based on metrics in another namespace

I'd like to set up horizontal auto-scaling for a deployment based on the metrics of the ingress-controller deployed in another namespace. I have a deployment (petclinic) deployed in a certain namespace (petclinic). I have an ingress controller…
Volodymyr Melnyk
  • 537
  • 5
  • 18
1
vote
2 answers

How to get CPU usage percentage for a namespace from Prometheus?

Our product lives in a Kubernetes cluster on our server. It is not in production yet, so there are multiple instances running in the cluster for different purposes, each in its own namespace. I need to run some load tests on one of the namespaces…
zslim
  • 121
  • 1
  • 1
  • 5
1
vote
1 answer

Query 5 minute average from Prometheus

I have a query to get used memory in Prometheus as a 0.0-1.0 percent. I can alert on this directly but don't want alerts on a short burst, only a high level over time or an average over time exceeding the limit. I was hoping to do this in the query,…
virullius
  • 988
  • 8
  • 22
1
vote
0 answers

iptables rules for dockerized prometheus stack and swarm

OS: ubuntu Everything runns in containers (run via docker stack deploy) 2 servers running separate swarm server - running grafana + prometheus (ip 1.1.1.1) server - running node_exporter + cadvisor (ip 2.2.2.2) Need to setup IPTABLES: On 1st…
Palino1611
  • 11
  • 3
1
vote
1 answer

How to check to exist a unit systemd on the servers by Prometheus?

I have a service 'service_1' on the servers. And I need checking by Prometheus existing this service on the servers. node_systemd_unit_state showing only metrics for existing services. Maybe I can check existing file in…
tutunak
  • 63
  • 7
1
vote
1 answer

Prometheus container connection refused

I have prometheus , nginx-vts , php and nginx-vts-exporter all in separated containers i'm trying to get metrics to prometheus from the exporter ,, it keeps telling me Get http://127.0.0.1:9913/metrics: dial tcp 127.0.0.1:9913: connect:…
Tyr_90
  • 11
  • 1
  • 1
  • 3
1
vote
1 answer

No prometheus in Gitlab Omnibus

Setup: Ubuntu 14.04 with all recent updates, Gitlab Omnibus version 11.7.3 (latest) ~$ sudo grep prometheus /etc/gitlab/gitlab.rb prometheus_monitoring['enable'] = true prometheus['enable'] = true prometheus['listen_address'] = ':9090' The…
wl2776
  • 181
  • 1
  • 9
1
vote
1 answer

kubernetes: measuring resource/limit efficiency

I'd like to get a feel for how 'efficient' my deployment/jobs are at consuming the resources they request. i.e., if a job that only ends up using 1CPU at peak ends up requesting 320, I'd like to have a dashboard/alert/metric to chase down rogue…
Eddie Parker
  • 571
  • 1
  • 4
  • 10
1
vote
1 answer

Prometheus alert not fired

I have setup 5 alerts in my Prometheus setup. 3 of them work as expected. However, I have 2 that are never triggered. I am really confused and I need some help here. So, the 2 rules that do not work are: alert: CriticalDiskSpace expr:…
Peter
  • 802
  • 2
  • 10
  • 23
1
vote
1 answer

Prometheus with remote_write and backups

I'm building a prometheus stack that uses the remote_read/remote_write feature for long term storage, and I have some questions about if/how prometheus' tsdb needs to be backed up, given that it is using long term storage. 1) Do I need to back up…
user307927
  • 133
  • 1
  • 4
1
vote
0 answers

Is there possibility in Prometheus alerts calculate another alert status?

I want to make an ALERT with IF that contains expression that depends on another alert status. Is that ever possible somehow? I know about alert inhibition with alermanager but I need not to firing some alerts at all in some cases. Prometheus…
Shtlzut
  • 280
  • 2
  • 9
1
vote
0 answers

Does Prometheus efficiently compress series of identical samples?

Does Prometheus efficiently store and de-duplicate long repeats of identical sample values for a metric? I'm working with a system where some of the inputs change at a low frequency - say every 30 minutes - but others change at a high frequency like…
Craig Ringer
  • 10,553
  • 9
  • 38
  • 59
1
vote
0 answers

Hostname instead of IPs for panel legends in Grafana with Prometheus metrics

I want to monitor different servers with Prometheus fed by exporters like node-exporter or postgres-exporter. The metrices should be presented in Grafana as dashboards and alerts. The exporters will be imported by following targets in the…
Vaiaro
  • 111
  • 3