The Prometheus monitoring system.
Questions tagged [prometheus]
117 questions
1
vote
0 answers
Prometheus: how to stop alerting on a dead system
I've got my backup server reporting the age of the last successful backup to Prometheus, and I alert when that gets too old. I've just permanently shut down a server, and of course Prometheus is now alerting.
What's the correct way to deal with…
![](../../users/profiles/2494.webp)
Michael Kohne
- 2,284
- 1
- 16
- 29
1
vote
2 answers
How to visualize current CPU usage of a pod with Grafana
Is there a way to visualize current CPU usage of a pod in a K8S cluster?
I tried different Prometheus metrics like namespace_pod_name_container_name:container_cpu_usage_seconds_total:sum_rate and other similar ones, but I always get average value…
![](../../users/profiles/270423.webp)
Volodymyr Melnyk
- 537
- 5
- 18
1
vote
2 answers
Kubernetes: horizontal auto-scaling based on metrics in another namespace
I'd like to set up horizontal auto-scaling for a deployment based on the metrics of the ingress-controller deployed in another namespace.
I have a deployment (petclinic) deployed in a certain namespace (petclinic).
I have an ingress controller…
![](../../users/profiles/270423.webp)
Volodymyr Melnyk
- 537
- 5
- 18
1
vote
2 answers
How to get CPU usage percentage for a namespace from Prometheus?
Our product lives in a Kubernetes cluster on our server. It is not in production yet, so there are multiple instances running in the cluster for different purposes, each in its own namespace. I need to run some load tests on one of the namespaces…
![](../../users/profiles/534941.webp)
zslim
- 121
- 1
- 1
- 5
1
vote
1 answer
Query 5 minute average from Prometheus
I have a query to get used memory in Prometheus as a 0.0-1.0 percent. I can alert on this directly but don't want alerts on a short burst, only a high level over time or an average over time exceeding the limit.
I was hoping to do this in the query,…
![](../../users/profiles/405852.webp)
virullius
- 988
- 8
- 22
1
vote
0 answers
iptables rules for dockerized prometheus stack and swarm
OS: ubuntu
Everything runns in containers (run via docker stack deploy)
2 servers running separate swarm
server - running grafana + prometheus (ip 1.1.1.1)
server - running node_exporter + cadvisor (ip 2.2.2.2)
Need to setup IPTABLES:
On 1st…
![](../../users/profiles/517683.webp)
Palino1611
- 11
- 3
1
vote
1 answer
How to check to exist a unit systemd on the servers by Prometheus?
I have a service 'service_1' on the servers. And I need checking by Prometheus existing this service on the servers. node_systemd_unit_state showing only metrics for existing services.
Maybe I can check existing file in…
![](../../users/profiles/509088.webp)
tutunak
- 63
- 7
1
vote
1 answer
Prometheus container connection refused
I have prometheus , nginx-vts , php and nginx-vts-exporter all in separated containers i'm trying to get metrics to prometheus from the exporter ,, it keeps telling me
Get http://127.0.0.1:9913/metrics: dial tcp 127.0.0.1:9913: connect:…
![](../../users/profiles/512090.webp)
Tyr_90
- 11
- 1
- 1
- 3
1
vote
1 answer
No prometheus in Gitlab Omnibus
Setup: Ubuntu 14.04 with all recent updates,
Gitlab Omnibus version 11.7.3 (latest)
~$ sudo grep prometheus /etc/gitlab/gitlab.rb
prometheus_monitoring['enable'] = true
prometheus['enable'] = true
prometheus['listen_address'] = ':9090'
The…
![](../../users/profiles/321077.webp)
wl2776
- 181
- 1
- 9
1
vote
1 answer
kubernetes: measuring resource/limit efficiency
I'd like to get a feel for how 'efficient' my deployment/jobs are at consuming the resources they request. i.e., if a job that only ends up using 1CPU at peak ends up requesting 320, I'd like to have a dashboard/alert/metric to chase down rogue…
![](../../users/profiles/30986.webp)
Eddie Parker
- 571
- 1
- 4
- 10
1
vote
1 answer
Prometheus alert not fired
I have setup 5 alerts in my Prometheus setup.
3 of them work as expected.
However, I have 2 that are never triggered. I am really confused and I need some help here.
So, the 2 rules that do not work are:
alert: CriticalDiskSpace
expr:…
![](../../users/profiles/102723.webp)
Peter
- 802
- 2
- 10
- 23
1
vote
1 answer
Prometheus with remote_write and backups
I'm building a prometheus stack that uses the remote_read/remote_write feature for long term storage, and I have some questions about if/how prometheus' tsdb needs to be backed up, given that it is using long term storage.
1) Do I need to back up…
![](../../users/profiles/269968.webp)
user307927
- 133
- 1
- 4
1
vote
0 answers
Is there possibility in Prometheus alerts calculate another alert status?
I want to make an ALERT with IF that contains expression that depends on another alert status. Is that ever possible somehow? I know about alert inhibition with alermanager but I need not to firing some alerts at all in some cases.
Prometheus…
![](../../users/profiles/139291.webp)
Shtlzut
- 280
- 2
- 9
1
vote
0 answers
Does Prometheus efficiently compress series of identical samples?
Does Prometheus efficiently store and de-duplicate long repeats of identical sample values for a metric?
I'm working with a system where some of the inputs change at a low frequency - say every 30 minutes - but others change at a high frequency like…
![](../../users/profiles/102814.webp)
Craig Ringer
- 10,553
- 9
- 38
- 59
1
vote
0 answers
Hostname instead of IPs for panel legends in Grafana with Prometheus metrics
I want to monitor different servers with Prometheus fed by exporters like node-exporter or postgres-exporter. The metrices should be presented in Grafana as dashboards and alerts.
The exporters will be imported by following targets in the…
![](../../users/profiles/558913.webp)
Vaiaro
- 111
- 3