Highest Voted 'prometheus' Questions - Server Fault Stack Exchange

1

vote

0 answers

Prometheus: how to stop alerting on a dead system

I've got my backup server reporting the age of the last successful backup to Prometheus, and I alert when that gets too old. I've just permanently shut down a server, and of course Prometheus is now alerting. What's the correct way to deal with…

prometheus

asked Oct 28 '19 at 11:10

Michael Kohne

2,284
1
16
29

1

vote

2 answers

How to visualize current CPU usage of a pod with Grafana

Is there a way to visualize current CPU usage of a pod in a K8S cluster? I tried different Prometheus metrics like namespace_pod_name_container_name:container_cpu_usage_seconds_total:sum_rate and other similar ones, but I always get average value…

monitoring kubernetes cpu-usage grafana prometheus

asked Sep 17 '19 at 15:37

Volodymyr Melnyk

537
5
18

1

vote

2 answers

Kubernetes: horizontal auto-scaling based on metrics in another namespace

I'd like to set up horizontal auto-scaling for a deployment based on the metrics of the ingress-controller deployed in another namespace. I have a deployment (petclinic) deployed in a certain namespace (petclinic). I have an ingress controller…

kubernetes autoscaling prometheus ingress

asked Aug 20 '19 at 13:11

Volodymyr Melnyk

537
5
18

1

vote

2 answers

How to get CPU usage percentage for a namespace from Prometheus?

Our product lives in a Kubernetes cluster on our server. It is not in production yet, so there are multiple instances running in the cluster for different purposes, each in its own namespace. I need to run some load tests on one of the namespaces…

kubernetes grafana prometheus

asked Aug 12 '19 at 15:15

zslim

121
1
1
5

1

vote

1 answer

Query 5 minute average from Prometheus

I have a query to get used memory in Prometheus as a 0.0-1.0 percent. I can alert on this directly but don't want alerts on a short burst, only a high level over time or an average over time exceeding the limit. I was hoping to do this in the query,…

prometheus

asked May 31 '19 at 14:25

virullius

988
8
22

1

vote

0 answers

iptables rules for dockerized prometheus stack and swarm

OS: Ubuntu. Everything runs in containers (run via docker stack deploy). 2 servers running separate swarm: one server running grafana + prometheus (ip 1.1.1.1), one server running node_exporter + cadvisor (ip 2.2.2.2). Need to set up iptables: On 1st…

iptables docker port prometheus

asked Apr 04 '19 at 09:34

Palino1611

11
3

1

vote

1 answer

How to check whether a systemd unit exists on the servers with Prometheus?

I have a service 'service_1' on the servers, and I need to check with Prometheus whether this service exists on them. node_systemd_unit_state only shows metrics for existing services. Maybe I can check for an existing file in…

monitoring systemd prometheus

asked Mar 06 '19 at 13:34

tutunak

63
7

1

vote

1 answer

Prometheus container connection refused

I have prometheus, nginx-vts, php and nginx-vts-exporter, all in separate containers. I'm trying to get metrics into prometheus from the exporter, but it keeps telling me: Get http://127.0.0.1:9913/metrics: dial tcp 127.0.0.1:9913: connect:…

nginx docker-compose prometheus

asked Mar 01 '19 at 14:47

Tyr_90

11
1
1
3

1

vote

1 answer

No prometheus in Gitlab Omnibus

Setup: Ubuntu 14.04 with all recent updates, Gitlab Omnibus version 11.7.3 (latest) ~$ sudo grep prometheus /etc/gitlab/gitlab.rb prometheus_monitoring['enable'] = true prometheus['enable'] = true prometheus['listen_address'] = ':9090' The…

gitlab prometheus

asked Feb 01 '19 at 08:37

wl2776

181
1
9

1

vote

1 answer

kubernetes: measuring resource/limit efficiency

I'd like to get a feel for how 'efficient' my deployment/jobs are at consuming the resources they request. i.e., if a job that only ends up using 1 CPU at peak ends up requesting 320, I'd like to have a dashboard/alert/metric to chase down rogue…

kubernetes grafana prometheus

asked Nov 29 '18 at 00:05

Eddie Parker

571
1
4
10

1

vote

1 answer

Prometheus alert not fired

I have set up 5 alerts in my Prometheus setup. 3 of them work as expected. However, I have 2 that are never triggered. I am really confused and I need some help here. So, the 2 rules that do not work are: alert: CriticalDiskSpace expr:…

prometheus

asked Oct 17 '18 at 15:19

Peter

802
2
10
23

1

vote

1 answer

Prometheus with remote_write and backups

I'm building a prometheus stack that uses the remote_read/remote_write feature for long term storage, and I have some questions about if/how prometheus' tsdb needs to be backed up, given that it is using long term storage. 1) Do I need to back up…

backup-restoration metrics prometheus

asked Apr 19 '18 at 17:24

user307927

133
1
4

1

vote

0 answers

Is it possible for a Prometheus alert to evaluate another alert's status?

I want to make an ALERT with an IF that contains an expression that depends on another alert's status. Is that possible somehow? I know about alert inhibition with Alertmanager, but I need some alerts not to fire at all in some cases. Prometheus…

alerting prometheus

asked Oct 03 '17 at 09:36

Shtlzut

280
2
9

1

vote

0 answers

Does Prometheus efficiently compress series of identical samples?

Does Prometheus efficiently store and de-duplicate long repeats of identical sample values for a metric? I'm working with a system where some of the inputs change at a low frequency - say every 30 minutes - but others change at a high frequency like…

prometheus metrics

asked May 26 '22 at 02:12

Craig Ringer

10,553
9
38
59

1

vote

0 answers

Hostname instead of IPs for panel legends in Grafana with Prometheus metrics

I want to monitor different servers with Prometheus fed by exporters like node-exporter or postgres-exporter. The metrics should be presented in Grafana as dashboards and alerts. The exporters will be imported by the following targets in the…

prometheus grafana

asked May 18 '22 at 10:31

Vaiaro

111
3

Questions tagged [prometheus]