Questions tagged [ganglia]

Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids.

Ganglia is based on a hierarchical design targeted at federations of clusters. It leverages widely used technologies such as XML for data representation, XDR for compact, portable data transport, and RRDtool for data storage and visualization. It uses carefully engineered data structures and algorithms to achieve very low per-node overheads and high concurrency. The implementation is robust, has been ported to an extensive set of operating systems and processor architectures, and is currently in use on thousands of clusters around the world. It has been used to link clusters across university campuses and around the world and can scale to handle clusters with 2000 nodes.

The ganglia system comprises two unique daemons, a PHP-based web front-end, and a few other small utility programs.

47 questions
9
votes
6 answers

Nagios graphing solutions vs Munin/Cacti/Ganglia

I've got a Nagios server set up for monitoring ~30 Windows servers. I want to add some trending charts. I've read that Nagios graphing plugins are simple and many people use separate, standalone charting/trending tools. What are the restrictions of…
sumek
  • 213
  • 1
  • 2
  • 6
5
votes
4 answers

Ganglia without multicast

I'm running Ganglia 3.1.2 on a network where there is no multicast (nor can I turn it on). Does anyone have an elegant solution for getting ganglia to work correctly? I found this: http://code.google.com/p/ganglia-multicast-hack/ but it does not…
jedberg
  • 2,291
  • 22
  • 21
4
votes
1 answer

How to set up alerts on Ganglia?

How can I set up Ganglia so that I get an email if a machine in the cluster is using, for example, greater than 95% of physical RAM?
Tanmay
  • 143
  • 1
  • 4
3
votes
0 answers

How to create custom graphs in Ganglia?

How can I create custom graphs in Ganglia? I have a couple of scripts that return integer data, and I want to graph them in Ganglia. I can get the output from my scripts every minute.
Ramesh Kumar
  • 1,690
  • 5
  • 18
  • 29
3
votes
3 answers

Using Ganglia on EC2

In Amazon EC2 I have a setup of proxies and 1 monitor (MON). I've installed gmond on the proxies and gmetad on MON. My data source in gmetad.conf on MON looks like this: data_source "proxies" proxy1:8654 proxy2:8654 proxy3:8654 In the proxies'…
hdanniel
  • 4,253
  • 22
  • 25
2
votes
0 answers

Can Ganglia's gmond be configured so that it doesn't need to be restarted if the Ganglia server is restarted?

I use Ganglia to monitor my computer cluster. When I restart the Ganglia server, the gmond daemons on the other servers in the cluster stop sending information to the Ganglia server. I have to run sudo…
Franck Dernoncourt
  • 940
  • 1
  • 12
  • 28
2
votes
0 answers

Ganglia gmetad segfaults after running for a while (on AWS EC2)

We are using Ganglia to monitor our cloud infrastructure on Amazon AWS. Everything is working correctly (metrics are flowing, etc.), except that occasionally the gmetad process will segfault out of the blue. The gmetad process is running on an…
SamBarham
  • 123
  • 1
  • 4
2
votes
0 answers

How to make Statsd talk to Ganglia on EC2 (localhost)

As per the topic, I'm just trying to make this simple setup work. The services are running fine, but as far as I can tell statsd doesn't send anything over to Ganglia. Ganglia is working fine, I guess, since I get the web interface and basic system info…
Dreen
  • 153
  • 4
2
votes
1 answer

Can ZenOSS integrate Ganglia smoothly?

I like Ganglia for its gmetric function and its multi-layer capability. But Ganglia lacks health checks, alerting, and similar server-monitoring functionality, so it would be great to bring these two species together. Sure, we…
chen
  • 319
  • 1
  • 5
  • 13
2
votes
1 answer

Redirecting syslog output for certain processes to a particular file

I must admit to being a complete syslog newbie. I haven't touched it until now, but I use its output on a daily basis, like most of us :) I currently have a problem with the gmetad daemon spamming my /var/log/messages file with messages like the…
ticktockhouse
  • 731
  • 1
  • 10
  • 17
2
votes
1 answer

Alternative Ganglia interfaces?

I'm evaluating various metric collection systems and appreciate just about everything in Ganglia, except for the less than clean interface. Does anyone know of alternative Ganglia interfaces or additions to the interface that allow custom grouping…
CarpeNoctem
  • 2,397
  • 4
  • 23
  • 32
2
votes
3 answers

Ganglia doesn't update graph

I started to notice my web interface hasn't updated the graph in hours. Each time I restart the gmond process on my clients, I see that the graphs do work. I come back an hour or so later and my graph is blank, just a white graph and nothing has…
sdot257
  • 3,039
  • 5
  • 29
  • 38
2
votes
2 answers

Monitoring System for the cloud?

I need a monitoring system, much like Ganglia / Nagios, that is built for the cloud. I need it to support: Adding / removing nodes dynamically. (A node shutting down does not imply node failure...) Dynamic node-based categorization, meaning a node can…
Maxim Veksler
  • 2,555
  • 10
  • 27
  • 32
2
votes
1 answer

What does cpu_aidle mean in Ganglia reports?

The cpu_aidle graph is mysteriously flat at roughly 12 percent on all machines reporting on a friend's cluster. Given how spiky everything else CPU-related is, this seems unusual. Can anyone shed light on what that number means?
jldugger
  • 14,122
  • 19
  • 73
  • 129
1
vote
2 answers

Ganglia's GPU Nvidia module: do we need to patch the ganglia-webfrontend?

I am trying to add the GPU Nvidia module in ganglia (/ganglia/gmond_python_modules/gpu/nvidia/). Do we need to apply the ganglia_web.patch patch? If I do not apply the patch, I don't see any GPU metrics when I go to http://localhost/ganglia/ If I…
Franck Dernoncourt
  • 940
  • 1
  • 12
  • 28