A short Ganglia overview for sysadmins new to Linux clusters.
If anybody is interested, I wrote a general overview of cluster monitoring, with Ganglia as the main example (including how to add your own metrics).
Quick read + links for first-timers.
Thanks for this tool.
I wonder if people use it on clusters larger than 100 nodes - I'd guess that many nodes would be hard to handle in a GUI. Instead, I use a simple script with a 'for' loop that connects to every node over ssh and runs a command. It works fine so far, but perhaps there is a better way.
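One possible improvement on a sequential loop is to run the ssh calls concurrently, so a slow or dead node does not hold up the rest. Below is a minimal sketch, not a tested production tool. The function names `ssh_run` and `run_on_nodes`, the worker count, and the timeouts are all illustrative assumptions, and it assumes key-based ssh login already works from the machine running it.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def ssh_run(node, command, timeout=30):
    """Run `command` on one node over ssh; return (exit_code, combined_output)."""
    try:
        proc = subprocess.run(
            # BatchMode=yes fails instead of prompting for a password;
            # ConnectTimeout=5 gives up quickly on unreachable nodes.
            ["ssh", "-o", "BatchMode=yes", "-o", "ConnectTimeout=5", node, command],
            capture_output=True,
            text=True,
            timeout=timeout,
        )
        return proc.returncode, (proc.stdout + proc.stderr).strip()
    except subprocess.TimeoutExpired:
        # -1 is an arbitrary marker chosen for this sketch
        return -1, "timed out"


def run_on_nodes(nodes, command, runner=ssh_run, max_workers=16):
    """Run `command` on every node concurrently; return {node: (exit_code, output)}."""
    nodes = list(nodes)  # materialize so the input can be iterated twice
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so results line up with nodes
        results = list(pool.map(lambda node: runner(node, command), nodes))
    return dict(zip(nodes, results))
```

For example, `run_on_nodes(["node001", "node002"], "uptime")` (hypothetical hostnames) returns a dict that can be printed or filtered for non-zero exit codes. The `runner` parameter exists only so the ssh call can be swapped out, for instance when trying the function without a real cluster.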