Dpsyco is a automated system to distribute system configurations to several computers. It is written mainly for the Debian distribution but should be portable (without too much difficulties) to other distributions or Unixes as well. It consists of a number of shell scripts to perform the desired actions. With it you can handle users, add ssh-public-keys, patch the system, update things using cfengine, install files (overriding other package files), and more.
Enhanced Cluster Tools (ECT) for Linux is a set of additional tools for the enhancement of Cluster Systems Management (CSM), which assists an administrator in managing a whole set of Linux machines. ECT for Linux includes features such as parallel install, hardware control, configuration file management, event monitoring, and more that help make it easier for an administrator to set up and maintain machines.
openMosix is a a set of extensions to the standard Linux kernel allowing you to build a cluster of out of off-the-shelf PC hardware. openMosix scales perfectly up to thousands of nodes. You do not need to modify your applications to benefit from your cluster (unlike PVM, MPI, Linda, etc.). Processes in openMosix migrate transparently between nodes and the cluster will always auto-balance.
chpox provides transparent checkpointing and restarting of processes on Linux clusters. It was originally designed for recovering tasks that have a long execution time (i.e. numerical simulations) in case of system crashes, power failures, etc. It may work with openMosix, is SMP safe, does not require program recompiling/relinking, and supports virtual memory, regular open files, pipes, Unix domain sockets, current directory, and child processes.
PCP is a system for replicating files on multiple nodes of a PC cluster. Replication is done by building an n-ary tree of TCP sockets and using parallelized, pipelined data transfers which use RSA authentication. For large file transfers or replication on many nodes, PCP provides highly efficient data transfers when compared to existing alternatives (e.g. NFS).
Warewulf is an operating system management toolkit designed to facilitate large scale deployments of homogeneous and heterogeneous systems on physical, virtual and cloud based infrastructures. Originally, the Warewulf project pioneered the concept of stateless computing in HPC, setting the standard for large-scale cluster provisioning. It provided two functions, provisioning and monitoring but the two functions did not communicate within Warewulf itself, nor was it possible to hook other functions directly into Warewulf itself. Today, Warewulf is more than just a basic provisioning and monitoring solution as it now implements an abstract, object-oriented data store and a modular interface that facilitates a highly extensible, customizable feature set. Current and planned modules include monitoring (operating system, services, filesystems, etc.), provisioning, power management, user management, configuration management, event/trigger handling and notification, scheduler integration, cloud services (both local and remote), etc.
GEXEC is a scalable cluster remote execution system which provides fast, RSA-authenticated remote execution of parallel and distributed jobs. It provides transparent forwarding of stdin, stdout, stderr, and signals to and from remote processes, provides local environment propagation, and is designed to be robust and to scale to systems of over 1000 nodes.
C3 Power Tools are a set of Python scripts that ease the use and maintenance of single and multiple clusters. Of the seven core tools, only two are restricted to root use. The C3 tools stand out from similar cluster tools in that they transparently support multiple clusters as easily as a single cluster. These tools are so flexible that they can also be used on groups of workstations not configured as a cluster.
Multixterm creates multiple xterms that can be driven together or separately. It can be used to login via SSH to multiple hosts and control them simultaneously, or for ad hoc things where you want to see the results as you type. Each xterm may also be driven separately. Multixterm is scriptable so that you can easily fire up, for example, a dozen xterms with a single command, tiled nicely on your screen. In addition to SSH, multixterm can drive rlogin, telnet, passwd, or any program that runs in an xterm.