MLPACK is a C++ machine learning library with an emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and maximum flexibility for expert users. It contains algorithms such as k-means, Gaussian mixture models, hidden Markov models, density estimation trees, kernel PCA, locality-sensitive hashing, sparse coding, linear regression and least-angle regression.
Proto Balance Mail is an enterprise SMTP cluster solution that supports distribution of email accounts. It scales up to 1,000,000 mailboxes apportioned over up to 125 backend mail servers (8000 mailboxes per server). No NFS or SAN is required. SOA is configurable with SOAP/XML. Anti-spam settings can be set per-user. Grey-listing is supported. Mal-ware is automatically detected and infected client PCs are automatically black-listed. POP load balancing is done. SMTP AUTH is supported. There is a Web-based management interface. Spam blocking is done by on-the-fly connection behavior analysis. It handles up to 10,000 concurrent SMTP connections. Streamlined CRM integration is done with HTTP+XML posts. Email-alias lists, forwarding, and out-of-office auto-reply are supported.
StarCluster is a utility for creating traditional computing clusters used in research labs or for general distributed computing applications on Amazon's Elastic Compute Cloud (EC2). It uses a simple configuration file provided by the user to request cloud resources from Amazon and to automatically configure them with a queuing system, an NFS shared /home directory, passwordless SSH, OpenMPI, and ~140GB scratch disk space. It consists of a Python library and a simple command line interface to the library. For end-users, the command line interface provides simple intuitive options for getting started with distributed computing on EC2 (i.e. starting/stopping clusters, managing AMIs, etc). For developers, the library wraps the EC2 API to provide a simplified interface for launching/terminating nodes, executing commands on the nodes, copying files to/from the nodes, etc.
jmemcached is a fast network available cache daemon. It is protocol-compatible with memcached, but written in Java and suitable for applications with portability concerns, where Java is the preferred solution, or for using the memcached protocol in embedded applications with alternate storage engines. Existing clients for memcache work unmodified. It can run as a standalone daemon or be embedded inside an existing Java application.