April 16, 2013
ScaleOut hServer™ Opens Up Hadoop Analysis for Live Data
Developers now can perform Hadoop MapReduce analytics on fast changing, in-memory data.
BELLEVUE, Wash., April 16, 2013 - ScaleOut Software, a leading provider of in-memory data grids (IMDGs), today announced the general availability of ScaleOut hServer, an IMDG that enables Hadoop analysis of grid-based data. ScaleOut hServer includes a specialized version of ScaleOut’s IMDG plus open-source API libraries to give Hadoop programs access to fast-changing, live data held in hServer’s IMDG. This release of hServer is the first in a series of products from ScaleOut Software designed to bring real-time analytics performance to Hadoop.
ScaleOut hServer enables the storage of live data in an IMDG, where it can be updated directly by applications using hServer’s APIs while simultaneously being accessed by Hadoop programs for analysis. Unlike conventional Hadoop usage, which analyzes static data sets, the ability to continuously perform Hadoop MapReduce analysis on live data enables important trends to be spotted as they occur. In addition, hosting data in hServer’s in-memory data grid dramatically reduces access latency in comparison to the use of file systems and database servers to hold data sets for analysis by Hadoop.
ScaleOut hServer also provides a fully transparent, distributed cache for HDFS data, using memory-based storage to eliminate file I/O and accelerate data access for Hadoop’s MapReduce. Tests have demonstrated an 11X reduction in access latency for benchmark data sets. ScaleOut hServer automatically retrieves and stores HDFS data as key/value pairs in its IMDG, enabling subsequent analyses to bypass HDFS and access data directly from the distributed cache. Only a two-line code change is required for a Hadoop program to use hServer as an HDFS cache.
“While it’s a powerful platform for analyzing large, static data sets, Hadoop has always been limited by its inability to perform analytics on live data,” said Bill Bain, ScaleOut Software CEO. “There is an increasing drumbeat for real-time analytics using Hadoop, and we’re excited to take an important step towards meeting that need with this release.”
A recent survey commissioned by ScaleOut Software determined that 93% of respondents felt that their organizations required or would benefit from real-time data analytics on the Hadoop platform. In addition, 83 percent of Hadoop users run analyses multiple times on the same data set, and more than 61 percent of the data sets being analyzed are smaller than 10TB. This survey data supports the need for products like hServer that help Hadoop perform real-time analytics.
ScaleOut hServer Available in Both Community and Commercial Editions
ScaleOut hServer will be available in both a free community edition and in several commercial editions. The community edition enables up to a four-server combined Hadoop/hServer grid for analyzing memory-based data sets of up to 256GB. It will be supported by a forum site sponsored by ScaleOut Software. The commercial editions will be licensed using an annual subscription-pricing model based on the size of the deployment and will include ScaleOut’s standard support and maintenance, with additional support options available to users.
About ScaleOut Software, Inc.
ScaleOut Software develops software products that provide scalable, highly available memory-based storage and analysis for workload data in server farms and compute grids. It has offices in Bellevue, Washington and Beaverton, Oregon. The company was founded by Dr. William L. Bain, whose previous company, Valence Research, developed and distributed Web load-balancing software that was acquired by Microsoft Corporation and is now called Network Load Balancing within the Windows Server operating system.
LEWIS PR for ScaleOut Software