The browser version you are using is not recommended for this site.Please consider upgrading to the latest version of your browser by clicking one of the following links.
We are sorry, This PDF is available in download format only
Balancing the compute, storage, and network resources in an Apache Hadoop* cluster enabled the full benefit of the latest Intel® processors, solid state drives, Intel® Ethernet Converged Network Adapters, and the Intel® Distribution for Apache Hadoop Software. Building a balanced infrastructure from these components enabled reducing the time required to complete a workload from about four hours to about seven minutes, a reduction of approximately 97 percent. Results such as these pave the way for low-cost, near-real-time data analytics.
Transforming data analytics
Realizing Value from Big Data
Apache Hive* overview
Linda Feldt highlights big data research—video
The Intel® Distribution for Apache Hadoop* Software
Apache HDFS* overview.