Solve Your Biggest Challenges Faster
Integrated architecture for powerful performance
The Intel® Xeon Phi™ processor is a bootable host processor that delivers massive parallelism and vectorization to support the most demanding high-performance computing applications. The integrated and power-efficient architecture delivers significantly more compute per unit of energy consumed versus comparable platforms to give you an improved total cost of ownership.¹ The integration of memory and fabric topples the memory wall and reduces cost to help you solve your biggest challenges faster.
Manufactured on Intel’s 14nm process technology, the Intel® Xeon Phi™ processor provides up to 72 out-of-order cores, Intel® Advanced Vector Extensions 512 (Intel® AVX-512) instructions, and up to 16GB of on-package high-bandwidth memory, with support for up to 384GB of DDR4 platform memory. The result of this cutting-edge architecture is over 3 double-precision teraFLOPS (trillions of floating-point operations per second) at a mere 215W per processor.
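As a rough sanity check on the 3 teraFLOPS figure, theoretical peak double-precision throughput can be estimated as cores × vector units per core × vector lanes × 2 (a fused multiply-add counts as two operations) × clock frequency. The sketch below assumes two AVX-512 units per core and a 1.5 GHz clock. Neither value is stated in the text above, so treat both as illustrative assumptions, and `peak_dp_gflops` as a hypothetical helper name.

```python
def peak_dp_gflops(cores, clock_ghz, vector_units_per_core=2, vector_bits=512):
    """Estimate theoretical peak double-precision GFLOPS.

    ASSUMPTIONS (not from the product text): vector_units_per_core=2 and the
    clock passed by the caller. The 512-bit vector width comes from AVX-512.
    """
    lanes = vector_bits // 64                 # 64-bit doubles per vector register
    flops_per_cycle = vector_units_per_core * lanes * 2  # FMA = multiply + add
    return cores * clock_ghz * flops_per_cycle


if __name__ == "__main__":
    # 72 cores is from the text; 1.5 GHz is an assumed clock for illustration.
    estimate = peak_dp_gflops(cores=72, clock_ghz=1.5)
    print(f"Estimated peak: {estimate:.0f} GFLOPS")
```

Under these assumptions the estimate lands above 3000 GFLOPS, which is consistent with the "over 3 double-precision teraFLOPS" claim. Measured throughput on real workloads is lower than a theoretical peak.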
The integrated architecture of the Intel® Xeon Phi™ processor improves performance and lowers your costs by reducing bottlenecks and system complexity. The Intel® Xeon Phi™ processor delivers up to 490 GB/s of sustained memory bandwidth without the need for additional discrete memory cards, and 100 GB/s I/O without the added cost and power needed for two fabric adapters.
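One way to see why memory bandwidth matters alongside raw compute is a simple roofline-style estimate, which is a standard modeling technique and not something the product text itself describes. The sketch below uses the two headline figures from this page (3 teraFLOPS taken as a round lower bound, and 490 GB/s). The function names and the sample arithmetic intensities are hypothetical, chosen only for illustration.

```python
PEAK_GFLOPS = 3000.0   # "over 3 double-precision teraFLOPS", used as a round figure
MEM_BW_GBS = 490.0     # "up to 490 GB/s of sustained memory bandwidth"


def machine_balance(peak_gflops=PEAK_GFLOPS, mem_bw_gbs=MEM_BW_GBS):
    """FLOPs per byte a kernel must perform before compute becomes the limit."""
    return peak_gflops / mem_bw_gbs


def attainable_gflops(flops_per_byte, peak_gflops=PEAK_GFLOPS, mem_bw_gbs=MEM_BW_GBS):
    """Roofline bound: the lower of the compute ceiling and the bandwidth ceiling."""
    return min(peak_gflops, mem_bw_gbs * flops_per_byte)


if __name__ == "__main__":
    print(f"Machine balance: {machine_balance():.2f} FLOP/byte")
    # Hypothetical arithmetic intensities, for illustration only.
    for intensity in (0.125, 1.0, 10.0):
        print(f"{intensity:>6} FLOP/byte -> {attainable_gflops(intensity):.2f} GFLOPS")
```

Kernels whose arithmetic intensity falls below the machine balance are limited by memory bandwidth in this model, so higher sustained bandwidth raises their ceiling directly.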
Supported by a comprehensive Intel roadmap, the Intel® Xeon Phi™ processor is a future-ready solution that maximizes your return on investment by using open standards for code that is flexible, portable, and reusable.
Intel® Xeon® processors deliver leading performance for workloads that mix parallel and serial components; for applications that require high parallelism and vectorization, however, the Intel® Xeon Phi™ processor is the right tool for the job. The applications that see the greatest improvement make extensive use of the 72 cores and their ultra-wide vector capabilities (Intel® AVX-512). Segments with highly parallel applications include animation, energy, finance, life sciences, manufacturing, medical, public sector, weather, and more.
For a list of optimized applications, please visit the Application Showcase.
The Intel® Xeon Phi™ processor is a foundational element of the Intel® Scalable System Framework (Intel® SSF), which combines compute, memory/storage, fabric, and software to reduce system bottlenecks and complexity. Intel® SSF is a holistic solution for developing high-performance, balanced, efficient, and reliable HPC systems.
¹ Based on a comparison with a 2-socket Intel® Xeon® E5-2697 v4 system running DGEMM. The Intel® Xeon Phi™ 7250 was measured at 2070 GFLOPS at 215 W, versus 1054 GFLOPS at 290 W for the E5-2697 v4 system. Source: Intel measured or estimated as of March 2016.
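The energy-efficiency comparison in this footnote reduces to two divisions and a ratio. The short sketch below works through the arithmetic using only the four figures quoted above; the function and variable names are hypothetical.

```python
def gflops_per_watt(gflops, watts):
    """Performance per watt for a single measured system."""
    return gflops / watts


# Figures quoted in the footnote (DGEMM throughput and power).
xeon_phi_7250 = gflops_per_watt(2070, 215)
xeon_e5_2697_v4_2s = gflops_per_watt(1054, 290)

# How many times more DGEMM throughput per watt the Xeon Phi system delivers.
efficiency_ratio = xeon_phi_7250 / xeon_e5_2697_v4_2s

if __name__ == "__main__":
    print(f"Xeon Phi 7250:        {xeon_phi_7250:.2f} GFLOPS/W")
    print(f"2 x Xeon E5-2697 v4:  {xeon_e5_2697_v4_2s:.2f} GFLOPS/W")
    print(f"Ratio:                {efficiency_ratio:.2f}x")
```

On these numbers the Xeon Phi result works out to roughly 9.6 GFLOPS per watt against roughly 3.6 for the 2-socket system, a ratio of about 2.6. This applies to the DGEMM measurement only and does not by itself predict other workloads.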
Intel® Xeon® E5-2697 v4 Configuration Parameters:
1-Node, 2 x Intel® Xeon® Processor E5-2697 v4 on Grantley-EP (Wellsburg) with 128 GB Total Memory on Red Hat Enterprise Linux* 7.1 kernel 3.10.0-229 using stream_omp v5.4 with Intel compiler 22.214.171.124 with following command: icc stream_omp.c -O3 -openmp -o stream_omp -static -freestanding -o stream_omp_v5.4_IC126.96.36.199_80M.
Intel® Xeon Phi™ Processor Configuration Parameters:
Platform Used Inside Intel for Testing: Intel Adams Pass Product Concept Board (ADP PC), 96 GB DDR4 (6 x 16GB @ 2133 MHz)
BIOS: CRB BIOS 08.R00.RC085
Processors used for this edition:
OS: RHEL* 7
Kernel options: noreplace-paravirt idle=halt mce=on
Environment Variable(s): See how each individual workload was executed for specific environment variables
KNL Self Boot Software Package MPSP 1.2.2
MICPERF 1.3.0 Early Release
ComposerXE 2016 or equivalent redistributable package installed
MKL-based HPL Package 11.3.2.009
Intel MPI version 5.1.2-150
DGEMM: 20000 x 20000 or 26000 x 26000
SGEMM: 30000 x 30000
LINPACK Problem Size: 100000
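To give the problem sizes above some scale, the sketch below estimates the operation count and memory footprint they imply. It assumes square matrices, the conventional 2·N³ operation count for a GEMM, three resident matrices (A, B, and C) per GEMM, and one N × N double-precision matrix for LINPACK. These are common conventions rather than details stated in the configuration, and the helper names are hypothetical.

```python
def gemm_flops(n):
    """Conventional operation count for an n x n by n x n matrix multiply."""
    return 2 * n ** 3


def gemm_bytes(n, bytes_per_element):
    """Memory for the three n x n matrices A, B, and C."""
    return 3 * n * n * bytes_per_element


def linpack_matrix_bytes(n):
    """Memory for one n x n double-precision coefficient matrix."""
    return n * n * 8


if __name__ == "__main__":
    GB = 1e9  # decimal gigabytes
    for label, n, width in (("DGEMM", 20000, 8), ("DGEMM", 26000, 8), ("SGEMM", 30000, 4)):
        print(f"{label} {n} x {n}: {gemm_flops(n):.3e} FLOP, "
              f"{gemm_bytes(n, width) / GB:.2f} GB")
    print(f"LINPACK N=100000: {linpack_matrix_bytes(100000) / GB:.0f} GB matrix")
```

Under these assumptions each GEMM working set is in the range of roughly 10 to 16 GB, comparable to the 16GB of on-package memory. The LINPACK matrix is about 80 GB, so it must live in the 96 GB of DDR4 listed for the test platform.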