Intel 5th Gen Xeon AVX-512 Comparison

2 x INTEL XEON PLATINUM 8592 AVX-512 Emerald Rapids vs. Sapphire Rapids by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312146-NE-INTELAVX523
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

CPU Massive 2 Tests
Creator Workloads 6 Tests
HPC - High Performance Computing 4 Tests
Machine Learning 3 Tests
Multi-Core 6 Tests
Intel oneAPI 6 Tests
Python Tests 2 Tests
Raytracing 2 Tests
Renderers 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Comparison
Transpose Comparison

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Sapphire Rapids: AVX-512 On
December 02 2023
  3 Hours, 51 Minutes
Sapphire Rapids: AVX-512 Off
December 03 2023
  4 Hours, 20 Minutes
Emerald Rapids: AVX-512 On
December 11 2023
  4 Hours, 22 Minutes
Emerald Rapids: AVX-512 Off
December 11 2023
  4 Hours, 57 Minutes
Invert Hiding All Results Option
  4 Hours, 23 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen ResolutionSapphire RapidsEmerald Rapids AVX-512 On AVX-512 Off AVX-512 On AVX-512 Off2 x Intel Xeon Platinum 8490H @ 3.50GHz (120 Cores / 240 Threads)Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS)Intel Device 1bce1008GB3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Intel X710 for 10GBASE-TUbuntu 23.106.5.0-13-generic (x86_64)GCC 13.2.0ext41920x10802 x INTEL XEON PLATINUM 8592+ @ 3.90GHz (128 Cores / 256 Threads)Quanta Cloud S6Q-MB-MPS (3B05.TEL4P1 BIOS)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Sapphire Rapids: AVX-512 On: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004d0- Sapphire Rapids: AVX-512 Off: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004d0- Emerald Rapids: AVX-512 On: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x21000161- Emerald Rapids: AVX-512 Off: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x21000161Python Details- Python 3.11.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

y-cruncher: 5By-cruncher: 500My-cruncher: 1Bminibude: OpenMP - BM1minibude: OpenMP - BM1minibude: OpenMP - BM2minibude: OpenMP - BM2tensorflow: CPU - 16 - ResNet-50tensorflow: CPU - 32 - ResNet-50tensorflow: CPU - 64 - ResNet-50onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonembree: Pathtracer ISPC - Asian Dragon Objopenvkl: vklBenchmarkCPU ISPCospray: gravity_spheres_volume/dim_512/ao/real_timeospray: gravity_spheres_volume/dim_512/scivis/real_timeospray: gravity_spheres_volume/dim_512/pathtracer/real_timeospray-studio: 1 - 4K - 1 - Path Tracer - CPUospray-studio: 3 - 4K - 1 - Path Tracer - CPUospray-studio: 1 - 4K - 16 - Path Tracer - CPUospray-studio: 1 - 4K - 32 - Path Tracer - CPUospray-studio: 3 - 4K - 16 - Path Tracer - CPUospray-studio: 3 - 4K - 32 - Path Tracer - CPUospray-studio: 1 - 1080p - 1 - Path Tracer - CPUospray-studio: 3 - 1080p - 1 - Path Tracer - CPUospray-studio: 1 - 1080p - 16 - Path Tracer - CPUospray-studio: 1 - 1080p - 32 - Path Tracer - CPUospray-studio: 3 - 1080p - 16 - Path Tracer - CPUospray-studio: 3 - 1080p - 32 - Path Tracer - CPUSapphire RapidsEmerald Rapids AVX-512 On AVX-512 Off AVX-512 On AVX-512 Off28.6042.2934.9143418.575136.7433466.394138.65666.1885.64103.28830.959218.62137.09637.7347.01637.7847.003887.307.70412.86290.1514944.268.021528.7319.605759.7120.7326917.014.3820980.695.711894.4963.27742.5140.1947113.292.537556.5215.764381.7527.33144820.350.37151891.460.29124.0863152.5351131.8466270535.888235.243215.0832889106214054330261679638483226268354570894231845736.6372.9076.3382090.44583.6182214.18288.56747.2257.8468.42896.52921.891355.33214.94139.44213.06140.641749.9817.1285.02352.086859.8617.46923.4732.453688.3232.512463.9848.4813852.888.651645.0372.89212.63140.688773.9813.662564.0911.691075.65111.6365805.381.75129576.780.57113.1300141.1111122.3348215523.658522.52119.225211035122616486383091960444460263309416683264917993427.0582.7545.2335313.366212.5354975.695199.02849.2672.0697.851331.74236.14135.37747.4942.76747.7742.744765.956.69538.27237.3618384.836.951884.9916.958921.9714.3332234.043.9324769.105.152389.5553.511097.8229.0749132.992.4410161.3112.555105.9524.87114369.660.46123447.740.4151.4667200.9029174.3843324542.654541.610917.040974988612125238251434532723189223301760533574714633.6803.1096.5403392.568135.7033112.005124.48039.0653.6066.082341.1424.671287.11238.22134.18237.77134.442116.6415.1094.57337.698208.8815.581159.9327.564277.3129.912822.7145.3215101.298.461882.9767.93276.63115.359439.9213.552988.9110.691166.18109.7563672.971.7101244.000.63137.6227181.8347159.4832236523.410020.937410.566748931055142573360317151387872252793585721842428596OpenBenchmarking.org

Y-Cruncher

Emerald RapidsSapphire RapidsOpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 5BAVX-512 OffAVX-512 On816243240SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 333.6827.0636.6428.60

Emerald RapidsSapphire RapidsOpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 500MAVX-512 OffAVX-512 On0.69951.3992.09852.7983.4975SE +/- 0.017, N = 7SE +/- 0.022, N = 7SE +/- 0.005, N = 7SE +/- 0.003, N = 73.1092.7542.9072.293

Emerald RapidsSapphire RapidsOpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 1BAVX-512 OffAVX-512 On246810SE +/- 0.026, N = 5SE +/- 0.021, N = 5SE +/- 0.007, N = 5SE +/- 0.003, N = 56.5405.2336.3384.914

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OffAVX-512 On11002200330044005500SE +/- 0.74, N = 7SE +/- 3.87, N = 8SE +/- 17.21, N = 5SE +/- 12.90, N = 73392.575313.372090.453418.581. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Emerald RapidsSapphire RapidsOpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OffAVX-512 On50100150200250SE +/- 0.03, N = 7SE +/- 0.15, N = 8SE +/- 0.69, N = 5SE +/- 0.52, N = 7135.70212.5483.62136.741. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Emerald RapidsSapphire RapidsOpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OffAVX-512 On11002200330044005500SE +/- 24.22, N = 15SE +/- 49.58, N = 15SE +/- 5.42, N = 3SE +/- 17.85, N = 33112.014975.702214.183466.391. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Emerald RapidsSapphire RapidsOpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OffAVX-512 On4080120160200SE +/- 0.97, N = 15SE +/- 1.98, N = 15SE +/- 0.22, N = 3SE +/- 0.71, N = 3124.48199.0388.57138.661. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50AVX-512 OffAVX-512 On1530456075SE +/- 0.20, N = 3SE +/- 0.41, N = 3SE +/- 0.28, N = 3SE +/- 0.31, N = 339.0649.2647.2266.18

Emerald RapidsSapphire RapidsOpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50AVX-512 OffAVX-512 On20406080100SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.36, N = 3SE +/- 0.22, N = 353.6072.0657.8485.64

Emerald RapidsSapphire RapidsOpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50AVX-512 OffAVX-512 On20406080100SE +/- 0.58, N = 3SE +/- 0.16, N = 3SE +/- 0.39, N = 3SE +/- 0.75, N = 366.0897.8568.42103.28

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUAVX-512 OffAVX-512 On5001000150020002500SE +/- 54.32, N = 15SE +/- 31.97, N = 15SE +/- 6.64, N = 3SE +/- 10.80, N = 32341.141331.74896.53830.961. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OffAVX-512 On50100150200250SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 324.67236.1421.89218.621. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OffAVX-512 On30060090012001500SE +/- 2.43, N = 3SE +/- 0.03, N = 3SE +/- 4.01, N = 3SE +/- 0.07, N = 31287.11135.371355.33137.091. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OffAVX-512 On160320480640800SE +/- 0.31, N = 3SE +/- 0.82, N = 3SE +/- 0.08, N = 3SE +/- 0.66, N = 3238.22747.49214.94637.731. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OffAVX-512 On306090120150SE +/- 0.18, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3134.1842.76139.4447.011. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OffAVX-512 On160320480640800SE +/- 0.09, N = 3SE +/- 1.03, N = 3SE +/- 0.26, N = 3SE +/- 0.38, N = 3237.77747.77213.06637.781. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OffAVX-512 On306090120150SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.18, N = 3SE +/- 0.03, N = 3134.4442.74140.6447.001. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OffAVX-512 On10002000300040005000SE +/- 1.56, N = 3SE +/- 1.59, N = 3SE +/- 0.53, N = 3SE +/- 1.67, N = 32116.644765.951749.983887.301. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OffAVX-512 On48121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 315.106.6917.127.701. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On120240360480600SE +/- 0.10, N = 3SE +/- 0.53, N = 3SE +/- 0.06, N = 3SE +/- 2.15, N = 394.57538.2785.02412.861. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On80160240320400SE +/- 0.38, N = 3SE +/- 0.24, N = 3SE +/- 0.22, N = 3SE +/- 1.52, N = 3337.69237.36352.08290.151. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUAVX-512 OffAVX-512 On4K8K12K16K20KSE +/- 2.90, N = 3SE +/- 5.71, N = 3SE +/- 3.57, N = 3SE +/- 24.24, N = 38208.8818384.836859.8614944.261. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUAVX-512 OffAVX-512 On48121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 315.586.9517.468.021. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUAVX-512 OffAVX-512 On400800120016002000SE +/- 0.98, N = 3SE +/- 1.06, N = 3SE +/- 0.24, N = 3SE +/- 1.66, N = 31159.931884.99923.471528.731. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUAVX-512 OffAVX-512 On816243240SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 327.5616.9532.4519.601. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On2K4K6K8K10KSE +/- 0.66, N = 3SE +/- 8.09, N = 3SE +/- 3.58, N = 3SE +/- 11.71, N = 34277.318921.973688.325759.711. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 329.9114.3332.5120.731. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OffAVX-512 On7K14K21K28K35KSE +/- 0.50, N = 3SE +/- 24.15, N = 3SE +/- 2.47, N = 3SE +/- 29.04, N = 32822.7132234.042463.9826917.011. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OffAVX-512 On1122334455SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 345.323.9348.484.381. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On5K10K15K20K25KSE +/- 8.51, N = 3SE +/- 20.85, N = 3SE +/- 9.76, N = 3SE +/- 25.65, N = 315101.2924769.1013852.8820980.691. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On246810SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.465.158.655.711. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On5001000150020002500SE +/- 0.40, N = 3SE +/- 2.91, N = 3SE +/- 0.20, N = 3SE +/- 1.25, N = 31882.972389.551645.031894.491. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On1632486480SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 367.9353.5172.8963.271. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OffAVX-512 On2004006008001000SE +/- 0.41, N = 3SE +/- 1.48, N = 3SE +/- 1.82, N = 3SE +/- 1.33, N = 3276.631097.82212.63742.511. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OffAVX-512 On306090120150SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 1.21, N = 3SE +/- 0.08, N = 3115.3529.07140.6840.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On11K22K33K44K55KSE +/- 3.35, N = 3SE +/- 80.54, N = 3SE +/- 3.89, N = 3SE +/- 54.78, N = 39439.9249132.998773.9847113.291. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 313.552.4413.662.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OffAVX-512 On2K4K6K8K10KSE +/- 0.72, N = 3SE +/- 3.93, N = 3SE +/- 1.81, N = 3SE +/- 6.80, N = 32988.9110161.312564.097556.521. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OffAVX-512 On48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.6912.5511.6915.761. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUAVX-512 OffAVX-512 On11002200330044005500SE +/- 10.88, N = 7SE +/- 0.84, N = 3SE +/- 10.21, N = 15SE +/- 26.63, N = 31166.185105.951075.654381.751. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUAVX-512 OffAVX-512 On20406080100SE +/- 1.00, N = 7SE +/- 0.01, N = 3SE +/- 1.05, N = 15SE +/- 0.15, N = 3109.7524.87111.6327.331. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OffAVX-512 On30K60K90K120K150KSE +/- 52.05, N = 3SE +/- 898.88, N = 3SE +/- 51.71, N = 3SE +/- 735.86, N = 363672.97114369.6665805.38144820.351. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OffAVX-512 On0.39380.78761.18141.57521.969SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.700.461.750.371. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On30K60K90K120K150KSE +/- 496.38, N = 3SE +/- 940.07, N = 3SE +/- 1069.55, N = 3SE +/- 547.74, N = 3101244.00123447.74129576.78151891.461. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OffAVX-512 On0.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.630.400.570.291. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: CrownAVX-512 OffAVX-512 On306090120150SE +/- 0.23, N = 7SE +/- 0.55, N = 7SE +/- 0.27, N = 6SE +/- 0.36, N = 6137.62151.47113.13124.09

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian DragonAVX-512 OffAVX-512 On4080120160200SE +/- 0.30, N = 7SE +/- 0.38, N = 8SE +/- 0.22, N = 7SE +/- 0.33, N = 7181.83200.90141.11152.54

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon ObjAVX-512 OffAVX-512 On4080120160200SE +/- 0.19, N = 4SE +/- 0.52, N = 4SE +/- 0.21, N = 3SE +/- 0.26, N = 4159.48174.38122.33131.85

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 2.0.0Benchmark: vklBenchmarkCPU ISPCAVX-512 OffAVX-512 On7001400210028003500SE +/- 6.36, N = 3SE +/- 30.78, N = 3SE +/- 4.04, N = 3SE +/- 3.84, N = 32365324521552705

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timeAVX-512 OffAVX-512 On1020304050SE +/- 0.07, N = 3SE +/- 0.19, N = 3SE +/- 0.09, N = 3SE +/- 0.20, N = 323.4142.6523.6635.89

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeAVX-512 OffAVX-512 On918273645SE +/- 0.16, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 320.9441.6122.5235.24

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timeAVX-512 OffAVX-512 On48121620SE +/- 0.07557, N = 15SE +/- 0.20744, N = 4SE +/- 0.13104, N = 15SE +/- 0.14483, N = 1510.5667417.040909.2252115.08320

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On2004006008001000SE +/- 2.03, N = 3SE +/- 1.20, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 38937491035889

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On30060090012001500SE +/- 6.17, N = 3SE +/- 0.88, N = 3SE +/- 1.67, N = 3SE +/- 0.00, N = 3105588612261062

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On4K8K12K16K20KSE +/- 152.25, N = 5SE +/- 96.72, N = 3SE +/- 24.36, N = 3SE +/- 14.66, N = 314257121251648614054

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On8K16K24K32K40KSE +/- 274.25, N = 3SE +/- 74.75, N = 3SE +/- 150.37, N = 3SE +/- 114.21, N = 333603238253830933026

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On4K8K12K16K20KSE +/- 155.10, N = 7SE +/- 108.89, N = 15SE +/- 18.90, N = 3SE +/- 52.30, N = 317151143451960416796

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On10K20K30K40K50KSE +/- 82.99, N = 3SE +/- 167.00, N = 3SE +/- 123.86, N = 3SE +/- 213.70, N = 338787327234446038483

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On60120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3225189263226

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On70140210280350SE +/- 2.41, N = 15SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3279223309268

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On9001800270036004500SE +/- 23.43, N = 3SE +/- 7.00, N = 3SE +/- 5.17, N = 3SE +/- 1.86, N = 33585301741663545

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On2K4K6K8K10KSE +/- 60.01, N = 3SE +/- 27.24, N = 3SE +/- 8.69, N = 3SE +/- 17.64, N = 37218605383267089

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On11002200330044005500SE +/- 19.40, N = 3SE +/- 10.48, N = 3SE +/- 8.21, N = 3SE +/- 2.40, N = 34242357449174231

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OffAVX-512 On2K4K6K8K10KSE +/- 101.69, N = 3SE +/- 43.03, N = 3SE +/- 37.24, N = 3SE +/- 11.85, N = 38596714699348457

CPU Peak Freq (Highest CPU Core Frequency) Monitor

Emerald RapidsSapphire RapidsOpenBenchmarking.orgMegahertzCPU Peak Freq (Highest CPU Core Frequency) MonitorPhoronix Test Suite System MonitoringAVX-512 OffAVX-512 On10002000300040005000Min: 500 / Avg: 3017.93 / Max: 3917Min: 800 / Avg: 2954.57 / Max: 3913Min: 800 / Avg: 2870.28 / Max: 5743Min: 500 / Avg: 2841.09 / Max: 3519

CPU Power Consumption Monitor

Emerald RapidsSapphire RapidsOpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringAVX-512 OffAVX-512 On160320480640800Min: 103.63 / Avg: 583.08 / Max: 772.16Min: 99.6 / Avg: 589.71 / Max: 894.8Min: 67.03 / Avg: 598.76 / Max: 762.66Min: 72.02 / Avg: 574.93 / Max: 770.09

CPU Temperature Monitor

Emerald RapidsSapphire RapidsOpenBenchmarking.orgCelsiusCPU Temperature MonitorPhoronix Test Suite System MonitoringAVX-512 OffAVX-512 On1326395265Min: 29 / Avg: 53.19 / Max: 66Min: 30 / Avg: 54 / Max: 64Min: 25 / Avg: 51.04 / Max: 61Min: 22 / Avg: 48.5 / Max: 59

67 Results Shown

Y-Cruncher:
  5B
  500M
  1B
miniBUDE:
  OpenMP - BM1:
    GFInst/s
    Billion Interactions/s
  OpenMP - BM2:
    GFInst/s
    Billion Interactions/s
TensorFlow:
  CPU - 16 - ResNet-50
  CPU - 32 - ResNet-50
  CPU - 64 - ResNet-50
oneDNN
OpenVINO:
  Face Detection FP16 - CPU:
    FPS
    ms
  Person Detection FP16 - CPU:
    FPS
    ms
  Person Detection FP32 - CPU:
    FPS
    ms
  Vehicle Detection FP16 - CPU:
    FPS
    ms
  Face Detection FP16-INT8 - CPU:
    FPS
    ms
  Face Detection Retail FP16 - CPU:
    FPS
    ms
  Road Segmentation ADAS FP16 - CPU:
    FPS
    ms
  Vehicle Detection FP16-INT8 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16 - CPU:
    FPS
    ms
  Face Detection Retail FP16-INT8 - CPU:
    FPS
    ms
  Road Segmentation ADAS FP16-INT8 - CPU:
    FPS
    ms
  Machine Translation EN To DE FP16 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16-INT8 - CPU:
    FPS
    ms
  Person Vehicle Bike Detection FP16 - CPU:
    FPS
    ms
  Handwritten English Recognition FP16 - CPU:
    FPS
    ms
  Age Gender Recognition Retail 0013 FP16 - CPU:
    FPS
    ms
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
    FPS
    ms
Embree:
  Pathtracer ISPC - Crown
  Pathtracer ISPC - Asian Dragon
  Pathtracer ISPC - Asian Dragon Obj
OpenVKL
OSPRay:
  gravity_spheres_volume/dim_512/ao/real_time
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/pathtracer/real_time
OSPRay Studio:
  1 - 4K - 1 - Path Tracer - CPU
  3 - 4K - 1 - Path Tracer - CPU
  1 - 4K - 16 - Path Tracer - CPU
  1 - 4K - 32 - Path Tracer - CPU
  3 - 4K - 16 - Path Tracer - CPU
  3 - 4K - 32 - Path Tracer - CPU
  1 - 1080p - 1 - Path Tracer - CPU
  3 - 1080p - 1 - Path Tracer - CPU
  1 - 1080p - 16 - Path Tracer - CPU
  1 - 1080p - 32 - Path Tracer - CPU
  3 - 1080p - 16 - Path Tracer - CPU
  3 - 1080p - 32 - Path Tracer - CPU
CPU Peak Freq (Highest CPU Core Frequency) Monitor:
  Phoronix Test Suite System Monitoring:
    Megahertz
    Watts
    Celsius