Intel 5th Gen Xeon AVX-512 Comparison

2 x INTEL XEON PLATINUM 8592 AVX-512 Emerald Rapids vs. Sapphire Rapids by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312146-NE-INTELAVX523
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Comparison
Transpose Comparison

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Sapphire Rapids: AVX-512 On
December 02 2023
  3 Hours, 51 Minutes
Sapphire Rapids: AVX-512 Off
December 03 2023
  4 Hours, 20 Minutes
Emerald Rapids: AVX-512 On
December 11 2023
  4 Hours, 22 Minutes
Emerald Rapids: AVX-512 Off
December 11 2023
  4 Hours, 57 Minutes
Invert Behavior (Only Show Selected Data)
  4 Hours, 23 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen ResolutionSapphire RapidsEmerald Rapids AVX-512 On AVX-512 Off AVX-512 On AVX-512 Off2 x Intel Xeon Platinum 8490H @ 3.50GHz (120 Cores / 240 Threads)Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS)Intel Device 1bce1008GB3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Intel X710 for 10GBASE-TUbuntu 23.106.5.0-13-generic (x86_64)GCC 13.2.0ext41920x10802 x INTEL XEON PLATINUM 8592+ @ 3.90GHz (128 Cores / 256 Threads)Quanta Cloud S6Q-MB-MPS (3B05.TEL4P1 BIOS)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Sapphire Rapids: AVX-512 On: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004d0- Sapphire Rapids: AVX-512 Off: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004d0- Emerald Rapids: AVX-512 On: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x21000161- Emerald Rapids: AVX-512 Off: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x21000161Python Details- Python 3.11.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

openvino: Face Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Vehicle Detection FP16 - CPUminibude: OpenMP - BM1minibude: OpenMP - BM1openvino: Face Detection Retail FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUminibude: OpenMP - BM2minibude: OpenMP - BM2openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUospray: gravity_spheres_volume/dim_512/scivis/real_timeopenvino: Road Segmentation ADAS FP16 - CPUospray: gravity_spheres_volume/dim_512/pathtracer/real_timeospray: gravity_spheres_volume/dim_512/ao/real_timeopenvino: Face Detection Retail FP16-INT8 - CPUtensorflow: CPU - 16 - ResNet-50openvino: Face Detection Retail FP16-INT8 - CPUospray-studio: 1 - 4K - 32 - Path Tracer - CPUtensorflow: CPU - 32 - ResNet-50tensorflow: CPU - 64 - ResNet-50openvkl: vklBenchmarkCPU ISPCopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer ISPC - Asian Dragonospray-studio: 1 - 1080p - 1 - Path Tracer - CPUospray-studio: 3 - 1080p - 32 - Path Tracer - CPUospray-studio: 3 - 1080p - 1 - Path Tracer - CPUospray-studio: 3 - 4K - 1 - Path Tracer - CPUospray-studio: 1 - 4K - 1 - Path Tracer - CPUospray-studio: 1 - 1080p - 16 - Path Tracer - CPUospray-studio: 3 - 1080p - 16 - Path Tracer - CPUospray-studio: 1 - 1080p - 32 - Path Tracer - CPUospray-studio: 3 - 4K - 16 - Path Tracer - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUospray-studio: 1 - 4K - 16 - Path Tracer - CPUospray-studio: 3 - 4K - 32 - Path Tracer - CPUy-cruncher: 500My-cruncher: 5Bembree: Pathtracer ISPC - Crowny-cruncher: 1Bonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUSapphire RapidsEmerald Rapids AVX-512 On AVX-512 Off AVX-512 On AVX-512 Off412.8647113.292.53742.5140.194381.750.3727.337556.52637.78637.7347.0047.013887.3014944.267.703418.575136.7438.025759.71144820.3520.73138.6563466.3940.291528.7326917.014.38218.62137.0935.243219.6015.083235.888220980.6966.185.713302685.64103.282705151891.46290.1515.761894.49131.8466152.5351226845726810628893545423170891679663.2714054384832.29328.604124.08634.914830.95985.028773.9813.66212.63140.681075.651.75111.632564.09213.06214.94140.64139.441749.986859.8617.122090.44583.61817.463688.3265805.3832.5188.5672214.1820.57923.472463.9848.4821.891355.3322.521132.459.2252123.658513852.8847.228.653830957.8468.422155129576.78352.0811.691645.03122.3348141.11112639934309122610354166491783261960472.8916486444602.90736.637113.13006.338896.529538.2749132.992.441097.8229.075105.950.4624.8710161.31747.77747.4942.7442.764765.9518384.836.695313.366212.5356.958921.97114369.6614.33199.0284975.6950.41884.9932234.043.93236.14135.3741.610916.9517.040942.654524769.1049.265.152382572.0697.853245123447.74237.3612.552389.55174.3843200.902918971462238867493017357460531434553.5112125327232.75427.058151.46675.2331331.7494.579439.9213.55276.63115.351166.181.7109.752988.91237.77238.22134.44134.182116.648208.8815.103392.568135.70315.584277.3163672.9729.91124.4803112.0050.631159.932822.7145.3224.671287.1120.937427.5610.5667423.410015101.2939.068.463360353.6066.082365101244.00337.6910.691882.97159.4832181.8347225859627910558933585424272181715167.9314257387873.10933.680137.62276.5402341.14OpenBenchmarking.org

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off120240360480600SE +/- 0.53, N = 3SE +/- 0.10, N = 3SE +/- 2.15, N = 3SE +/- 0.06, N = 3538.2794.57412.8685.021. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off11K22K33K44K55KSE +/- 80.54, N = 3SE +/- 3.35, N = 3SE +/- 54.78, N = 3SE +/- 3.89, N = 349132.999439.9247113.298773.981. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.4413.552.5313.661. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OnAVX-512 Off2004006008001000SE +/- 1.48, N = 3SE +/- 0.41, N = 3SE +/- 1.33, N = 3SE +/- 1.82, N = 31097.82276.63742.51212.631. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OnAVX-512 Off306090120150SE +/- 0.04, N = 3SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 1.21, N = 329.07115.3540.19140.681. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUAVX-512 OnAVX-512 Off11002200330044005500SE +/- 0.84, N = 3SE +/- 10.88, N = 7SE +/- 26.63, N = 3SE +/- 10.21, N = 155105.951166.184381.751075.651. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OnAVX-512 Off0.39380.78761.18141.57521.969SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.371.750.461.701. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -std=c++11 -fPIC -fvisibility=hidden -mavx2 -mfma -MD -MT -MF

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUAVX-512 OnAVX-512 Off20406080100SE +/- 0.01, N = 3SE +/- 1.00, N = 7SE +/- 0.15, N = 3SE +/- 1.05, N = 1524.87109.7527.33111.631. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 3.93, N = 3SE +/- 0.72, N = 3SE +/- 6.80, N = 3SE +/- 1.81, N = 310161.312988.917556.522564.091. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OnAVX-512 Off160320480640800SE +/- 1.03, N = 3SE +/- 0.09, N = 3SE +/- 0.38, N = 3SE +/- 0.26, N = 3747.77237.77637.78213.061. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off160320480640800SE +/- 0.82, N = 3SE +/- 0.31, N = 3SE +/- 0.66, N = 3SE +/- 0.08, N = 3747.49238.22637.73214.941. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OnAVX-512 Off306090120150SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.18, N = 342.74134.4447.00140.641. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off306090120150SE +/- 0.04, N = 3SE +/- 0.18, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 342.76134.1847.01139.441. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off10002000300040005000SE +/- 1.59, N = 3SE +/- 1.56, N = 3SE +/- 1.67, N = 3SE +/- 0.53, N = 34765.952116.643887.301749.981. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUAVX-512 OnAVX-512 Off4K8K12K16K20KSE +/- 5.71, N = 3SE +/- 2.90, N = 3SE +/- 24.24, N = 3SE +/- 3.57, N = 318384.838208.8814944.266859.861. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off48121620SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 36.6915.107.7017.121. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OnAVX-512 Off11002200330044005500SE +/- 3.87, N = 8SE +/- 0.74, N = 7SE +/- 12.90, N = 7SE +/- 17.21, N = 55313.373392.573418.582090.451. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Emerald RapidsSapphire RapidsOpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OnAVX-512 Off50100150200250SE +/- 0.15, N = 8SE +/- 0.03, N = 7SE +/- 0.52, N = 7SE +/- 0.69, N = 5212.54135.70136.7483.621. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUAVX-512 OnAVX-512 Off48121620SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.9515.588.0217.461. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 8.09, N = 3SE +/- 0.66, N = 3SE +/- 11.71, N = 3SE +/- 3.58, N = 38921.974277.315759.713688.321. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OnAVX-512 Off30K60K90K120K150KSE +/- 735.86, N = 3SE +/- 51.71, N = 3SE +/- 898.88, N = 3SE +/- 52.05, N = 3144820.3565805.38114369.6663672.971. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 314.3329.9120.7332.511. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OnAVX-512 Off4080120160200SE +/- 1.98, N = 15SE +/- 0.97, N = 15SE +/- 0.71, N = 3SE +/- 0.22, N = 3199.03124.48138.6688.571. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Emerald RapidsSapphire RapidsOpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OnAVX-512 Off11002200330044005500SE +/- 49.58, N = 15SE +/- 24.22, N = 15SE +/- 17.85, N = 3SE +/- 5.42, N = 34975.703112.013466.392214.181. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off0.14180.28360.42540.56720.709SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.290.570.400.631. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUAVX-512 OnAVX-512 Off400800120016002000SE +/- 1.06, N = 3SE +/- 0.98, N = 3SE +/- 1.66, N = 3SE +/- 0.24, N = 31884.991159.931528.73923.471. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off7K14K21K28K35KSE +/- 24.15, N = 3SE +/- 0.50, N = 3SE +/- 29.04, N = 3SE +/- 2.47, N = 332234.042822.7126917.012463.981. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off1122334455SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 33.9345.324.3848.481. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off50100150200250SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3236.1424.67218.6221.891. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off30060090012001500SE +/- 0.03, N = 3SE +/- 2.43, N = 3SE +/- 0.07, N = 3SE +/- 4.01, N = 3135.371287.11137.091355.331. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeAVX-512 OnAVX-512 Off918273645SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 341.6120.9435.2422.52

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUAVX-512 OnAVX-512 Off816243240SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 316.9527.5619.6032.451. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timeAVX-512 OnAVX-512 Off48121620SE +/- 0.20744, N = 4SE +/- 0.07557, N = 15SE +/- 0.14483, N = 15SE +/- 0.13104, N = 1517.0409010.5667415.083209.22521

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timeAVX-512 OnAVX-512 Off1020304050SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.20, N = 3SE +/- 0.09, N = 342.6523.4135.8923.66

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off5K10K15K20K25KSE +/- 20.85, N = 3SE +/- 8.51, N = 3SE +/- 25.65, N = 3SE +/- 9.76, N = 324769.1015101.2920980.6913852.881. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50AVX-512 OnAVX-512 Off1530456075SE +/- 0.31, N = 3SE +/- 0.28, N = 3SE +/- 0.41, N = 3SE +/- 0.20, N = 366.1847.2249.2639.06

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.158.465.718.651. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off8K16K24K32K40KSE +/- 74.75, N = 3SE +/- 274.25, N = 3SE +/- 114.21, N = 3SE +/- 150.37, N = 323825336033302638309

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50AVX-512 OnAVX-512 Off20406080100SE +/- 0.22, N = 3SE +/- 0.36, N = 3SE +/- 0.13, N = 3SE +/- 0.14, N = 385.6457.8472.0653.60

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50AVX-512 OnAVX-512 Off20406080100SE +/- 0.75, N = 3SE +/- 0.39, N = 3SE +/- 0.16, N = 3SE +/- 0.58, N = 3103.2868.4297.8566.08

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 2.0.0Benchmark: vklBenchmarkCPU ISPCAVX-512 OnAVX-512 Off7001400210028003500SE +/- 30.78, N = 3SE +/- 6.36, N = 3SE +/- 3.84, N = 3SE +/- 4.04, N = 33245236527052155

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off30K60K90K120K150KSE +/- 547.74, N = 3SE +/- 1069.55, N = 3SE +/- 940.07, N = 3SE +/- 496.38, N = 3151891.46129576.78123447.74101244.001. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off80160240320400SE +/- 0.24, N = 3SE +/- 0.38, N = 3SE +/- 1.52, N = 3SE +/- 0.22, N = 3237.36337.69290.15352.081. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OffAVX-512 On48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.6912.5511.6915.761. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off5001000150020002500SE +/- 2.91, N = 3SE +/- 0.40, N = 3SE +/- 1.25, N = 3SE +/- 0.20, N = 32389.551882.971894.491645.031. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon ObjAVX-512 OnAVX-512 Off4080120160200SE +/- 0.52, N = 4SE +/- 0.19, N = 4SE +/- 0.26, N = 4SE +/- 0.21, N = 3174.38159.48131.85122.33

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian DragonAVX-512 OnAVX-512 Off4080120160200SE +/- 0.38, N = 8SE +/- 0.30, N = 7SE +/- 0.33, N = 7SE +/- 0.22, N = 7200.90181.83152.54141.11

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off60120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3189225226263

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 43.03, N = 3SE +/- 101.69, N = 3SE +/- 11.85, N = 3SE +/- 37.24, N = 37146859684579934

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off70140210280350SE +/- 0.00, N = 3SE +/- 2.41, N = 15SE +/- 0.00, N = 3SE +/- 0.33, N = 3223279268309

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off30060090012001500SE +/- 0.88, N = 3SE +/- 6.17, N = 3SE +/- 0.00, N = 3SE +/- 1.67, N = 3886105510621226

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off2004006008001000SE +/- 1.20, N = 3SE +/- 2.03, N = 3SE +/- 0.67, N = 3SE +/- 0.58, N = 37498938891035

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off9001800270036004500SE +/- 7.00, N = 3SE +/- 23.43, N = 3SE +/- 1.86, N = 3SE +/- 5.17, N = 33017358535454166

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off11002200330044005500SE +/- 10.48, N = 3SE +/- 19.40, N = 3SE +/- 2.40, N = 3SE +/- 8.21, N = 33574424242314917

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 27.24, N = 3SE +/- 60.01, N = 3SE +/- 17.64, N = 3SE +/- 8.69, N = 36053721870898326

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off4K8K12K16K20KSE +/- 108.89, N = 15SE +/- 155.10, N = 7SE +/- 52.30, N = 3SE +/- 18.90, N = 314345171511679619604

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off1632486480SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 353.5167.9363.2772.891. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off4K8K12K16K20KSE +/- 96.72, N = 3SE +/- 152.25, N = 5SE +/- 14.66, N = 3SE +/- 24.36, N = 312125142571405416486

Emerald RapidsSapphire RapidsOpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.13Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUAVX-512 OnAVX-512 Off10K20K30K40K50KSE +/- 167.00, N = 3SE +/- 82.99, N = 3SE +/- 213.70, N = 3SE +/- 123.86, N = 332723387873848344460

Y-Cruncher

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 500MAVX-512 OnAVX-512 Off0.69951.3992.09852.7983.4975SE +/- 0.003, N = 7SE +/- 0.005, N = 7SE +/- 0.022, N = 7SE +/- 0.017, N = 72.2932.9072.7543.109

Emerald RapidsSapphire RapidsOpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 5BAVX-512 OnAVX-512 Off816243240SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 327.0633.6828.6036.64

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Emerald RapidsSapphire RapidsOpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: CrownAVX-512 OnAVX-512 Off306090120150SE +/- 0.55, N = 7SE +/- 0.23, N = 7SE +/- 0.36, N = 6SE +/- 0.27, N = 6151.47137.62124.09113.13

Y-Cruncher

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 1BAVX-512 OnAVX-512 Off246810SE +/- 0.003, N = 5SE +/- 0.007, N = 5SE +/- 0.021, N = 5SE +/- 0.026, N = 54.9146.3385.2336.540

CPU Temperature Monitor

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgCelsiusCPU Temperature MonitorPhoronix Test Suite System MonitoringAVX-512 OnAVX-512 Off1326395265Min: 22 / Avg: 48.5 / Max: 59Min: 25 / Avg: 51.04 / Max: 61Min: 29 / Avg: 53.19 / Max: 66Min: 30 / Avg: 54 / Max: 64

CPU Power Consumption Monitor

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringAVX-512 OnAVX-512 Off160320480640800Min: 72.02 / Avg: 574.93 / Max: 770.09Min: 67.03 / Avg: 598.76 / Max: 762.66Min: 103.63 / Avg: 583.08 / Max: 772.16Min: 99.6 / Avg: 589.71 / Max: 894.8

CPU Peak Freq (Highest CPU Core Frequency) Monitor

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgMegahertzCPU Peak Freq (Highest CPU Core Frequency) MonitorPhoronix Test Suite System MonitoringAVX-512 OnAVX-512 Off10002000300040005000Min: 500 / Avg: 2841.09 / Max: 3519Min: 800 / Avg: 2870.28 / Max: 5743Min: 800 / Avg: 2954.57 / Max: 3913Min: 500 / Avg: 3017.93 / Max: 3917

oneDNN

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgCelsius, Fewer Is BetteroneDNN 3.3CPU Temperature MonitorAVX-512 OnAVX-512 Off1224364860Min: 36 / Avg: 47.82 / Max: 52Min: 38 / Avg: 50.66 / Max: 54Min: 35 / Avg: 52.1 / Max: 59Min: 35 / Avg: 52.63 / Max: 57

Emerald RapidsSapphire RapidsOpenBenchmarking.orgWatts, Fewer Is BetteroneDNN 3.3CPU Power Consumption MonitorAVX-512 OnAVX-512 Off130260390520650Min: 206.56 / Avg: 535.79 / Max: 754.55Min: 104.84 / Avg: 561.81 / Max: 756.79Min: 123.9 / Avg: 567.66 / Max: 734.19Min: 131.29 / Avg: 591.96 / Max: 755.76

Emerald RapidsSapphire RapidsOpenBenchmarking.orgMegahertz, More Is BetteroneDNN 3.3CPU Peak Freq (Highest CPU Core Frequency) MonitorAVX-512 OnAVX-512 Off7001400210028003500Min: 800 / Avg: 2962.65 / Max: 3907Min: 800 / Avg: 2951.28 / Max: 3906Min: 800 / Avg: 2905.96 / Max: 3509Min: 800 / Avg: 2857.85 / Max: 3512

Sapphire RapidsEmerald RapidsOpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.3Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUAVX-512 OnAVX-512 Off5001000150020002500SE +/- 10.80, N = 3SE +/- 6.64, N = 3SE +/- 31.97, N = 15SE +/- 54.32, N = 15830.96896.531331.742341.141. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

70 Results Shown

OpenVINO:
  Face Detection FP16-INT8 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Machine Translation EN To DE FP16 - CPU
  Machine Translation EN To DE FP16 - CPU
  Handwritten English Recognition FP16 - CPU
  Age Gender Recognition Retail 0013 FP16 - CPU
  Handwritten English Recognition FP16 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Person Detection FP32 - CPU
  Person Detection FP16 - CPU
  Person Detection FP32 - CPU
  Person Detection FP16 - CPU
  Vehicle Detection FP16 - CPU
  Face Detection Retail FP16 - CPU
  Vehicle Detection FP16 - CPU
miniBUDE:
  OpenMP - BM1:
    GFInst/s
    Billion Interactions/s
OpenVINO:
  Face Detection Retail FP16 - CPU
  Vehicle Detection FP16-INT8 - CPU
  Age Gender Recognition Retail 0013 FP16 - CPU
  Vehicle Detection FP16-INT8 - CPU
miniBUDE:
  OpenMP - BM2:
    Billion Interactions/s
    GFInst/s
OpenVINO:
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
  Road Segmentation ADAS FP16 - CPU
  Weld Porosity Detection FP16 - CPU
  Weld Porosity Detection FP16 - CPU
  Face Detection FP16 - CPU
  Face Detection FP16 - CPU
OSPRay
OpenVINO
OSPRay:
  gravity_spheres_volume/dim_512/pathtracer/real_time
  gravity_spheres_volume/dim_512/ao/real_time
OpenVINO
TensorFlow
OpenVINO
OSPRay Studio
TensorFlow:
  CPU - 32 - ResNet-50
  CPU - 64 - ResNet-50
OpenVKL
OpenVINO:
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
  Face Detection FP16-INT8 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Road Segmentation ADAS FP16-INT8 - CPU
Embree:
  Pathtracer ISPC - Asian Dragon Obj
  Pathtracer ISPC - Asian Dragon
OSPRay Studio:
  1 - 1080p - 1 - Path Tracer - CPU
  3 - 1080p - 32 - Path Tracer - CPU
  3 - 1080p - 1 - Path Tracer - CPU
  3 - 4K - 1 - Path Tracer - CPU
  1 - 4K - 1 - Path Tracer - CPU
  1 - 1080p - 16 - Path Tracer - CPU
  3 - 1080p - 16 - Path Tracer - CPU
  1 - 1080p - 32 - Path Tracer - CPU
  3 - 4K - 16 - Path Tracer - CPU
OpenVINO
OSPRay Studio:
  1 - 4K - 16 - Path Tracer - CPU
  3 - 4K - 32 - Path Tracer - CPU
Y-Cruncher:
  500M
  5B
Embree
Y-Cruncher
CPU Temperature Monitor:
  Phoronix Test Suite System Monitoring:
    Celsius
    Watts
    Megahertz
  CPU Temp Monitor:
    Celsius
  CPU Power Consumption Monitor:
    Watts
  CPU Peak Freq (Highest CPU Core Frequency) Monitor:
    Megahertz
oneDNN