Cascade Lake Clear Linux 2021
2 x Intel Xeon Platinum 8280 testing with a GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) and llvmpipe on Clear Linux OS 34420 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2103266-IB-CASCADELA26 .
Cascade Lake Clear Linux 2021: system details for Clear Linux 34420

Processor: 2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads)
Motherboard: GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 378GB
Disk: 280GB INTEL SSDPED1D280GA
Graphics: llvmpipe
Monitor: VE228
Network: 2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE
OS: Clear Linux OS 34420
Kernel: 5.10.19-1032.native (x86_64)
Desktop: GNOME Shell 3.38.4
Display Server: X Server 1.20.10
OpenGL: 4.5 Mesa 20.3.4 (LLVM 10.0.1 256 bits)
Compiler: GCC 10.2.1 20210324 releases/gcc-10.2.0-1013-g592388d4f6 + Clang 10.0.1 + LLVM 10.0.1
File-System: ext4
Screen Resolution: 1920x1080

- Transparent Huge Pages: always
- FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags -Wa,-mbranches-within-32B-boundaries"
  CXXFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -Wa,-mbranches-within-32B-boundaries -fvisibility-inlines-hidden -Wl,--enable-new-dtags"
  MESA_GLSL_CACHE_DISABLE=0
  FCFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags"
  CFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -Wa,-mbranches-within-32B-boundaries"
  THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx""
- --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell
- Scaling Governor: intel_pstate performance
- CPU Microcode: 0x5003006
- Python 3.9.2
- itlb_multihit: KVM: Mitigation of VMX disabled
  + l1tf: Not affected
  + mds: Not affected
  + meltdown: Not affected
  + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
  + srbds: Not affected
  + tsx_async_abort: Mitigation of TSX disabled

Cascade Lake Clear Linux 2021: results for Clear Linux 34420

npb: EP.D = 6759.15
npb: LU.C = 125567.33
rodinia: OpenMP LavaMD = 53.547
rodinia: OpenMP HotSpot3D = 111.145
rodinia: OpenMP Leukocyte = 52.679
rodinia: OpenMP CFD Solver = 6.541
rodinia: OpenMP Streamcluster = 12.901
namd: ATPase Simulation - 327,506 Atoms = 0.37170
incompact3d: X3D-benchmarking input.i3d = 458.174825
incompact3d: input.i3d 129 Cells Per Direction = 4.48924637
incompact3d: input.i3d 193 Cells Per Direction = 19.3331966
simdjson: Kostya = 2.52
simdjson: LargeRand = 0.88
simdjson: PartialTweets = 4.01
simdjson: DistinctUserID = 4.21
compress-lz4: 3 - Compression Speed = 48.00
compress-lz4: 3 - Decompression Speed = 6096.9
compress-lz4: 9 - Compression Speed = 46.81
compress-lz4: 9 - Decompression Speed = 6135.4
compress-zstd: 19 - Compression Speed = 84.3
compress-zstd: 19 - Decompression Speed = 1889.2
compress-zstd: 8, Long Mode - Compression Speed = 834.8
compress-zstd: 19, Long Mode - Compression Speed = 38.9
compress-zstd: 19, Long Mode - Decompression Speed = 2087.9
dav1d: Chimera 1080p = 1038.26
dav1d: Summer Nature 4K = 396.46
dav1d: Summer Nature 1080p = 1010.38
dav1d: Chimera 1080p 10-bit = 240.09
ospray: San Miguel - Path Tracer = 6.62
ospray: XFrog Forest - Path Tracer = 6.67
ospray: NASA Streamlines - Path Tracer = 18.52
ospray: Magnetic Reconnection - Path Tracer = 500
embree: Pathtracer - Crown = 51.3869
embree: Pathtracer ISPC - Crown = 54.7089
embree: Pathtracer - Asian Dragon = 61.2117
embree: Pathtracer ISPC - Asian Dragon = 70.4820
svt-av1: Enc Mode 0 - 1080p = 0.144
svt-av1: Enc Mode 4 - 1080p = 9.009
svt-av1: Enc Mode 8 - 1080p = 74.632
svt-hevc: 1 - Bosphorus 1080p = 30.09
svt-hevc: 7 - Bosphorus 1080p = 270.36
svt-hevc: 10 - Bosphorus 1080p = 480.50
svt-vp9: VMAF Optimized - Bosphorus 1080p = 348.26
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p = 356.90
svt-vp9: Visual Quality Optimized - Bosphorus 1080p = 291.24
oidn: Memorial = 37.44
openvkl: vklBenchmark = 511
stockfish: Total Time = 136190317
avifenc: 2 = 30.121
avifenc: 6 = 11.734
avifenc: 10 = 3.669
avifenc: 6, Lossless = 28.713
avifenc: 10, Lossless = 6.442
onednn: IP Shapes 1D - u8s8f32 - CPU = 1.71888
onednn: IP Shapes 3D - u8s8f32 - CPU = 1.038714
onednn: IP Shapes 1D - bf16bf16bf16 - CPU = 3.62375
onednn: IP Shapes 3D - bf16bf16bf16 - CPU = 2.11061
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU = 3.81893
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU = 0.383772
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU = 0.309397
onednn: Recurrent Neural Network Training - u8s8f32 - CPU = 900.339
onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU = 3.19591
onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU = 4.03906
onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU = 4.42830
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU = 476.656
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU = 895.445
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU = 477.715
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU = 0.330218
onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU = 0.833080
synthmark: VoiceMark_100 = 614.457
liquid-dsp: 1 - 256 - 57 = 58712333
liquid-dsp: 16 - 256 - 57 = 822863333
liquid-dsp: 32 - 256 - 57 = 1518633333
liquid-dsp: 64 - 256 - 57 = 2315900000
liquid-dsp: 112 - 256 - 57 = 2853633333
gromacs: water_GMX50_bare = 5.759
tensorflow-lite: SqueezeNet = 61863.5
tensorflow-lite: Inception V4 = 883114
tensorflow-lite: NASNet Mobile = 97445.7
tensorflow-lite: Mobilenet Float = 51318.3
tensorflow-lite: Mobilenet Quant = 49894.1
tensorflow-lite: Inception ResNet V2 = 811638
mnn: SqueezeNetV1.0 = 8.157
mnn: resnet-v2-50 = 25.733
mnn: MobileNetV2_224 = 4.750
mnn: mobilenet-v1-1.0 = 2.672
mnn: inception-v3 = 32.477
tnn: CPU - MobileNet v2 = 330.630
tnn: CPU - SqueezeNet v1.1 = 330.159
indigobench: CPU - Bedroom = 8.576
indigobench: CPU - Supercar = 20.202
blender: BMW27 - CPU-Only = 37.84
blender: Fishy Cat - CPU-Only = 55.71
v-ray: CPU = 49082

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
Test / Class: EP.D = 6759.15 (SE +/- 29.20, N = 3)
Test / Class: LU.C = 125567.33 (SE +/- 241.57, N = 3)
1. (F9X) gfortran options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. 3.2

Rodinia 3.1 (Seconds, Fewer Is Better)
Test: OpenMP LavaMD = 53.55 (SE +/- 0.10, N = 3)
Test: OpenMP HotSpot3D = 111.15 (SE +/- 0.17, N = 3)
Test: OpenMP Leukocyte = 52.68 (SE +/- 1.63, N = 15)
Test: OpenMP CFD Solver = 6.541 (SE +/- 0.127, N = 15)
Test: OpenMP Streamcluster = 12.90 (SE +/- 0.13, N = 5)
1. (CXX) g++ options: -O2 -lOpenCL

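The "SE +/-" and "N" figures attached to each result can be compared across tests more easily as a percentage of the mean. The sketch below does that for the Rodinia results, assuming SE denotes the standard error of the mean over N runs (so the run-to-run standard deviation is roughly SE * sqrt(N)). The function names are hypothetical helpers, not part of the Phoronix Test Suite.

```python
import math


def relative_se_percent(mean: float, se: float) -> float:
    """Express the standard error as a percentage of the mean."""
    return 100.0 * se / mean


def sample_std_dev(se: float, n: int) -> float:
    """Estimate run-to-run standard deviation, assuming SE = sd / sqrt(N)."""
    return se * math.sqrt(n)


# Rodinia 3.1 results copied from this file: (mean seconds, SE, N)
rodinia = {
    "OpenMP LavaMD": (53.55, 0.10, 3),
    "OpenMP HotSpot3D": (111.15, 0.17, 3),
    "OpenMP Leukocyte": (52.68, 1.63, 15),
    "OpenMP CFD Solver": (6.541, 0.127, 15),
    "OpenMP Streamcluster": (12.90, 0.13, 5),
}

for name, (mean, se, n) in rodinia.items():
    rel = relative_se_percent(mean, se)
    sd = sample_std_dev(se, n)
    print(f"{name}: SE is {rel:.2f}% of the mean, estimated sd {sd:.2f} s over {n} runs")
```

Leukocyte and CFD Solver show both the largest relative SE and the largest N, which would be consistent with extra runs being made for noisier tests, although this file does not state why N differs between tests.
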
NAMD 2.14 (days/ns, Fewer Is Better)
ATPase Simulation - 327,506 Atoms = 0.37170 (SE +/- 0.00145, N = 3)

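NAMD reports days/ns (fewer is better), while GROMACS later in this file reports Ns Per Day (more is better). The two units are reciprocals, so a small conversion helps when reading them side by side. This is a minimal sketch with a hypothetical helper name. The two tests use different inputs, so the converted figures are not a like-for-like comparison.

```python
def days_per_ns_to_ns_per_day(days_per_ns: float) -> float:
    """Convert a days/ns figure (fewer is better) to ns/day (more is better)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# NAMD 2.14, ATPase Simulation - 327,506 Atoms, value taken from this file
namd_days_per_ns = 0.37170
namd_ns_per_day = days_per_ns_to_ns_per_day(namd_days_per_ns)
print(f"NAMD ATPase: {namd_ns_per_day:.3f} ns/day")
```
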
Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
Input: X3D-benchmarking input.i3d = 458.17 (SE +/- 0.16, N = 3)
Input: input.i3d 129 Cells Per Direction = 4.48924637 (SE +/- 0.00386211, N = 3)
Input: input.i3d 193 Cells Per Direction = 19.33 (SE +/- 0.02, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

simdjson 0.8.2 (GB/s, More Is Better)
Throughput Test: Kostya = 2.52 (SE +/- 0.01, N = 3)
Throughput Test: LargeRandom = 0.88 (SE +/- 0.00, N = 3)
Throughput Test: PartialTweets = 4.01 (SE +/- 0.01, N = 3)
Throughput Test: DistinctUserID = 4.21 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread

LZ4 Compression 1.9.3 (MB/s, More Is Better)
Compression Level: 3 - Compression Speed = 48.00 (SE +/- 0.03, N = 3)
Compression Level: 3 - Decompression Speed = 6096.9 (SE +/- 177.02, N = 3)
Compression Level: 9 - Compression Speed = 46.81 (SE +/- 0.16, N = 3)
Compression Level: 9 - Decompression Speed = 6135.4 (SE +/- 83.53, N = 3)
1. (CC) gcc options: -O3

Zstd Compression 1.4.9 (MB/s, More Is Better)
Compression Level: 19 - Compression Speed = 84.3 (SE +/- 0.68, N = 15)
Compression Level: 19 - Decompression Speed = 1889.2 (SE +/- 1.54, N = 15)
Compression Level: 8, Long Mode - Compression Speed = 834.8 (SE +/- 2.10, N = 3)
Compression Level: 19, Long Mode - Compression Speed = 38.9 (SE +/- 0.43, N = 4)
Compression Level: 19, Long Mode - Decompression Speed = 2087.9 (SE +/- 2.64, N = 4)
1. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread -lz

dav1d 0.8.2 (FPS, More Is Better)
Video Input: Chimera 1080p = 1038.26 (SE +/- 8.00, N = 10; MIN: 367.3 / MAX: 1324.25)
Video Input: Summer Nature 4K = 396.46 (SE +/- 1.88, N = 3; MIN: 163.78 / MAX: 435.2)
Video Input: Summer Nature 1080p = 1010.38 (SE +/- 4.74, N = 3; MIN: 375.41 / MAX: 1131.64)
Video Input: Chimera 1080p 10-bit = 240.09 (SE +/- 0.21, N = 3; MIN: 170.42 / MAX: 370.95)
1. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread

OSPray 1.8.5 (FPS, More Is Better)
Demo: San Miguel - Renderer: Path Tracer = 6.62 (SE +/- 0.00, N = 3; MIN: 5.08 / MAX: 6.67)
Demo: XFrog Forest - Renderer: Path Tracer = 6.67 (SE +/- 0.00, N = 3; MIN: 5.38 / MAX: 6.71)
Demo: NASA Streamlines - Renderer: Path Tracer = 18.52 (SE +/- 0.00, N = 3; MIN: 14.29 / MAX: 19.23)
Demo: Magnetic Reconnection - Renderer: Path Tracer = 500 (MIN: 250)

Embree 3.9.0 (Frames Per Second, More Is Better)
Binary: Pathtracer - Model: Crown = 51.39 (SE +/- 0.28, N = 3; MIN: 48.91 / MAX: 53.74)
Binary: Pathtracer ISPC - Model: Crown = 54.71 (SE +/- 0.20, N = 3; MIN: 52.61 / MAX: 57.82)
Binary: Pathtracer - Model: Asian Dragon = 61.21 (SE +/- 0.42, N = 3; MIN: 59 / MAX: 63.14)
Binary: Pathtracer ISPC - Model: Asian Dragon = 70.48 (SE +/- 0.56, N = 3; MIN: 67.48 / MAX: 74.56)

SVT-AV1 0.8 (Frames Per Second, More Is Better)
Encoder Mode: Enc Mode 0 - Input: 1080p = 0.144 (SE +/- 0.001, N = 3)
Encoder Mode: Enc Mode 4 - Input: 1080p = 9.009 (SE +/- 0.032, N = 3)
Encoder Mode: Enc Mode 8 - Input: 1080p = 74.63 (SE +/- 0.78, N = 5)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Tuning: 1 - Input: Bosphorus 1080p = 30.09 (SE +/- 0.10, N = 3)
Tuning: 7 - Input: Bosphorus 1080p = 270.36 (SE +/- 2.46, N = 3)
Tuning: 10 - Input: Bosphorus 1080p = 480.50 (SE +/- 5.37, N = 3)
1. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9 0.3 (Frames Per Second, More Is Better)
Tuning: VMAF Optimized - Input: Bosphorus 1080p = 348.26 (SE +/- 2.43, N = 3)
Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p = 356.90 (SE +/- 1.54, N = 3)
Tuning: Visual Quality Optimized - Input: Bosphorus 1080p = 291.24 (SE +/- 1.15, N = 3)
1. (CC) gcc options: -O3 -fcommon -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Intel Open Image Denoise 1.2.0 (Images / Sec, More Is Better)
Scene: Memorial = 37.44 (SE +/- 0.28, N = 3)

OpenVKL 0.9 (Items / Sec, More Is Better)
Benchmark: vklBenchmark = 511 (SE +/- 2.91, N = 3; MIN: 1 / MAX: 1812)

Stockfish 13 (Nodes Per Second, More Is Better)
Total Time = 136190317 (SE +/- 1037933.70, N = 15)
1. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -pipe -fexceptions -fstack-protector -ffat-lto-objects -fno-trapping-math -mtune=skylake -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
Encoder Speed: 2 = 30.12 (SE +/- 0.42, N = 3)
Encoder Speed: 6 = 11.73 (SE +/- 0.11, N = 7)
Encoder Speed: 10 = 3.669 (SE +/- 0.020, N = 3)
Encoder Speed: 6, Lossless = 28.71 (SE +/- 0.32, N = 5)
Encoder Speed: 10, Lossless = 6.442 (SE +/- 0.039, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN 2.1.2 (ms, Fewer Is Better)
Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU = 1.71888 (SE +/- 0.01330, N = 3; MIN: 1.52)
Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU = 1.038714 (SE +/- 0.013211, N = 15; MIN: 0.69)
Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU = 3.62375 (SE +/- 0.01053, N = 3; MIN: 3.41)
Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU = 2.11061 (SE +/- 0.00683, N = 3; MIN: 2.02)
Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU = 3.81893 (SE +/- 0.02129, N = 3; MIN: 3.72)
Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU = 0.383772 (SE +/- 0.001058, N = 3; MIN: 0.34)
Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU = 0.309397 (SE +/- 0.003975, N = 15; MIN: 0.28)
Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU = 900.34 (SE +/- 3.62, N = 3; MIN: 883.53)
Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU = 3.19591 (SE +/- 0.00586, N = 3; MIN: 2.99)
Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU = 4.03906 (SE +/- 0.01370, N = 3; MIN: 3.94)
Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU = 4.42830 (SE +/- 0.00032, N = 3; MIN: 4.39)
Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU = 476.66 (SE +/- 1.18, N = 3; MIN: 466)
Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU = 895.45 (SE +/- 2.11, N = 3; MIN: 885.58)
Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU = 477.72 (SE +/- 2.69, N = 3; MIN: 464.94)
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU = 0.330218 (SE +/- 0.002990, N = 7; MIN: 0.28)
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU = 0.833080 (SE +/- 0.006956, N = 3; MIN: 0.77)
1. (CXX) g++ options: -O3 -march=native -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Google SynthMark 20201109 (Voices, More Is Better)
Test: VoiceMark_100 = 614.46 (SE +/- 0.92, N = 3)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
Threads: 1 - Buffer Length: 256 - Filter Length: 57 = 58712333 (SE +/- 6887.99, N = 3)
Threads: 16 - Buffer Length: 256 - Filter Length: 57 = 822863333 (SE +/- 1029827.39, N = 3)
Threads: 32 - Buffer Length: 256 - Filter Length: 57 = 1518633333 (SE +/- 3507293.99, N = 3)
Threads: 64 - Buffer Length: 256 - Filter Length: 57 = 2315900000 (SE +/- 22409893.65, N = 3)
Threads: 112 - Buffer Length: 256 - Filter Length: 57 = 2853633333 (SE +/- 3573202.73, N = 3)
1. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread -lm -lc -lliquid

GROMACS 2021 - Input: water_GMX50_bare (Ns Per Day, More Is Better) - Clear Linux 34420: 5.759 (SE +/- 0.020, N = 3)
(CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread
TensorFlow Lite 2020-08-23 - Model: SqueezeNet (Microseconds, Fewer Is Better) - Clear Linux 34420: 61863.5 (SE +/- 312.31, N = 3)
TensorFlow Lite 2020-08-23 - Model: Inception V4 (Microseconds, Fewer Is Better) - Clear Linux 34420: 883114 (SE +/- 3472.47, N = 3)
TensorFlow Lite 2020-08-23 - Model: NASNet Mobile (Microseconds, Fewer Is Better) - Clear Linux 34420: 97445.7 (SE +/- 1598.38, N = 15)
TensorFlow Lite 2020-08-23 - Model: Mobilenet Float (Microseconds, Fewer Is Better) - Clear Linux 34420: 51318.3 (SE +/- 591.27, N = 3)
TensorFlow Lite 2020-08-23 - Model: Mobilenet Quant (Microseconds, Fewer Is Better) - Clear Linux 34420: 49894.1 (SE +/- 584.51, N = 3)
TensorFlow Lite 2020-08-23 - Model: Inception ResNet V2 (Microseconds, Fewer Is Better) - Clear Linux 34420: 811638 (SE +/- 5456.47, N = 3)
Mobile Neural Network 1.1.3 - Model: SqueezeNetV1.0 (ms, Fewer Is Better) - Clear Linux 34420: 8.157 (SE +/- 0.099, N = 15, MIN: 5.82 / MAX: 12.06)
Mobile Neural Network 1.1.3 - Model: resnet-v2-50 (ms, Fewer Is Better) - Clear Linux 34420: 25.73 (SE +/- 0.13, N = 15, MIN: 24.56 / MAX: 67.2)
Mobile Neural Network 1.1.3 - Model: MobileNetV2_224 (ms, Fewer Is Better) - Clear Linux 34420: 4.750 (SE +/- 0.059, N = 15, MIN: 3.76 / MAX: 9.4)
Mobile Neural Network 1.1.3 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better) - Clear Linux 34420: 2.672 (SE +/- 0.010, N = 15, MIN: 2.48 / MAX: 4.13)
Mobile Neural Network 1.1.3 - Model: inception-v3 (ms, Fewer Is Better) - Clear Linux 34420: 32.48 (SE +/- 0.11, N = 15, MIN: 29.61 / MAX: 94.15)
(CXX) g++ options for the five Mobile Neural Network results above: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better) - Clear Linux 34420: 330.63 (SE +/- 0.21, N = 3, MIN: 329.78 / MAX: 333.62)
TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better) - Clear Linux 34420: 330.16 (SE +/- 0.12, N = 3, MIN: 329.51 / MAX: 332.56)
(CXX) g++ options for the two TNN results above: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -fopenmp -pthread -fvisibility=hidden -rdynamic -ldl
IndigoBench 4.4 - Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better) - Clear Linux 34420: 8.576 (SE +/- 0.013, N = 3)
IndigoBench 4.4 - Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better) - Clear Linux 34420: 20.20 (SE +/- 0.11, N = 3)
Blender 2.92 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better) - Clear Linux 34420: 37.84 (SE +/- 0.10, N = 3)
Blender 2.92 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better) - Clear Linux 34420: 55.71 (SE +/- 0.05, N = 3)
Chaos Group V-RAY 5 - Mode: CPU (vsamples, More Is Better) - Clear Linux 34420: 49082 (SE +/- 217.08, N = 3)
Phoronix Test Suite v10.8.4