Xeon E3 1270 v5 Xmas

Intel Xeon E3-1270 v5 testing with a ASUS E3 PRO GAMING V5 (2606 BIOS) and ASUS NVIDIA NV84 256MB on Clear Linux OS 31470 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012279-HA-XEONE312725&rdt&grr.

Xeon E3 1270 v5 XmasProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123Intel Xeon E3-1270 v5 @ 4.00GHz (4 Cores / 8 Threads)ASUS E3 PRO GAMING V5 (2606 BIOS)Intel Xeon E3-1200 v5/E3-15008GB256GB Samsung SSD 850ASUS NVIDIA NV84 256MBRealtek ALC1150DELL S2409WIntel I219-LMClear Linux OS 314705.3.8-854.native (x86_64)GNOME Shell 3.34.1X Server 1.20.5nouveau 1.0.163.3 Mesa 19.3.0-develGCC 9.2.1 20191101 gcc-9-branch@277702 + Clang 9.0.0 + LLVM 9.0.0ext41920x1080OpenBenchmarking.orgEnvironment Details- CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags" FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags" CXXFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -fvisibility-inlines-hidden -Wl,--enable-new-dtags" MESA_GLSL_CACHE_DISABLE=0 CFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake" THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx"" Compiler Details- --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell Disk Details- BFQ / relatime,rw,stripe=256 / Block Size: 4096Processor Details- Scaling Governor: intel_pstate performance - CPU Microcode: 0xccPython Details- Python 3.7.5Security Details- l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling

Xeon E3 1270 v5 Xmasbasis: UASTC Level 2 + RDO Post-Processingastcenc: Exhaustivegromacs: Water Benchmarkkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumnumpy: asmfish: 1024 Hash Memory, 26 Depthbasis: UASTC Level 3hmmer: Pfam Database Searchbuild-ffmpeg: Time To Compileonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetstockfish: Total Timeclomp: Static OMP Speedupkvazaar: Bosphorus 4K - Very Fastonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUx265: Bosphorus 4Knode-web-tooling: build-eigen: Time To Compilebasis: ETC1Sastcenc: Thoroughbasis: UASTC Level 2simdjson: PartialTweetssqlite-speedtest: Timed Time - Size 1,000simdjson: DistinctUserIDcompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedindigobench: CPU - Bedroomcompress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedindigobench: CPU - Supercarrav1e: 5kvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumrav1e: 1simdjson: Kostyasimdjson: LargeRandkvazaar: Bosphorus 4K - Ultra Fastrav1e: 6espeak: Text-To-Speech Synthesisredis: GETcompress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedcompilebench: Compilerav1e: 10encode-wavpack: WAV To WavPackcoremark: CoreMark Size 666 - Iterations Per Secondkvazaar: Bosphorus 1080p - Very Fastonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUencode-ape: WAV To APEencode-ogg: WAV To Oggx265: Bosphorus 1080predis: SETonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUencode-opus: WAV To Opus Encodeastcenc: Mediumonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUkvazaar: Bosphorus 1080p - Ultra Fastredis: LPUSHredis: LPOPredis: SADDbasis: UASTC Level 0onednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUastcenc: Fastonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUlammps: Rhodopsin Proteinonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUcompilebench: Read Compiled Treecompilebench: Initial Create123795.732541.150.5232.442.49353.6712809865135.591115.982110.5716901.216906.916890.2416.0528.7236.7544.5317.4620.1088.6619.492.3610.356.268.236.137.7127.9483916181.56.953674.983684.403671.167.3511.3773.60072.26666.8468.9172.9367.4083.038044.546.870.7878035.348.151.8061.04710.6110.890.3662.290.8612.761.39430.4852316062.038068.87759.311046.883.00515.193162404.27088428.2410.1322910.257412.41020.40133.931775770.477.491153.436998.99010.205.696925.7122651.751555951.752550048.171987429.719.6162.6841410.63067.8520.872122.01753.0577.1714413.35621293.47491.40795.494539.890.5262.452.49351.8012702982135.617115.996110.5626913.786895.936864.5316.0428.6836.6744.4317.4320.0988.1519.462.3510.346.258.246.157.727.8984471931.56.953682.673678.693660.607.3911.3673.51772.36066.8268.9402.9466.7513.038040.047.350.7908025.948.101.8161.04810.6110.910.3662.290.8512.741.38630.0082257650.588115.17707.371045.423.01715.172166523.86702928.329.9076610.255412.42920.42233.621734581.087.485823.441248.99410.25.688805.7069451.811553337.421596414.461965680.879.5902.6914010.63187.8520.841622.00253.0637.1648313.36521298.04462.79796.132539.490.5242.442.49352.8412531518135.626116.043110.5506903.026886.746877.2116.0528.7236.5944.4417.4520.0488.3319.412.3510.306.268.246.137.7127.9484253541.56.953671.723683.503665.027.3311.3673.49072.36366.8068.9302.9467.5253.028014.847.110.7908013.148.191.8171.04610.6110.910.3662.280.8512.731.38930.1742236387.878101.97687.131015.723.00615.157164904.36404928.319.9972910.193112.49720.40233.771771304.377.480153.438108.98610.25.703035.7073851.841562717.171587957.581971803.879.5872.6929010.63037.8520.830322.02033.0697.1696413.3661993.62473.49OpenBenchmarking.org

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1232004006008001000SE +/- 0.56, N = 3SE +/- 0.40, N = 3SE +/- 0.03, N = 3795.73795.49796.131. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123120240360480600SE +/- 1.57, N = 3SE +/- 0.22, N = 3SE +/- 0.12, N = 3541.15539.89539.491. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.11840.23680.35520.47360.592SE +/- 0.003, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 30.5230.5260.5241. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread -lrt -lpthread -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.55131.10261.65392.20522.7565SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.442.452.441. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.56031.12061.68092.24122.8015SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.492.492.491. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12380160240320400SE +/- 1.64, N = 3SE +/- 1.12, N = 3SE +/- 0.90, N = 3353.67351.80352.84

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1233M6M9M12M15MSE +/- 114329.80, N = 3SE +/- 102809.00, N = 3SE +/- 115883.84, N = 3128098651270298212531518

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123306090120150SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3135.59135.62135.631. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3115.98116.00116.041. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread -lhmmer -leasel -lm

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile12320406080100SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3110.57110.56110.55

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU12315003000450060007500SE +/- 22.22, N = 3SE +/- 2.64, N = 3SE +/- 13.53, N = 36901.216913.786903.02MIN: 6786.8MIN: 6834.84MIN: 6804.481. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12315003000450060007500SE +/- 18.32, N = 3SE +/- 13.21, N = 3SE +/- 15.39, N = 36906.916895.936886.74MIN: 6791.64MIN: 6790.76MIN: 6781.261. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12315003000450060007500SE +/- 15.93, N = 3SE +/- 21.21, N = 3SE +/- 15.30, N = 36890.246864.536877.21MIN: 6803.22MIN: 6760.08MIN: 6781.481. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m12348121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 316.0516.0416.05MIN: 15.98 / MAX: 16.15MIN: 15.98 / MAX: 16.19MIN: 15.97 / MAX: 17.21. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123714212835SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 328.7228.6828.72MIN: 28.52 / MAX: 28.98MIN: 28.51 / MAX: 29.57MIN: 28.56 / MAX: 37.321. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123816243240SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 336.7536.6736.59MIN: 36.55 / MAX: 45.59MIN: 36.34 / MAX: 37.78MIN: 36.35 / MAX: 45.551. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501231020304050SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 344.5344.4344.44MIN: 44.2 / MAX: 53.53MIN: 44.16 / MAX: 44.84MIN: 44.05 / MAX: 53.611. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet12348121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 317.4617.4317.45MIN: 17.18 / MAX: 17.79MIN: 17.19 / MAX: 17.69MIN: 17.21 / MAX: 17.711. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18123510152025SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 320.1020.0920.04MIN: 19.65 / MAX: 20.5MIN: 19.78 / MAX: 20.46MIN: 19.64 / MAX: 26.191. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1612320406080100SE +/- 0.14, N = 3SE +/- 0.28, N = 3SE +/- 0.26, N = 388.6688.1588.33MIN: 87.86 / MAX: 97.1MIN: 87.13 / MAX: 95.68MIN: 87.3 / MAX: 89.541. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 319.4919.4619.41MIN: 19.41 / MAX: 20.57MIN: 19.34 / MAX: 19.62MIN: 19.35 / MAX: 19.661. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1230.5311.0621.5932.1242.655SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.362.352.35MIN: 2.32 / MAX: 2.4MIN: 2.32 / MAX: 2.42MIN: 2.32 / MAX: 2.391. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01233691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 310.3510.3410.30MIN: 10.31 / MAX: 10.41MIN: 10.26 / MAX: 17.07MIN: 10.26 / MAX: 10.351. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet123246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.266.256.26MIN: 6.21 / MAX: 6.32MIN: 6.22 / MAX: 6.31MIN: 6.23 / MAX: 6.311. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2123246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.238.248.24MIN: 8.18 / MAX: 8.44MIN: 8.19 / MAX: 8.31MIN: 8.18 / MAX: 8.361. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.136.156.13MIN: 6.11 / MAX: 6.3MIN: 6.11 / MAX: 6.23MIN: 6.09 / MAX: 6.171. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2123246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 37.717.707.71MIN: 7.66 / MAX: 7.79MIN: 7.65 / MAX: 7.99MIN: 7.64 / MAX: 7.891. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123714212835SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 327.9427.8927.94MIN: 27.83 / MAX: 28.25MIN: 27.69 / MAX: 28.15MIN: 27.76 / MAX: 57.151. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lgomp -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1232M4M6M8M10MSE +/- 38275.35, N = 3SE +/- 120522.70, N = 4SE +/- 26943.43, N = 38391618844719384253541. (CXX) g++ options: -m64 -lpthread -O3 -pipe -fexceptions -fstack-protector -ffat-lto-objects -fno-trapping-math -mtune=skylake -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1230.33750.6751.01251.351.6875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.51.51.51. (CC) gcc options: -fopenmp -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast123246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.956.956.951. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1238001600240032004000SE +/- 4.94, N = 3SE +/- 11.74, N = 3SE +/- 4.36, N = 33674.983682.673671.72MIN: 3619.26MIN: 3611.92MIN: 3623.531. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1238001600240032004000SE +/- 5.10, N = 3SE +/- 15.81, N = 3SE +/- 4.57, N = 33684.403678.693683.50MIN: 3627.75MIN: 3596.98MIN: 3623.361. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1238001600240032004000SE +/- 9.05, N = 3SE +/- 7.55, N = 3SE +/- 4.53, N = 33671.163660.603665.02MIN: 3612.47MIN: 3599.27MIN: 36141. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K123246810SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 37.357.397.331. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lpthread -lrt -ldl -lnuma

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 311.3711.3611.361. Nodejs v12.13.0

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1231632486480SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 373.6073.5273.49

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1231632486480SE +/- 0.19, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 372.2772.3672.361. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1231530456075SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 366.8466.8266.801. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 21231530456075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 368.9268.9468.931. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.66151.3231.98452.6463.3075SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.932.942.941. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231530456075SE +/- 0.23, N = 3SE +/- 0.08, N = 3SE +/- 0.45, N = 367.4166.7567.531. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -ldl -lz -lpthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.68181.36362.04542.72723.409SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.033.033.021. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1232K4K6K8K10KSE +/- 1.82, N = 3SE +/- 4.43, N = 3SE +/- 1.47, N = 38044.58040.08014.81. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231122334455SE +/- 0.37, N = 3SE +/- 0.01, N = 3SE +/- 0.15, N = 346.8747.3547.111. (CC) gcc options: -O3

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1230.17780.35560.53340.71120.889SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 30.7870.7900.790

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1232K4K6K8K10KSE +/- 6.16, N = 3SE +/- 3.90, N = 3SE +/- 8.78, N = 38035.38025.98013.11. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231122334455SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 348.1548.1048.191. (CC) gcc options: -O3

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1230.40880.81761.22641.63522.044SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.005, N = 31.8061.8161.817

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.23580.47160.70740.94321.179SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.003, N = 31.0471.0481.046

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.6110.6110.611. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.8910.9110.911. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.08240.16480.24720.32960.412SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.3660.3660.366

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.51531.03061.54592.06122.5765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.292.292.281. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.19350.3870.58050.7740.9675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.860.850.851. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast1233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 312.7612.7412.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.31370.62740.94111.25481.5685SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 31.3941.3861.389

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123714212835SE +/- 0.09, N = 4SE +/- 0.03, N = 4SE +/- 0.03, N = 430.4930.0130.171. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c99 -lpthread -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 29970.17, N = 15SE +/- 12822.26, N = 3SE +/- 18610.84, N = 152316062.032257650.582236387.871. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1232K4K6K8K10KSE +/- 4.89, N = 3SE +/- 6.60, N = 3SE +/- 13.85, N = 38068.88115.18101.91. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12317003400510068008500SE +/- 10.41, N = 3SE +/- 31.16, N = 3SE +/- 44.31, N = 37759.317707.377687.131. (CC) gcc options: -O3

Compile Bench

Test: Compile

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Compile1232004006008001000SE +/- 7.56, N = 3SE +/- 3.80, N = 3SE +/- 6.75, N = 31046.881045.421015.72

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.67881.35762.03642.71523.394SE +/- 0.028, N = 3SE +/- 0.023, N = 3SE +/- 0.015, N = 33.0053.0173.006

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.02, N = 5SE +/- 0.00, N = 5SE +/- 0.01, N = 515.1915.1715.161. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12340K80K120K160K200KSE +/- 707.58, N = 3SE +/- 2647.76, N = 3SE +/- 1019.61, N = 3162404.27166523.87164904.361. (CC) gcc options: -O2 -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lrt" -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast123714212835SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 328.2428.3228.311. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.08192, N = 3SE +/- 0.02840, N = 3SE +/- 0.10539, N = 310.132299.907669.99729MIN: 9.9MIN: 9.83MIN: 9.841. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 310.2610.2610.19MIN: 10.16MIN: 10.14MIN: 10.11. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1233691215SE +/- 0.04, N = 5SE +/- 0.04, N = 5SE +/- 0.04, N = 512.4112.4312.501. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pedantic -rdynamic -lrt

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To Ogg123510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 320.4020.4220.401. (CC) gcc options: -O2 -ffast-math -fsigned-char -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123816243240SE +/- 0.14, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 333.9333.6233.771. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic -lpthread -lrt -ldl -lnuma

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 16979.65, N = 9SE +/- 28054.56, N = 3SE +/- 14894.97, N = 31775770.471734581.081771304.371. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU123246810SE +/- 0.00245, N = 3SE +/- 0.01082, N = 3SE +/- 0.00655, N = 37.491157.485827.48015MIN: 7.27MIN: 7.25MIN: 7.261. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.77431.54862.32293.09723.8715SE +/- 0.00076, N = 3SE +/- 0.00368, N = 3SE +/- 0.00077, N = 33.436993.441243.43810MIN: 3.41MIN: 3.41MIN: 3.411. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1233691215SE +/- 0.012, N = 5SE +/- 0.013, N = 5SE +/- 0.012, N = 58.9908.9948.9861. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -fvisibility=hidden -logg -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310.2010.2010.201. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1231.28322.56643.84965.13286.416SE +/- 0.00435, N = 3SE +/- 0.00822, N = 3SE +/- 0.01227, N = 35.696925.688805.70303MIN: 5.45MIN: 5.45MIN: 5.441. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1231.28532.57063.85595.14126.4265SE +/- 0.00136, N = 3SE +/- 0.00499, N = 3SE +/- 0.00345, N = 35.712265.706945.70738MIN: 5.57MIN: 5.58MIN: 5.571. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast1231224364860SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 351.7551.8151.841. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lpthread -lm -lrt

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 12177.30, N = 3SE +/- 24273.96, N = 3SE +/- 5194.67, N = 31555951.751553337.421562717.171. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 36104.85, N = 3SE +/- 5729.05, N = 3SE +/- 12529.50, N = 32550048.171596414.461587957.581. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123400K800K1200K1600K2000KSE +/- 9920.35, N = 3SE +/- 25503.06, N = 3SE +/- 21310.59, N = 31987429.711965680.871971803.871. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.016, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 39.6169.5909.5871. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.60591.21181.81772.42363.0295SE +/- 0.00502, N = 3SE +/- 0.00691, N = 3SE +/- 0.00357, N = 32.684142.691402.69290MIN: 2.62MIN: 2.63MIN: 2.641. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310.6310.6310.63MIN: 10.5MIN: 10.48MIN: 10.481. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast123246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 37.857.857.851. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 320.8720.8420.83MIN: 20.68MIN: 20.6MIN: 20.561. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 322.0222.0022.02MIN: 21.82MIN: 21.85MIN: 21.841. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.69051.3812.07152.7623.4525SE +/- 0.006, N = 3SE +/- 0.008, N = 3SE +/- 0.006, N = 33.0573.0633.0691. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.00373, N = 3SE +/- 0.00530, N = 3SE +/- 0.00921, N = 37.171447.164837.16964MIN: 7.14MIN: 7.13MIN: 7.131. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 313.3613.3713.37MIN: 13.24MIN: 13.2MIN: 13.241. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Compile Bench

Test: Read Compiled Tree

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Read Compiled Tree12330060090012001500SE +/- 12.37, N = 3SE +/- 6.46, N = 3SE +/- 248.56, N = 31293.471298.04993.62

Compile Bench

Test: Initial Create

OpenBenchmarking.orgMB/s, More Is BetterCompile Bench 0.6Test: Initial Create123110220330440550SE +/- 16.31, N = 3SE +/- 37.60, N = 3SE +/- 34.14, N = 3491.40462.79473.49


Phoronix Test Suite v10.8.4