Ryzen 7 1800X Ubuntu 2020

AMD Ryzen 7 1800X Eight-Core testing with a MSI X370 XPOWER GAMING TITANIUM (MS-7A31) v1.0 (1.F0 BIOS) and AMD Radeon RX 460/560D / Pro 450/455/460/555/555X/560/560X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012221-HA-RYZEN718094&grr.

Ryzen 7 1800X Ubuntu 2020: system details (runs 1, 2, 3, 4)

Processor: AMD Ryzen 7 1800X Eight-Core @ 3.60GHz (8 Cores / 16 Threads)
Motherboard: MSI X370 XPOWER GAMING TITANIUM (MS-7A31) v1.0 (1.F0 BIOS)
Chipset: AMD 17h
Memory: 8GB
Disk: Samsung SSD 950 PRO 256GB
Graphics: AMD Radeon RX 460/560D / Pro 450/455/460/555/555X/560/560X 2GB (1212/1750MHz)
Audio: AMD Baffin HDMI/DP
Monitor: LG Ultra HD
Network: Intel I211
OS: Ubuntu 20.10
Kernel: 5.8.0-21-generic (x86_64)
Desktop: GNOME Shell 3.38.0; GNOME Shell 3.38.1 (differs between runs)
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
OpenGL: 4.6 Mesa 20.2.0 (LLVM 11.0.0); 4.6 Mesa 20.2.1 (LLVM 11.0.0) (differs between runs)
Vulkan: 1.2.131
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 3840x2160

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled); CPU Microcode: 0x8001137

Java Details: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)

Python Details: Python 3.8.6

Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Ryzen 7 1800X Ubuntu 2020: result overview table (all tests, runs 1, 2, 3, 4). The individual results are listed per test in the sections that follow.

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

Basis Universal 1.12, Settings: UASTC Level 2 + RDO Post-Processing (Seconds, Fewer Is Better)
Run 1: 749.56, SE +/- 0.61, N = 3
Run 2: 749.12, SE +/- 0.30, N = 3
Run 3: 749.47, SE +/- 0.52, N = 3
Run 4: 750.05, SE +/- 0.62, N = 3
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

HPC Challenge

Test / Class: G-HPL

HPC Challenge 1.5.0, Test / Class: G-HPL (GFLOPS, More Is Better)
Run 1: 64.57, SE +/- 0.51, N = 3
Run 2: 65.51, SE +/- 0.93, N = 4
Run 3: 66.15, SE +/- 0.03, N = 3
Run 4: 67.88, SE +/- 0.39, N = 3
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 4.0.3
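
To put the run-to-run differences in context, the following is a minimal sketch (it is not part of the Phoronix Test Suite output) that expresses each G-HPL mean above as a percent change against run 1. The choice of run 1 as the baseline and the helper name `percent_change` are assumptions made for illustration only.

```python
# Means copied from the G-HPL result above (GFLOPS, more is better).
means = {1: 64.57, 2: 65.51, 3: 66.15, 4: 67.88}


def percent_change(baseline: float, value: float) -> float:
    """Relative change of `value` against `baseline`, in percent."""
    return (value - baseline) / baseline * 100.0


# Run 1 is used as the baseline here; this is an arbitrary illustrative choice.
baseline = means[1]
changes = {run: percent_change(baseline, mean) for run, mean in means.items()}

for run, change in changes.items():
    print(f"Run {run}: {means[run]:.2f} GFLOPS ({change:+.2f}% vs run 1)")
```

The same calculation applies to any other result in this export; for "Fewer Is Better" tests a negative change is the improvement.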

Timed Clash Compilation

Time To Compile

Timed Clash Compilation, Time To Compile (Seconds, Fewer Is Better)
Run 1: 311.81, SE +/- 0.57, N = 3
Run 2: 310.72, SE +/- 0.85, N = 3
Run 3: 310.67, SE +/- 0.68, N = 3
Run 4: 312.51, SE +/- 0.75, N = 3

GROMACS

Water Benchmark

GROMACS 2020.3, Water Benchmark (Ns Per Day, More Is Better)
Run 1: 0.597, SE +/- 0.000, N = 3
Run 2: 0.599, SE +/- 0.002, N = 3
Run 3: 0.597, SE +/- 0.001, N = 3
Run 4: 0.597, SE +/- 0.001, N = 3
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Numpy Benchmark

Numpy Benchmark (Score, More Is Better)
Run 1: 275.19, SE +/- 1.13, N = 3
Run 2: 278.37, SE +/- 0.56, N = 3
Run 3: 274.84, SE +/- 0.53, N = 3
Run 4: 272.72, SE +/- 0.70, N = 3

BRL-CAD

VGR Performance Metric

BRL-CAD 7.30.8, VGR Performance Metric (VGR Performance Metric, More Is Better)
Run 1: 94326
Run 2: 92840
Run 3: 92276
Run 4: 93406
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.0, Preset: Exhaustive (Seconds, Fewer Is Better)
Run 1: 265.75, SE +/- 0.31, N = 3
Run 2: 265.51, SE +/- 0.04, N = 3
Run 3: 270.54, SE +/- 0.07, N = 3
Run 4: 270.67, SE +/- 0.70, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Build2

Time To Compile

Build2 0.13, Time To Compile (Seconds, Fewer Is Better)
Run 1: 203.71, SE +/- 0.68, N = 3
Run 2: 203.43, SE +/- 1.04, N = 3
Run 3: 203.14, SE +/- 1.32, N = 3
Run 4: 203.50, SE +/- 0.38, N = 3

asmFish

1024 Hash Memory, 26 Depth

asmFish 2018-07-23, 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
Run 1: 22191924, SE +/- 126857.99, N = 3
Run 2: 22669744, SE +/- 181612.11, N = 3
Run 3: 22499028, SE +/- 139738.13, N = 3
Run 4: 22355945, SE +/- 107724.99, N = 3

CLOMP

Static OMP Speedup

CLOMP 1.2, Static OMP Speedup (Speedup, More Is Better)
Run 1: 8.2, SE +/- 0.12, N = 3
Run 2: 7.7, SE +/- 0.07, N = 15
Run 3: 7.9, SE +/- 0.12, N = 3
Run 4: 8.1, SE +/- 0.07, N = 10
1. (CC) gcc options: -fopenmp -O3 -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
Run 1: 3.65, SE +/- 0.00, N = 3
Run 2: 3.66, SE +/- 0.00, N = 3
Run 3: 3.66, SE +/- 0.00, N = 3
Run 4: 3.63, SE +/- 0.00, N = 3
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
Run 1: 3.72, SE +/- 0.01, N = 3
Run 2: 3.72, SE +/- 0.00, N = 3
Run 3: 3.74, SE +/- 0.00, N = 3
Run 4: 3.69, SE +/- 0.00, N = 3
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 3.3.1, Pfam Database Search (Seconds, Fewer Is Better)
Run 1: 129.78, SE +/- 0.12, N = 3
Run 2: 129.69, SE +/- 0.30, N = 3
Run 3: 129.87, SE +/- 0.29, N = 3
Run 4: 131.04, SE +/- 0.35, N = 3
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

simdjson

Throughput Test: LargeRandom

simdjson 0.7.1, Throughput Test: LargeRandom (GB/s, More Is Better)
Run 1: 0.38, SE +/- 0.00, N = 3
Run 2: 0.38, SE +/- 0.00, N = 3
Run 3: 0.38, SE +/- 0.00, N = 3
Run 4: 0.37, SE +/- 0.00, N = 15
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.7.1, Throughput Test: PartialTweets (GB/s, More Is Better)
Run 1: 0.50, SE +/- 0.00, N = 3
Run 2: 0.50, SE +/- 0.00, N = 3
Run 3: 0.50, SE +/- 0.00, N = 3
Run 4: 0.48, SE +/- 0.01, N = 15
1. (CXX) g++ options: -O3 -pthread

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 3.9.0, Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
Run 1: 7.5511, SE +/- 0.0837, N = 7, MIN: 6.99 / MAX: 7.77
Run 2: 7.7130, SE +/- 0.0357, N = 3, MIN: 7.63 / MAX: 7.89
Run 3: 7.6994, SE +/- 0.0561, N = 3, MIN: 7.43 / MAX: 7.87
Run 4: 7.6548, SE +/- 0.0273, N = 3, MIN: 7.54 / MAX: 7.79

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Run 1: 7668.24, SE +/- 13.57, N = 3, MIN: 7615.61
Run 2: 7632.59, SE +/- 10.03, N = 3, MIN: 7582.41
Run 3: 7632.29, SE +/- 5.63, N = 3, MIN: 7596.46
Run 4: 7629.66, SE +/- 17.72, N = 3, MIN: 7565.32
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 7691.66, SE +/- 62.42, N = 3, MIN: 7573.71
Run 2: 7622.54, SE +/- 12.97, N = 3, MIN: 7569.78
Run 3: 7602.27, SE +/- 5.57, N = 3, MIN: 7559.13
Run 4: 7637.10, SE +/- 3.89, N = 3, MIN: 7599.54
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 7647.51, SE +/- 6.51, N = 3, MIN: 7608.91
Run 2: 7614.36, SE +/- 18.50, N = 3, MIN: 7557.67
Run 3: 7601.15, SE +/- 16.63, N = 3, MIN: 7549.66
Run 4: 7615.86, SE +/- 20.02, N = 3, MIN: 7564.68
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Eigen Compilation

Time To Compile

Timed Eigen Compilation 3.3.9, Time To Compile (Seconds, Fewer Is Better)
Run 1: 100.17, SE +/- 0.17, N = 3
Run 2: 99.91, SE +/- 0.02, N = 3
Run 3: 99.64, SE +/- 0.29, N = 3
Run 4: 99.87, SE +/- 0.14, N = 3

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218, Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
Run 1: 28.24, SE +/- 1.01, N = 3, MIN: 25.09 / MAX: 90.56
Run 2: 27.10, SE +/- 0.33, N = 3, MIN: 25.08 / MAX: 69.15
Run 3: 27.46, SE +/- 0.17, N = 3, MIN: 25.03 / MAX: 86.63
Run 4: 27.08, SE +/- 0.16, N = 3, MIN: 25.17 / MAX: 62.23
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218, Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
Run 1: 31.39, SE +/- 0.55, N = 3, MIN: 27.72 / MAX: 93.62
Run 2: 31.01, SE +/- 0.20, N = 3, MIN: 27.42 / MAX: 92.87
Run 3: 30.91, SE +/- 0.30, N = 3, MIN: 27.74 / MAX: 74.48
Run 4: 31.29, SE +/- 0.28, N = 3, MIN: 27.91 / MAX: 91.73
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218, Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
Run 1: 40.78, SE +/- 0.14, N = 3, MIN: 34.36 / MAX: 65.77
Run 2: 40.33, SE +/- 0.26, N = 3, MIN: 33.98 / MAX: 69.99
Run 3: 40.99, SE +/- 0.12, N = 3, MIN: 34.83 / MAX: 65.41
Run 4: 40.94, SE +/- 0.08, N = 3, MIN: 34.04 / MAX: 100.31
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20201218, Target: CPU - Model: resnet50 (ms, Fewer Is Better)
Run 1: 49.40, SE +/- 0.49, N = 3, MIN: 41.46 / MAX: 109.15
Run 2: 48.68, SE +/- 0.27, N = 3, MIN: 40.99 / MAX: 104.67
Run 3: 49.47, SE +/- 0.19, N = 3, MIN: 41.96 / MAX: 101.85
Run 4: 50.43, SE +/- 0.27, N = 3, MIN: 42.18 / MAX: 103.04
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20201218, Target: CPU - Model: alexnet (ms, Fewer Is Better)
Run 1: 15.20, SE +/- 0.05, N = 3, MIN: 14.19 / MAX: 53.26
Run 2: 15.05, SE +/- 0.08, N = 3, MIN: 14.16 / MAX: 39.79
Run 3: 15.29, SE +/- 0.04, N = 3, MIN: 14.23 / MAX: 41.93
Run 4: 15.16, SE +/- 0.08, N = 3, MIN: 14.26 / MAX: 42.34
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20201218, Target: CPU - Model: resnet18 (ms, Fewer Is Better)
Run 1: 21.11, SE +/- 0.12, N = 3, MIN: 17.27 / MAX: 53.65
Run 2: 21.31, SE +/- 0.27, N = 3, MIN: 17.11 / MAX: 64
Run 3: 21.33, SE +/- 0.14, N = 3, MIN: 17.27 / MAX: 74.94
Run 4: 21.09, SE +/- 0.10, N = 3, MIN: 17.35 / MAX: 50.27
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20201218, Target: CPU - Model: vgg16 (ms, Fewer Is Better)
Run 1: 73.59, SE +/- 0.24, N = 3, MIN: 68.98 / MAX: 102.41
Run 2: 73.31, SE +/- 0.06, N = 3, MIN: 69.22 / MAX: 103.56
Run 3: 74.36, SE +/- 0.31, N = 3, MIN: 69.28 / MAX: 98.86
Run 4: 74.16, SE +/- 0.33, N = 3, MIN: 68.87 / MAX: 113.5
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20201218, Target: CPU - Model: googlenet (ms, Fewer Is Better)
Run 1: 26.13, SE +/- 0.13, N = 3, MIN: 20.41 / MAX: 71.52
Run 2: 25.55, SE +/- 0.14, N = 3, MIN: 20.25 / MAX: 56.58
Run 3: 26.04, SE +/- 0.08, N = 3, MIN: 20.27 / MAX: 72.59
Run 4: 26.35, SE +/- 0.48, N = 3, MIN: 20.56 / MAX: 67.76
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20201218, Target: CPU - Model: blazeface (ms, Fewer Is Better)
Run 1: 3.06, SE +/- 0.01, N = 3, MIN: 2.77 / MAX: 13.58
Run 2: 3.06, SE +/- 0.03, N = 3, MIN: 2.78 / MAX: 9.98
Run 3: 3.21, SE +/- 0.09, N = 3, MIN: 2.79 / MAX: 31.43
Run 4: 3.25, SE +/- 0.11, N = 3, MIN: 2.84 / MAX: 31.38
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218, Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
Run 1: 12.64, SE +/- 0.40, N = 3, MIN: 11.17 / MAX: 59.1
Run 2: 12.46, SE +/- 0.29, N = 3, MIN: 11.22 / MAX: 52.06
Run 3: 13.38, SE +/- 0.40, N = 3, MIN: 11.27 / MAX: 55.65
Run 4: 12.41, SE +/- 0.16, N = 3, MIN: 11.19 / MAX: 47.97
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218, Target: CPU - Model: mnasnet (ms, Fewer Is Better)
Run 1: 8.80, SE +/- 0.19, N = 3, MIN: 7.59 / MAX: 67.87
Run 2: 8.69, SE +/- 0.22, N = 2, MIN: 7.71 / MAX: 44.91
Run 3: 9.05, SE +/- 0.05, N = 3, MIN: 7.58 / MAX: 97.5
Run 4: 8.36, SE +/- 0.07, N = 2, MIN: 7.74 / MAX: 26.07
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218, Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
Run 1: 9.27, SE +/- 0.36, N = 3, MIN: 8.28 / MAX: 67.75
Run 2: 9.06, SE +/- 0.07, N = 3, MIN: 8.42 / MAX: 39.79
Run 3: 9.36, SE +/- 0.19, N = 3, MIN: 8.57 / MAX: 69.21
Run 4: 9.22, SE +/- 0.02, N = 3, MIN: 8.63 / MAX: 31.05
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218, Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Run 1: 8.68, SE +/- 0.41, N = 3, MIN: 7.2 / MAX: 61.54
Run 2: 8.30, SE +/- 0.29, N = 3, MIN: 7.15 / MAX: 68.15
Run 3: 8.45, SE +/- 0.28, N = 3, MIN: 7.19 / MAX: 61.48
Run 4: 8.19, SE +/- 0.09, N = 3, MIN: 7.06 / MAX: 34.19
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218, Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
Run 1: 9.69, SE +/- 0.26, N = 3, MIN: 8.2 / MAX: 69.64
Run 2: 9.47, SE +/- 0.17, N = 3, MIN: 8.22 / MAX: 59.67
Run 3: 9.74, SE +/- 0.24, N = 3, MIN: 8.42 / MAX: 52.88
Run 4: 9.49, SE +/- 0.26, N = 3, MIN: 8.37 / MAX: 32.97
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218, Target: CPU - Model: mobilenet (ms, Fewer Is Better)
Run 1: 32.98, SE +/- 0.19, N = 3, MIN: 28.26 / MAX: 78.43
Run 2: 33.18, SE +/- 0.39, N = 3, MIN: 28.2 / MAX: 91.27
Run 3: 32.41, SE +/- 0.33, N = 3, MIN: 28.41 / MAX: 63.13
Run 4: 32.46, SE +/- 0.19, N = 3, MIN: 28.48 / MAX: 81
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Node.js V8 Web Tooling Benchmark

Node.js V8 Web Tooling Benchmark (runs/s, More Is Better)
Run 1: 8.65, SE +/- 0.10, N = 3
Run 2: 8.62, SE +/- 0.05, N = 3
Run 3: 8.59, SE +/- 0.01, N = 3
Run 4: 8.58, SE +/- 0.03, N = 3
1. Nodejs v12.18.2

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

Embree 3.9.0, Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
Run 1: 8.0929, SE +/- 0.0098, N = 3, MIN: 8.04 / MAX: 8.18
Run 2: 8.1077, SE +/- 0.0066, N = 3, MIN: 8.07 / MAX: 8.19
Run 3: 8.1225, SE +/- 0.0169, N = 3, MIN: 8.06 / MAX: 8.22
Run 4: 8.1055, SE +/- 0.0021, N = 3, MIN: 8.07 / MAX: 8.21

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression 1.9.3, Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
Run 1: 8632.6, SE +/- 35.17, N = 3
Run 2: 8609.2, SE +/- 48.72, N = 3
Run 3: 8540.5, SE +/- 3.95, N = 3
Run 4: 8581.1, SE +/- 8.05, N = 5
1. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression 1.9.3, Compression Level: 9 - Compression Speed (MB/s, More Is Better)
Run 1: 41.76, SE +/- 0.18, N = 3
Run 2: 42.74, SE +/- 0.59, N = 3
Run 3: 41.46, SE +/- 0.68, N = 3
Run 4: 39.95, SE +/- 0.48, N = 5
1. (CC) gcc options: -O3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 13.30, SE +/- 0.30, N = 15, MIN: 11.83
Run 2: 12.81, SE +/- 0.35, N = 15, MIN: 11.43
Run 3: 12.83, SE +/- 0.33, N = 15, MIN: 11.13
Run 4: 11.72, SE +/- 0.03, N = 3, MIN: 11.31
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

Embree 3.9.0, Binary: Pathtracer - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
Run 1: 8.7299, SE +/- 0.0257, N = 3, MIN: 8.66 / MAX: 8.87
Run 2: 8.7041, SE +/- 0.0331, N = 3, MIN: 8.61 / MAX: 8.85
Run 3: 8.6989, SE +/- 0.0181, N = 3, MIN: 8.63 / MAX: 8.81
Run 4: 8.7484, SE +/- 0.0146, N = 3, MIN: 8.69 / MAX: 8.87

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 4159.90, SE +/- 18.33, N = 3, MIN: 4097.03
Run 2: 4128.18, SE +/- 5.21, N = 3, MIN: 4094.43
Run 3: 4033.64, SE +/- 9.61, N = 3, MIN: 3993.31
Run 4: 4036.80, SE +/- 2.27, N = 3, MIN: 4003.09
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Run 1: 4131.16, SE +/- 7.69, N = 3, MIN: 4094.86
Run 2: 4123.22, SE +/- 4.74, N = 3, MIN: 4081.81
Run 3: 4041.05, SE +/- 4.50, N = 3, MIN: 4008.21
Run 4: 4050.96, SE +/- 8.55, N = 3, MIN: 4004.81
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 4140.23, SE +/- 6.45, N = 3, MIN: 4108.52
Run 2: 4109.42, SE +/- 11.44, N = 3, MIN: 4057.07
Run 3: 4019.42, SE +/- 10.88, N = 3, MIN: 3984.28
Run 4: 4053.79, SE +/- 12.86, N = 3, MIN: 4006.16
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression 1.9.3, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
Run 1: 8678.0, SE +/- 4.85, N = 3
Run 2: 8644.6, SE +/- 26.35, N = 3
Run 3: 8604.3, SE +/- 42.93, N = 3
Run 4: 8527.7, SE +/- 3.80, N = 4
1. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression 1.9.3, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
Run 1: 43.21, SE +/- 0.64, N = 3
Run 2: 42.62, SE +/- 0.12, N = 3
Run 3: 42.42, SE +/- 0.18, N = 3
Run 4: 42.91, SE +/- 0.60, N = 4
1. (CC) gcc options: -O3

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 4.2.2, Time To Compile (Seconds, Fewer Is Better)
Run 1: 73.97, SE +/- 0.06, N = 3
Run 2: 73.73, SE +/- 0.07, N = 3
Run 3: 73.80, SE +/- 0.05, N = 3
Run 4: 73.95, SE +/- 0.11, N = 3

Embree

Binary: Pathtracer - Model: Crown

Embree 3.9.0, Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
Run 1: 8.2428, SE +/- 0.0132, N = 3, MIN: 8.17 / MAX: 8.38
Run 2: 8.3572, SE +/- 0.0330, N = 3, MIN: 8.25 / MAX: 8.51
Run 3: 8.3161, SE +/- 0.0343, N = 3, MIN: 8.2 / MAX: 8.47
Run 4: 8.3815, SE +/- 0.0144, N = 3, MIN: 8.31 / MAX: 8.52

x265

Video Input: Bosphorus 4K

x265 3.4, Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
Run 1: 8.32, SE +/- 0.05, N = 3
Run 2: 8.35, SE +/- 0.04, N = 3
Run 3: 8.20, SE +/- 0.08, N = 3
Run 4: 8.41, SE +/- 0.06, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30, Timed Time - Size 1,000 (Seconds, Fewer Is Better)
Run 1: 72.36, SE +/- 0.03, N = 3
Run 2: 71.64, SE +/- 0.31, N = 3
Run 3: 71.53, SE +/- 0.35, N = 3
Run 4: 71.69, SE +/- 0.14, N = 3
1. (CC) gcc options: -O2 -ldl -lz -lpthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.12, Settings: UASTC Level 3 (Seconds, Fewer Is Better)
Run 1: 71.20, SE +/- 0.02, N = 3
Run 2: 71.12, SE +/- 0.04, N = 3
Run 3: 71.49, SE +/- 0.02, N = 3
Run 4: 71.59, SE +/- 0.07, N = 3
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 3.9.0, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
Run 1: 8.8661, SE +/- 0.0541, N = 3, MIN: 8.75 / MAX: 9.05
Run 2: 8.9993, SE +/- 0.0651, N = 3, MIN: 8.87 / MAX: 9.22
Run 3: 9.0116, SE +/- 0.0605, N = 3, MIN: 8.87 / MAX: 9.2
Run 4: 8.8984, SE +/- 0.0112, N = 3, MIN: 8.84 / MAX: 9.01

rav1e

Speed: 1

rav1e 0.4 Alpha, Speed: 1 (Frames Per Second, More Is Better)
Run 1: 0.294, SE +/- 0.000, N = 3
Run 2: 0.296, SE +/- 0.000, N = 3
Run 3: 0.296, SE +/- 0.001, N = 3
Run 4: 0.296, SE +/- 0.001, N = 3

rav1e

Speed: 5

rav1e 0.4 Alpha, Speed: 5 (Frames Per Second, More Is Better)
Run 1: 0.886, SE +/- 0.001, N = 3
Run 2: 0.887, SE +/- 0.002, N = 3
Run 3: 0.890, SE +/- 0.002, N = 3
Run 4: 0.887, SE +/- 0.001, N = 3

Stockfish

Total Time

Stockfish 12, Total Time (Nodes Per Second, More Is Better)
Run 1: 15806704, SE +/- 155452.58, N = 3
Run 2: 15878386, SE +/- 45505.69, N = 3
Run 3: 16120996, SE +/- 133376.42, N = 3
Run 4: 15865176, SE +/- 251354.29, N = 3
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Embree

Binary: Pathtracer - Model: Asian Dragon

Embree 3.9.0, Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
Run 1: 9.3790, SE +/- 0.1023, N = 3, MIN: 9.18 / MAX: 9.67
Run 2: 9.3104, SE +/- 0.0676, N = 3, MIN: 9.15 / MAX: 9.54
Run 3: 9.4291, SE +/- 0.0053, N = 3, MIN: 9.36 / MAX: 9.56
Run 4: 9.4739, SE +/- 0.0870, N = 3, MIN: 9.3 / MAX: 9.76

IndigoBench

Acceleration: CPU - Scene: Bedroom

IndigoBench 4.4, Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better)
Run 1: 1.542, SE +/- 0.006, N = 3
Run 2: 1.544, SE +/- 0.001, N = 3
Run 3: 1.541, SE +/- 0.003, N = 3
Run 4: 1.540, SE +/- 0.002, N = 3

IndigoBench

Acceleration: CPU - Scene: Supercar

IndigoBench 4.4, Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better)
Run 1: 3.271, SE +/- 0.006, N = 3
Run 2: 3.251, SE +/- 0.026, N = 3
Run 3: 3.267, SE +/- 0.005, N = 3
Run 4: 3.259, SE +/- 0.020, N = 3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 5.42581, SE +/- 0.06192, N = 15, MIN: 4.84
Run 2: 5.49371, SE +/- 0.05422, N = 15, MIN: 4.8
Run 3: 5.47616, SE +/- 0.06592, N = 15, MIN: 4.66
Run 4: 5.46446, SE +/- 0.18172, N = 15, MIN: 4.66
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
Run 1: 10.25, SE +/- 0.02, N = 3
Run 2: 10.24, SE +/- 0.01, N = 3
Run 3: 10.20, SE +/- 0.02, N = 3
Run 4: 10.17, SE +/- 0.01, N = 3
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
Run 1: 18.17, SE +/- 0.02, N = 3
Run 2: 18.21, SE +/- 0.05, N = 3
Run 3: 18.26, SE +/- 0.04, N = 3
Run 4: 17.75, SE +/- 0.32, N = 12
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Sunflow Rendering System

Global Illumination + Image Synthesis

Sunflow Rendering System 0.07.2, Global Illumination + Image Synthesis (Seconds, Fewer Is Better)
Run 1: 1.483, SE +/- 0.025, N = 12, MIN: 1.2 / MAX: 2.51
Run 2: 1.468, SE +/- 0.015, N = 3, MIN: 1.3 / MAX: 2.03
Run 3: 1.489, SE +/- 0.021, N = 15, MIN: 1.19 / MAX: 2.55
Run 4: 1.504, SE +/- 0.019, N = 15, MIN: 1.25 / MAX: 2.55

simdjson

Throughput Test: DistinctUserID

simdjson 0.7.1, Throughput Test: DistinctUserID (GB/s, More Is Better)
Run 1: 0.51, SE +/- 0.00, N = 3
Run 2: 0.51, SE +/- 0.00, N = 3
Run 3: 0.51, SE +/- 0.00, N = 3
Run 4: 0.51, SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -pthread

Basis Universal

Settings: ETC1S

Basis Universal 1.12, Settings: ETC1S (Seconds, Fewer Is Better)
Run 1: 55.89, SE +/- 0.31, N = 3
Run 2: 56.23, SE +/- 0.28, N = 3
Run 3: 55.80, SE +/- 0.21, N = 3
Run 4: 56.25, SE +/- 0.39, N = 3
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

eSpeak-NG Speech Engine 20200907, Text-To-Speech Synthesis (Seconds, Fewer Is Better)
Run 1: 32.94, SE +/- 0.21, N = 4
Run 2: 32.92, SE +/- 0.07, N = 4
Run 3: 33.23, SE +/- 0.36, N = 7
Run 4: 33.11, SE +/- 0.10, N = 4
1. (CC) gcc options: -O2 -std=c99

rav1e

Speed: 6

rav1e 0.4 Alpha, Speed: 6 (Frames Per Second, More Is Better)
Run 1: 1.189, SE +/- 0.000, N = 3
Run 2: 1.192, SE +/- 0.000, N = 3
Run 3: 1.193, SE +/- 0.002, N = 3
Run 4: 1.188, SE +/- 0.001, N = 3

simdjson

Throughput Test: Kostya

simdjson 0.7.1, Throughput Test: Kostya (GB/s, More Is Better)
Run 1: 0.42, SE +/- 0.00, N = 3
Run 2: 0.42, SE +/- 0.00, N = 3
Run 3: 0.41, SE +/- 0.00, N = 3
Run 4: 0.41, SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -pthread

Redis

Test: GET

Redis 6.0.9, Test: GET (Requests Per Second, More Is Better)
Run 1: 2249907.25, SE +/- 28538.84, N = 3
Run 2: 2005461.67, SE +/- 22781.06, N = 15
Run 3: 1958857.87, SE +/- 17321.00, N = 15
Run 4: 1987133.40, SE +/- 23864.60, N = 15
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 10.71635, SE +/- 0.21181, N = 15, MIN: 9.13
Run 2: 9.14346, SE +/- 0.18072, N = 12, MIN: 7.91
Run 3: 7.90181, SE +/- 0.01281, N = 3, MIN: 7.35
Run 4: 8.13194, SE +/- 0.04220, N = 3, MIN: 7.45
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
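
The runs in the IP Shapes 1D f32 result above differ far more than in most other tests in this export. A rough way to judge whether such a gap exceeds run-to-run noise is to divide the difference between two means by their combined standard error. The sketch below is an illustration only: it assumes the runs are independent, it is not a formal significance test, and the helper name `difference_in_standard_errors` is hypothetical.

```python
import math

# (mean in ms, standard error) per run, copied from the result above.
runs = {
    1: (10.71635, 0.21181),
    2: (9.14346, 0.18072),
    3: (7.90181, 0.01281),
    4: (8.13194, 0.04220),
}


def difference_in_standard_errors(a, b):
    """Return |mean_a - mean_b| divided by the combined standard error.

    Assumes the two runs are independent, so their standard errors
    combine as the square root of the sum of squares.
    """
    (mean_a, se_a), (mean_b, se_b) = a, b
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


ratio_1_vs_3 = difference_in_standard_errors(runs[1], runs[3])
ratio_3_vs_4 = difference_in_standard_errors(runs[3], runs[4])
print(f"Run 1 vs run 3: {ratio_1_vs_3:.1f} combined standard errors apart")
print(f"Run 3 vs run 4: {ratio_3_vs_4:.1f} combined standard errors apart")
```

A ratio of only one or two would suggest the gap is within noise; larger ratios point to a real difference between the runs, though the cause (for example the differing software versions in the system details) cannot be read from these numbers alone.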

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.0, Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 1: 12.36450, SE +/- 0.14975, N = 3, MIN: 11.22
Run 2: 10.38859, SE +/- 0.11415, N = 13, MIN: 9.55
Run 3: 9.63577, SE +/- 0.02059, N = 3, MIN: 9.24
Run 4: 9.79079, SE +/- 0.02113, N = 3, MIN: 9.41
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.12 - Settings: UASTC Level 2
Seconds, Fewer Is Better
  1: 37.90 (SE +/- 0.03, N = 3)
  2: 37.91 (SE +/- 0.02, N = 3)
  3: 38.05 (SE +/- 0.03, N = 3)
  4: 38.14 (SE +/- 0.12, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.0 - Preset: Thorough
Seconds, Fewer Is Better
  1: 32.29 (SE +/- 0.03, N = 3)
  2: 32.32 (SE +/- 0.08, N = 3)
  3: 32.82 (SE +/- 0.05, N = 3)
  4: 32.82 (SE +/- 0.06, N = 3)
(CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Slow
Frames Per Second, More Is Better
  1: 16.12 (SE +/- 0.01, N = 3)
  2: 16.16 (SE +/- 0.01, N = 3)
  3: 16.18 (SE +/- 0.03, N = 3)
  4: 16.04 (SE +/- 0.02, N = 3)
(CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PHPBench

PHP Benchmark Suite

PHPBench 0.8.1 - PHP Benchmark Suite
Score, More Is Better
  1: 546954 (SE +/- 2554.27, N = 3)
  2: 551754 (SE +/- 2501.46, N = 3)
  3: 546637 (SE +/- 2838.45, N = 3)
  4: 544820 (SE +/- 4775.98, N = 3)

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Medium
Frames Per Second, More Is Better
  1: 16.49 (SE +/- 0.01, N = 3)
  2: 16.53 (SE +/- 0.01, N = 3)
  3: 16.55 (SE +/- 0.02, N = 3)
  4: 16.41 (SE +/- 0.01, N = 3)
(CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

Compression Level: 1 - Decompression Speed

LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed
MB/s, More Is Better
  1: 9086.1 (SE +/- 24.79, N = 3)
  2: 8892.2 (SE +/- 60.99, N = 3)
  3: 8941.1 (SE +/- 52.54, N = 3)
  4: 8886.7 (SE +/- 98.41, N = 3)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed
MB/s, More Is Better
  1: 8097.82 (SE +/- 28.48, N = 3)
  2: 8113.28 (SE +/- 17.78, N = 3)
  3: 8087.54 (SE +/- 32.31, N = 3)
  4: 8150.13 (SE +/- 131.10, N = 3)
(CC) gcc options: -O3

rav1e

Speed: 10

rav1e 0.4 Alpha - Speed: 10
Frames Per Second, More Is Better
  1: 2.742 (SE +/- 0.004, N = 3)
  2: 2.740 (SE +/- 0.005, N = 3)
  3: 2.748 (SE +/- 0.001, N = 3)
  4: 2.724 (SE +/- 0.008, N = 3)

Crafty

Elapsed Time

Crafty 25.2 - Elapsed Time
Nodes Per Second, More Is Better
  1: 6427875 (SE +/- 5253.17, N = 3)
  2: 6677773 (SE +/- 16676.43, N = 3)
  3: 6815751 (SE +/- 36420.36, N = 3)
  4: 6942505 (SE +/- 14408.91, N = 3)
(CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Redis

Test: LPOP

Redis 6.0.9 - Test: LPOP
Requests Per Second, More Is Better
  1: 2375150.83 (SE +/- 38987.55, N = 3)
  2: 2023932.30 (SE +/- 130082.76, N = 12)
  3: 1253261.46 (SE +/- 13558.64, N = 3)
  4: 1981072.63 (SE +/- 128187.02, N = 12)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Monkey Audio Encoding

WAV To APE

Monkey Audio Encoding 3.99.6 - WAV To APE
Seconds, Fewer Is Better
  1: 15.05 (SE +/- 0.04, N = 5)
  2: 15.02 (SE +/- 0.05, N = 5)
  3: 14.96 (SE +/- 0.05, N = 5)
  4: 15.02 (SE +/- 0.03, N = 5)
(CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second
Iterations/Sec, More Is Better
  1: 310974.49 (SE +/- 687.11, N = 3)
  2: 311216.67 (SE +/- 712.12, N = 3)
  3: 311939.82 (SE +/- 726.18, N = 3)
  4: 278575.83 (SE +/- 754.42, N = 3)
(CC) gcc options: -O2 -lrt" -lrt

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.3 - WAV To WavPack
Seconds, Fewer Is Better
  1: 14.07 (SE +/- 0.04, N = 5)
  2: 14.09 (SE +/- 0.04, N = 5)
  3: 14.05 (SE +/- 0.06, N = 5)
  4: 14.00 (SE +/- 0.02, N = 5)
(CXX) g++ options: -rdynamic

ASTC Encoder

Preset: Medium

ASTC Encoder 2.0 - Preset: Medium
Seconds, Fewer Is Better
  1: 9.82 (SE +/- 0.01, N = 3)
  2: 9.88 (SE +/- 0.01, N = 3)
  3: 9.05 (SE +/- 0.48, N = 15)
  4: 9.94 (SE +/- 0.03, N = 3)
(CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

x265

Video Input: Bosphorus 1080p

x265 3.4 - Video Input: Bosphorus 1080p
Frames Per Second, More Is Better
  1: 36.04 (SE +/- 0.11, N = 3)
  2: 36.59 (SE +/- 0.41, N = 3)
  3: 36.92 (SE +/- 0.53, N = 3)
  4: 36.18 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Very Fast
Frames Per Second, More Is Better
  1: 37.80 (SE +/- 0.02, N = 3)
  2: 37.98 (SE +/- 0.04, N = 3)
  3: 38.01 (SE +/- 0.01, N = 3)
  4: 37.63 (SE +/- 0.01, N = 3)
(CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  1: 6.17482 (SE +/- 0.02396, N = 3, MIN: 5.91)
  2: 5.99918 (SE +/- 0.03310, N = 3, MIN: 5.73)
  3: 5.94456 (SE +/- 0.01457, N = 3, MIN: 5.67)
  4: 5.97066 (SE +/- 0.01932, N = 3, MIN: 5.67)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.3.1 - WAV To Opus Encode
Seconds, Fewer Is Better
  1: 8.280 (SE +/- 0.032, N = 5)
  2: 8.274 (SE +/- 0.073, N = 5)
  3: 8.271 (SE +/- 0.072, N = 5)
  4: 8.430 (SE +/- 0.085, N = 5)
(CXX) g++ options: -fvisibility=hidden -logg -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein
ns/day, More Is Better
  1: 5.082 (SE +/- 0.048, N = 15)
  2: 5.035 (SE +/- 0.084, N = 3)
  3: 5.181 (SE +/- 0.028, N = 3)
  4: 4.983 (SE +/- 0.067, N = 15)
(CXX) g++ options: -O3 -pthread -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  1: 3.95326 (SE +/- 0.00294, N = 3, MIN: 3.76)
  2: 4.01853 (SE +/- 0.01890, N = 3, MIN: 3.77)
  3: 4.76518 (SE +/- 0.00744, N = 3, MIN: 4.45)
  4: 4.72633 (SE +/- 0.00471, N = 3, MIN: 4.43)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: LPUSH

Redis 6.0.9 - Test: LPUSH
Requests Per Second, More Is Better
  1: 1219147.04 (SE +/- 4917.09, N = 3)
  2: 1235368.71 (SE +/- 20084.29, N = 3)
  3: 1263884.50 (SE +/- 1897.80, N = 3)
  4: 1273184.81 (SE +/- 18142.13, N = 4)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

Redis 6.0.9 - Test: SET
Requests Per Second, More Is Better
  1: 1530872.96 (SE +/- 8681.25, N = 3)
  2: 1506982.37 (SE +/- 24558.39, N = 3)
  3: 1529187.75 (SE +/- 5357.49, N = 3)
  4: 1515400.67 (SE +/- 23926.24, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

Redis 6.0.9 - Test: SADD
Requests Per Second, More Is Better
  1: 1809735.13 (SE +/- 5744.03, N = 3)
  2: 1742844.00 (SE +/- 19144.33, N = 3)
  3: 1799244.42 (SE +/- 18395.63, N = 3)
  4: 1795470.25 (SE +/- 1945.31, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ASTC Encoder

Preset: Fast

ASTC Encoder 2.0 - Preset: Fast
Seconds, Fewer Is Better
  1: 6.68 (SE +/- 0.01, N = 3)
  2: 6.65 (SE +/- 0.02, N = 3)
  3: 6.86 (SE +/- 0.11, N = 3)
  4: 6.82 (SE +/- 0.07, N = 8)
(CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  1: 3.82872 (SE +/- 0.02757, N = 3, MIN: 3.54)
  2: 3.21759 (SE +/- 0.03800, N = 3, MIN: 2.82)
  3: 2.49334 (SE +/- 0.00288, N = 3, MIN: 2.24)
  4: 2.49930 (SE +/- 0.00658, N = 3, MIN: 2.22)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  1: 11.13180 (SE +/- 0.02924, N = 3, MIN: 10.5)
  2: 10.99210 (SE +/- 0.05948, N = 3, MIN: 10.44)
  3: 10.13790 (SE +/- 0.01279, N = 3, MIN: 9.63)
  4: 9.80423 (SE +/- 0.00941, N = 3, MIN: 9.3)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.12 - Settings: UASTC Level 0
Seconds, Fewer Is Better
  1: 8.938 (SE +/- 0.057, N = 3)
  2: 8.975 (SE +/- 0.010, N = 3)
  3: 8.999 (SE +/- 0.013, N = 3)
  4: 8.979 (SE +/- 0.020, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Ultra Fast
Frames Per Second, More Is Better
  1: 68.41 (SE +/- 0.05, N = 3)
  2: 68.78 (SE +/- 0.14, N = 3)
  3: 68.99 (SE +/- 0.23, N = 3)
  4: 68.49 (SE +/- 0.09, N = 3)
(CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  1: 20.28 (SE +/- 0.07, N = 3, MIN: 19.49)
  2: 20.78 (SE +/- 0.03, N = 3, MIN: 19.98)
  3: 21.72 (SE +/- 0.03, N = 3, MIN: 20.68)
  4: 21.57 (SE +/- 0.03, N = 3, MIN: 20.54)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  1: 19.87 (SE +/- 0.09, N = 3, MIN: 19.26)
  2: 20.11 (SE +/- 0.07, N = 3, MIN: 19.41)
  3: 21.15 (SE +/- 0.02, N = 3, MIN: 20.01)
  4: 20.97 (SE +/- 0.02, N = 3, MIN: 19.86)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  1: 11.52 (SE +/- 0.12, N = 3, MIN: 11.08)
  2: 11.65 (SE +/- 0.21, N = 14, MIN: 11.1)
  3: 11.54 (SE +/- 0.00, N = 3, MIN: 11.26)
  4: 11.46 (SE +/- 0.02, N = 3, MIN: 11.2)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  1: 16.74 (SE +/- 0.22, N = 4, MIN: 16)
  2: 15.60 (SE +/- 0.11, N = 3, MIN: 15.32)
  3: 14.76 (SE +/- 0.04, N = 3, MIN: 14.61)
  4: 14.74 (SE +/- 0.02, N = 3, MIN: 14.63)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

HPC Challenge 1.5.0 - Test / Class: Max Ping Pong Bandwidth
MB/s, More Is Better
  1: 11544.77 (SE +/- 77.45, N = 3)
  2: 11544.79 (SE +/- 76.15, N = 3)
  3: 11926.91 (SE +/- 142.05, N = 3)
  4: 11599.97 (SE +/- 414.56, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Bandwidth

HPC Challenge 1.5.0 - Test / Class: Random Ring Bandwidth
GB/s, More Is Better
  1: 1.45770 (SE +/- 0.00651, N = 3)
  2: 1.46282 (SE +/- 0.00883, N = 3)
  3: 1.46626 (SE +/- 0.01126, N = 3)
  4: 1.46081 (SE +/- 0.00683, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Latency

HPC Challenge 1.5.0 - Test / Class: Random Ring Latency
usecs, Fewer Is Better
  1: 0.56116 (SE +/- 0.00487, N = 3)
  2: 0.56775 (SE +/- 0.00616, N = 3)
  3: 0.55880 (SE +/- 0.00273, N = 3)
  4: 0.56703 (SE +/- 0.00200, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

HPC Challenge 1.5.0 - Test / Class: G-Random Access
GUP/s, More Is Better
  1: 0.02620 (SE +/- 0.00003, N = 3)
  2: 0.02615 (SE +/- 0.00005, N = 3)
  3: 0.02623 (SE +/- 0.00003, N = 3)
  4: 0.02621 (SE +/- 0.00002, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-STREAM Triad

HPC Challenge 1.5.0 - Test / Class: EP-STREAM Triad
GB/s, More Is Better
  1: 3.43263 (SE +/- 0.01066, N = 3)
  2: 3.47112 (SE +/- 0.00723, N = 3)
  3: 3.46886 (SE +/- 0.00101, N = 3)
  4: 3.46879 (SE +/- 0.00653, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ptrans

HPC Challenge 1.5.0 - Test / Class: G-Ptrans
GB/s, More Is Better
  1: 2.48824 (SE +/- 0.16798, N = 3)
  2: 2.83742 (SE +/- 0.01878, N = 3)
  3: 2.84596 (SE +/- 0.03257, N = 3)
  4: 2.77808 (SE +/- 0.05255, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

HPC Challenge 1.5.0 - Test / Class: EP-DGEMM
GFLOPS, More Is Better
  1: 19.76 (SE +/- 0.37, N = 3)
  2: 20.10 (SE +/- 0.20, N = 3)
  3: 20.31 (SE +/- 0.17, N = 3)
  4: 20.32 (SE +/- 0.10, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

HPC Challenge 1.5.0 - Test / Class: G-Ffte
GFLOPS, More Is Better
  1: 4.44974 (SE +/- 0.11339, N = 3)
  2: 4.46749 (SE +/- 0.07093, N = 3)
  3: 4.46806 (SE +/- 0.04593, N = 3)
  4: 4.41886 (SE +/- 0.05917, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
ATLAS + Open MPI 4.0.3
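
Each result in this export is reported as a mean with a standard error (SE) and a sample count (N). As a rough aid for reading the numbers, the sketch below compares two results by expressing their difference as a percentage and as a multiple of the combined standard error. The function name compare_results is hypothetical and not part of the Phoronix Test Suite, and with N as small as 3 the ratio is only an informal indication, not a formal significance test. The figures are the result 1 and result 4 values for HPC Challenge G-Ffte and the result 3 and result 4 values for Coremark, copied from this export.

```python
import math


def compare_results(mean_a, se_a, mean_b, se_b):
    """Return (percent change from a to b, difference / combined SE).

    Hypothetical helper for illustration; assumes the two results are
    independent so their standard errors add in quadrature.
    """
    diff = mean_b - mean_a
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    percent = 100.0 * diff / mean_a
    if combined_se == 0:
        # Both SEs were reported as zero: any nonzero gap is unbounded.
        ratio = 0.0 if diff == 0 else math.copysign(math.inf, diff)
    else:
        ratio = diff / combined_se
    return percent, ratio


# (mean, SE) pairs taken from the results listed in this export.
cases = [
    ("HPC Challenge G-Ffte, 1 vs 4", (4.44974, 0.11339), (4.41886, 0.05917)),
    ("Coremark, 3 vs 4", (311939.82, 726.18), (278575.83, 754.42)),
]

for name, a, b in cases:
    percent, ratio = compare_results(*a, *b)
    print(f"{name}: {percent:+.2f}% ({ratio:+.1f} x combined SE)")
```

A ratio well below about 2 in magnitude suggests the gap is within run-to-run noise, as with G-Ffte, while a large ratio, as with Coremark, points to a difference that the reported variance does not explain.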


Phoronix Test Suite v10.8.4