Core i7 7900X 2021

Intel Core i7-7900X testing with an ASRock X299 Extreme4 (P1.50 BIOS) and Zotac NVIDIA GeForce GT 610 1GB on Ubuntu 19.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102017-HA-COREI779055&rdt&grr.

Core i7 7900X 2021: system details (runs 1, 3, 2)

Processor: Intel Core i7-7900X @ 4.50GHz (10 Cores / 20 Threads)
Motherboard: ASRock X299 Extreme4 (P1.50 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 16GB
Disk: 120GB Corsair Force MP500
Graphics: Zotac NVIDIA GeForce GT 610 1GB
Audio: Realtek ALC1220
Monitor: LG Ultra HD
Network: Intel I219-V
OS: Ubuntu 19.04
Kernel: 5.0.0-38-generic (x86_64)
Display Server: X Server 1.20.4
Display Driver: zotac
Compiler: GCC 11.0.0 20200929
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortran
Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2000064
Python Details: Python 2.7.16 + Python 3.7.3
Security Details: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

Core i7 7900X 2021: result overview

Test | 1 | 3 | 2
openfoam: Motorbike 60M | 701.37 | 701.62 | 701.05
gcrypt | 196.670 | 196.151 | 196.534
build-godot: Time To Compile | 142.104 | 142.016 | 144.277
build2: Time To Compile | 139.470 | 138.146 | 137.884
openfoam: Motorbike 30M | 139.55 | 139.17 | 139.50
financebench: Bonds OpenMP | 52224.902344 | 52234.414062 | 53124.960938
onnx: fcn-resnet101-11 - OpenMP CPU | 103 | 103 | 103
onnx: bertsquad-10 - OpenMP CPU | 728 | 725 | 720
onnx: yolov4 - OpenMP CPU | 483 | 486 | 486
onnx: shufflenet-v2-10 - OpenMP CPU | 11550 | 11540 | 11588
onnx: super-resolution-10 - OpenMP CPU | 7081 | 7133 | 7110
mnn: inception-v3 | 41.067 | 41.021 | 41.021
mnn: mobilenet-v1-1.0 | 4.985 | 4.976 | 4.968
mnn: MobileNetV2_224 | 4.558 | 4.455 | 4.524
mnn: resnet-v2-50 | 40.170 | 40.254 | 40.195
mnn: SqueezeNetV1.0 | 7.351 | 7.347 | 7.340
build-eigen: Time To Compile | 87.929 | 87.870 | 88.053
onednn: Recurrent Neural Network Training - f32 - CPU | 2104.77 | 2187.26 | 2088.20
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 2104.79 | 2146.86 | 2110.87
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 2104.21 | 2164.69 | 2096.25
onednn: Recurrent Neural Network Inference - f32 - CPU | 1170.55 | 1239.50 | 1157.22
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 1172.55 | 1214.47 | 1154.54
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 1168.78 | 1218.95 | 1154.72
gnupg: 2.7GB Sample File Encryption | 66.814 | 66.626 | 66.458
unpack-firefox: firefox-84.0.source.tar.xz | 21.429 | 19.813 | 20.078
kripke | 58192643 | 57484260 | 57582573
ncnn: CPU - regnety_400m | 19.39 | 19.49 | 19.41
ncnn: CPU - squeezenet_ssd | 19.52 | 19.26 | 19.49
ncnn: CPU - yolov4-tiny | 25.86 | 25.87 | 25.86
ncnn: CPU - resnet50 | 23.03 | 22.82 | 23.08
ncnn: CPU - alexnet | 9.00 | 8.99 | 9.00
ncnn: CPU - resnet18 | 11.73 | 11.81 | 11.78
ncnn: CPU - vgg16 | 41.02 | 40.41 | 41.29
ncnn: CPU - googlenet | 13.81 | 13.52 | 13.25
ncnn: CPU - blazeface | 2.45 | 2.36 | 2.33
ncnn: CPU - efficientnet-b0 | 6.73 | 6.69 | 6.63
ncnn: CPU - mnasnet | 4.93 | 4.82 | 4.86
ncnn: CPU - shufflenet-v2 | 6.26 | 6.09 | 6.10
ncnn: CPU-v3-v3 - mobilenet-v3 | 4.53 | 4.36 | 4.32
ncnn: CPU-v2-v2 - mobilenet-v2 | 5.40 | 5.25 | 5.24
ncnn: CPU - mobilenet | 17.44 | 17.41 | 17.70
compress-zstd: 19 | 54.2 | 54.4 | 54.3
onednn: IP Shapes 1D - f32 - CPU | 3.22527 | 3.61773 | 3.17683
rav1e: 5 | 1.093 | 1.093 | 1.092
rav1e: 1 | 0.394 | 0.394 | 0.394
financebench: Repo OpenMP | 37075.502604 | 36914.419271 | 36848.841146
rav1e: 6 | 1.437 | 1.439 | 1.438
compress-7zip: Compress Speed Test | 60523 | 60267 | 60395
cryptsetup: Twofish-XTS 512b Decryption | 455.9 | 455.7 | 454.8
cryptsetup: Twofish-XTS 512b Encryption | 454.1 | 452.8 | 453.8
cryptsetup: Serpent-XTS 512b Decryption | 836.3 | 837.5 | 836.8
cryptsetup: Serpent-XTS 512b Encryption | 818.1 | 816.7 | 817.9
cryptsetup: AES-XTS 512b Decryption | 2274.4 | 2266.1 | 2264.7
cryptsetup: AES-XTS 512b Encryption | 2312.5 | 2307.5 | 2306.8
cryptsetup: Twofish-XTS 256b Decryption | 455.9 | 455.9 | 456.1
cryptsetup: Twofish-XTS 256b Encryption | 452.0 | 453.7 | 454.0
cryptsetup: Serpent-XTS 256b Decryption | 838.8 | 837.4 | 837.7
cryptsetup: Serpent-XTS 256b Encryption | 813.9 | 817.0 | 817.8
cryptsetup: AES-XTS 256b Decryption | 2415.0 | 2410.0 | 2407.6
cryptsetup: AES-XTS 256b Encryption | 2488.3 | 2492.4 | 2494.4
cryptsetup: PBKDF2-whirlpool | 769880 | 769505 | 769505
cryptsetup: PBKDF2-sha512 | 1802731 | 1805829 | 1801708
lzbench: XZ 0 - Decompression | 124 | 124 | 124
lzbench: XZ 0 - Compression | 46 | 45 | 46
coremark: CoreMark Size 666 - Iterations Per Second | 393486.160982 | 393426.968130 | 392945.702377
lzbench: Crush 0 - Decompression | 532 | 532 | 532
lzbench: Crush 0 - Compression | 110 | 110 | 109
synthmark: VoiceMark_100 | 617.575 | 616.744 | 618.250
qmcpack: simple-H2O | 29.224 | 29.026 | 29.072
rav1e: 10 | 3.112 | 3.110 | 3.119
quantlib | 2765.1 | 2744.6 | 2737.5
compress-zstd: 3 | 4359.7 | 4364.8 | 4365.4
lzbench: Brotli 2 - Decompression | 766 | 766 | 766
lzbench: Brotli 2 - Compression | 199 | 199 | 199
lzbench: Brotli 0 - Decompression | 661 | 660 | 662
lzbench: Brotli 0 - Compression | 483 | 484 | 484
lzbench: Zstd 1 - Decompression | 1709 | 1709 | 1707
lzbench: Zstd 1 - Compression | 534 | 535 | 532
lzbench: Zstd 8 - Decompression | 1723 | 1721 | 1720
lzbench: Zstd 8 - Compression | 90 | 89 | 89
tnn: CPU - MobileNet v2 | 314.945 | 315.069 | 315.021
onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU | 15.7443 | 15.7208 | 15.7446
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 1.46755 | 1.48004 | 1.46542
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 3.27351 | 3.36635 | 3.23948
tnn: CPU - SqueezeNet v1.1 | 296.122 | 296.093 | 296.016
amg | 429666875 | 435619400 | 435646267
onednn: IP Shapes 1D - u8s8f32 - CPU | 1.23217 | 1.24672 | 1.19520
redis: LPUSH | 1889307.21 | 1927407.21 | 1905897.88
onednn: IP Shapes 1D - bf16bf16bf16 - CPU | 8.07550 | 8.09521 | 8.07076
redis: SET | 2149992.08 | 2174653.00 | 2181880.08
redis: LPOP | 2987638.08 | 2013855.59 | 1984735.79
redis: SADD | 2420991.58 | 2433452.33 | 2410939.67
redis: GET | 2861475.42 | 2765324.33 | 2750396.00
lzbench: Libdeflate 1 - Compression | 268 | 268 | 268
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 2.06308 | 2.03981 | 2.06901
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 0.864136 | 0.844465 | 0.894479
onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU | 2.86817 | 2.83983 | 2.89487
onednn: IP Shapes 3D - bf16bf16bf16 - CPU | 3.63392 | 3.76017 | 3.60667
onednn: IP Shapes 3D - u8s8f32 - CPU | 1.67560 | 1.81677 | 1.64785
onednn: IP Shapes 3D - f32 - CPU | 6.20063 | 6.20432 | 5.80972
lulesh | 6332.7953 | 6401.2926 | 6440.0792
onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU | 12.5778 | 12.5696 | 12.5902
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 2.20381 | 2.40359 | 2.16313
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 9.99665 | 9.88730 | 9.98915
onednn: Convolution Batch Shapes Auto - f32 - CPU | 10.6074 | 10.5196 | 10.6067
onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU | 16.4030 | 16.8707 | 16.3485
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 3.50970 | 3.69979 | 3.50573
lammps: Rhodopsin Protein | 7.425 | 7.420 | 7.323

OpenFOAM

Input: Motorbike 60M

OpenFOAM 8 (Seconds, Fewer Is Better)
Run 1: 701.37 (SE +/- 0.21, N = 3)
Run 3: 701.62 (SE +/- 0.49, N = 3)
Run 2: 701.05 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
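
Each chart in this export carries an "SE +/- x, N = n" annotation per run. Assuming this denotes the standard error of the mean over n trial runs (an assumption; the per-trial timings are not part of this export, so the sample values below are hypothetical), the figure could be reproduced roughly as follows:

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-trial timings in seconds (NOT taken from the result file).
trials = [701.0, 701.4, 701.7]
mean = statistics.mean(trials)
se = standard_error(trials)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(trials)})")
```

A small SE relative to the mean suggests the trials within a run agreed closely; it says nothing about differences between runs 1, 3 and 2.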

Gcrypt Library

Gcrypt Library 1.9 (Seconds, Fewer Is Better)
Run 1: 196.67 (SE +/- 0.56, N = 3)
Run 3: 196.15 (SE +/- 0.19, N = 3)
Run 2: 196.53 (SE +/- 0.22, N = 3)
1. (CC) gcc options: -O2 -fvisibility=hidden

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
Run 1: 142.10 (SE +/- 0.08, N = 3)
Run 3: 142.02 (SE +/- 0.16, N = 3)
Run 2: 144.28 (SE +/- 1.71, N = 5)

Build2

Time To Compile

Build2 0.13 (Seconds, Fewer Is Better)
Run 1: 139.47 (SE +/- 1.89, N = 4)
Run 3: 138.15 (SE +/- 0.49, N = 3)
Run 2: 137.88 (SE +/- 1.71, N = 3)

OpenFOAM

Input: Motorbike 30M

OpenFOAM 8 (Seconds, Fewer Is Better)
Run 1: 139.55 (SE +/- 0.03, N = 3)
Run 3: 139.17 (SE +/- 0.08, N = 3)
Run 2: 139.50 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

FinanceBench

Benchmark: Bonds OpenMP

FinanceBench 2016-07-25 (ms, Fewer Is Better)
Run 1: 52224.90 (SE +/- 109.70, N = 3)
Run 3: 52234.41 (SE +/- 73.43, N = 3)
Run 2: 53124.96 (SE +/- 680.64, N = 15)
1. (CXX) g++ options: -O3 -march=native -fopenmp

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 1: 103 (SE +/- 0.29, N = 3)
Run 3: 103 (SE +/- 0.17, N = 3)
Run 2: 103 (SE +/- 0.44, N = 3)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 1: 728 (SE +/- 0.73, N = 3)
Run 3: 725 (SE +/- 2.35, N = 3)
Run 2: 720 (SE +/- 0.93, N = 3)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 1: 483 (SE +/- 1.45, N = 3)
Run 3: 486 (SE +/- 0.67, N = 3)
Run 2: 486 (SE +/- 0.88, N = 3)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 1: 11550 (SE +/- 10.09, N = 3)
Run 3: 11540 (SE +/- 52.90, N = 3)
Run 2: 11588 (SE +/- 16.55, N = 3)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
Run 1: 7081 (SE +/- 9.39, N = 3)
Run 3: 7133 (SE +/- 22.59, N = 3)
Run 2: 7110 (SE +/- 17.72, N = 3)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Run 1: 41.07 (SE +/- 0.04, N = 3; MIN: 40.75 / MAX: 44.98)
Run 3: 41.02 (SE +/- 0.07, N = 3; MIN: 40.55 / MAX: 45.48)
Run 2: 41.02 (SE +/- 0.05, N = 3; MIN: 40.66 / MAX: 45.66)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Run 1: 4.985 (SE +/- 0.021, N = 3; MIN: 4.88 / MAX: 6.46)
Run 3: 4.976 (SE +/- 0.018, N = 3; MIN: 4.86 / MAX: 6.43)
Run 2: 4.968 (SE +/- 0.018, N = 3; MIN: 4.88 / MAX: 6.39)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Run 1: 4.558 (SE +/- 0.024, N = 3; MIN: 4.3 / MAX: 5.13)
Run 3: 4.455 (SE +/- 0.056, N = 3; MIN: 4.11 / MAX: 5.29)
Run 2: 4.524 (SE +/- 0.039, N = 3; MIN: 4.24 / MAX: 5.44)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Run 1: 40.17 (SE +/- 0.11, N = 3; MIN: 39.79 / MAX: 45.68)
Run 3: 40.25 (SE +/- 0.16, N = 3; MIN: 39.87 / MAX: 41.87)
Run 2: 40.20 (SE +/- 0.03, N = 3; MIN: 39.88 / MAX: 44.04)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
Run 1: 7.351 (SE +/- 0.029, N = 3; MIN: 7.03 / MAX: 8.65)
Run 3: 7.347 (SE +/- 0.025, N = 3; MIN: 7.04 / MAX: 10.82)
Run 2: 7.340 (SE +/- 0.023, N = 3; MIN: 7.01 / MAX: 8.96)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Timed Eigen Compilation

Time To Compile

Timed Eigen Compilation 3.3.9 (Seconds, Fewer Is Better)
Run 1: 87.93 (SE +/- 0.25, N = 3)
Run 3: 87.87 (SE +/- 0.18, N = 3)
Run 2: 88.05 (SE +/- 0.19, N = 3)

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 2104.77 (SE +/- 2.16, N = 3; MIN: 2095.67)
Run 3: 2187.26 (SE +/- 9.79, N = 3; MIN: 2167.59)
Run 2: 2088.20 (SE +/- 5.39, N = 3; MIN: 2073.68)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
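
One way to read three runs of the same test is to look at the gap between the fastest and slowest result relative to the fastest. A minimal sketch using the three oneDNN f32 training times reported here; the function name `relative_spread` is an illustrative choice, not part of the Phoronix Test Suite:

```python
def relative_spread(values):
    """Return (max - min) / min for a list of positive results."""
    lo, hi = min(values), max(values)
    if lo <= 0:
        raise ValueError("values must be positive")
    return (hi - lo) / lo


# Results in ms for runs 1, 3 and 2 (fewer is better).
rnn_training_f32 = [2104.77, 2187.26, 2088.20]
spread = relative_spread(rnn_training_f32)
print(f"spread between runs: {spread:.1%}")
```

Comparing this spread with the per-run SE values gives a rough sense of whether a difference between runs exceeds the trial-to-trial noise.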

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 2104.79 (SE +/- 2.17, N = 3; MIN: 2087.83)
Run 3: 2146.86 (SE +/- 11.05, N = 3; MIN: 2120.06)
Run 2: 2110.87 (SE +/- 10.38, N = 3; MIN: 2079.23)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 2104.21 (SE +/- 3.56, N = 3; MIN: 2088.02)
Run 3: 2164.69 (SE +/- 18.06, N = 3; MIN: 2131.8)
Run 2: 2096.25 (SE +/- 3.05, N = 3; MIN: 2078.2)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 1170.55 (SE +/- 1.50, N = 3; MIN: 1163.52)
Run 3: 1239.50 (SE +/- 12.35, N = 3; MIN: 1216.45)
Run 2: 1157.22 (SE +/- 3.58, N = 3; MIN: 1147.67)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 1172.55 (SE +/- 1.66, N = 3; MIN: 1166.22)
Run 3: 1214.47 (SE +/- 7.34, N = 3; MIN: 1197.62)
Run 2: 1154.54 (SE +/- 0.72, N = 3; MIN: 1148.49)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 1168.78 (SE +/- 1.92, N = 3; MIN: 1161.69)
Run 3: 1218.95 (SE +/- 13.30, N = 3; MIN: 1190.49)
Run 2: 1154.72 (SE +/- 0.17, N = 3; MIN: 1150.38)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

GnuPG

2.7GB Sample File Encryption

GnuPG 2.2.27 (Seconds, Fewer Is Better)
Run 1: 66.81 (SE +/- 0.41, N = 3)
Run 3: 66.63 (SE +/- 0.22, N = 3)
Run 2: 66.46 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O2

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

Unpacking Firefox 84.0 (Seconds, Fewer Is Better)
Run 1: 21.43 (SE +/- 0.96, N = 20)
Run 3: 19.81 (SE +/- 0.07, N = 4)
Run 2: 20.08 (SE +/- 0.12, N = 4)

Kripke

Kripke 1.2.4 (Throughput FoM, More Is Better)
Run 1: 58192643 (SE +/- 266395.21, N = 3)
Run 3: 57484260 (SE +/- 233375.54, N = 3)
Run 2: 57582573 (SE +/- 36554.18, N = 3)
1. (CXX) g++ options: -O3 -fopenmp

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 19.39 (SE +/- 0.04, N = 3; MIN: 19.13 / MAX: 20.04)
Run 3: 19.49 (SE +/- 0.06, N = 3; MIN: 19.22 / MAX: 35.84)
Run 2: 19.41 (SE +/- 0.09, N = 3; MIN: 19.11 / MAX: 21.01)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 19.52 (SE +/- 0.19, N = 3; MIN: 19.22 / MAX: 20.82)
Run 3: 19.26 (SE +/- 0.14, N = 3; MIN: 18.97 / MAX: 19.61)
Run 2: 19.49 (SE +/- 0.13, N = 3; MIN: 19.18 / MAX: 20.34)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 25.86 (SE +/- 0.25, N = 3; MIN: 25.54 / MAX: 26.92)
Run 3: 25.87 (SE +/- 0.21, N = 3; MIN: 25.52 / MAX: 27.64)
Run 2: 25.86 (SE +/- 0.22, N = 3; MIN: 25.5 / MAX: 27.62)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 23.03 (SE +/- 0.06, N = 3; MIN: 22.81 / MAX: 23.45)
Run 3: 22.82 (SE +/- 0.23, N = 3; MIN: 22.23 / MAX: 24.44)
Run 2: 23.08 (SE +/- 0.09, N = 3; MIN: 22.8 / MAX: 27.95)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 9.00 (SE +/- 0.02, N = 3; MIN: 8.94 / MAX: 11.7)
Run 3: 8.99 (SE +/- 0.01, N = 3; MIN: 8.92 / MAX: 9.61)
Run 2: 9.00 (SE +/- 0.02, N = 3; MIN: 8.93 / MAX: 9.34)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 11.73 (SE +/- 0.16, N = 3; MIN: 11.32 / MAX: 13.34)
Run 3: 11.81 (SE +/- 0.22, N = 3; MIN: 11.28 / MAX: 15.06)
Run 2: 11.78 (SE +/- 0.19, N = 3; MIN: 11.35 / MAX: 12.58)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 41.02 (SE +/- 0.21, N = 3; MIN: 40.71 / MAX: 42.29)
Run 3: 40.41 (SE +/- 0.52, N = 3; MIN: 39.37 / MAX: 42.53)
Run 2: 41.29 (SE +/- 0.31, N = 3; MIN: 40.61 / MAX: 48.77)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 13.81 (SE +/- 0.07, N = 3; MIN: 13.67 / MAX: 14.03)
Run 3: 13.52 (SE +/- 0.24, N = 3; MIN: 13.16 / MAX: 14.54)
Run 2: 13.25 (SE +/- 0.02, N = 3; MIN: 13.16 / MAX: 13.37)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 2.45 (SE +/- 0.09, N = 3; MIN: 2.28 / MAX: 2.68)
Run 3: 2.36 (SE +/- 0.05, N = 3; MIN: 2.25 / MAX: 8.7)
Run 2: 2.33 (SE +/- 0.06, N = 3; MIN: 2.25 / MAX: 2.65)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 6.73 (SE +/- 0.12, N = 3; MIN: 6.46 / MAX: 8.32)
Run 3: 6.69 (SE +/- 0.02, N = 3; MIN: 6.54 / MAX: 6.85)
Run 2: 6.63 (SE +/- 0.03, N = 3; MIN: 6.5 / MAX: 7.34)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 4.93 (SE +/- 0.07, N = 3; MIN: 4.7 / MAX: 5.15)
Run 3: 4.82 (SE +/- 0.06, N = 3; MIN: 4.65 / MAX: 5.05)
Run 2: 4.86 (SE +/- 0.05, N = 3; MIN: 4.61 / MAX: 5.17)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 6.26 (SE +/- 0.02, N = 3; MIN: 6.2 / MAX: 6.36)
Run 3: 6.09 (SE +/- 0.06, N = 3; MIN: 5.97 / MAX: 6.3)
Run 2: 6.10 (SE +/- 0.09, N = 3; MIN: 5.95 / MAX: 6.36)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 4.53 (SE +/- 0.13, N = 3; MIN: 4.36 / MAX: 4.86)
Run 3: 4.36 (SE +/- 0.06, N = 3; MIN: 4.25 / MAX: 5.81)
Run 2: 4.32 (SE +/- 0.04, N = 3; MIN: 4.24 / MAX: 4.46)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 5.40 (SE +/- 0.10, N = 3; MIN: 5.12 / MAX: 6.21)
Run 3: 5.25 (SE +/- 0.03, N = 3; MIN: 5.12 / MAX: 8.36)
Run 2: 5.24 (SE +/- 0.02, N = 3; MIN: 5.05 / MAX: 6.67)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218 (ms, Fewer Is Better)
Run 1: 17.44 (SE +/- 0.14, N = 3; MIN: 17.18 / MAX: 19.14)
Run 3: 17.41 (SE +/- 0.11, N = 3; MIN: 17.21 / MAX: 17.72)
Run 2: 17.70 (SE +/- 0.11, N = 3; MIN: 17.51 / MAX: 18.43)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Zstd Compression

Compression Level: 19

Zstd Compression 1.4.5 (MB/s, More Is Better)
Run 1: 54.2 (SE +/- 0.32, N = 3)
Run 3: 54.4 (SE +/- 0.06, N = 3)
Run 2: 54.3 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 3.22527 (SE +/- 0.03023, N = 15; MIN: 3.08)
Run 3: 3.61773 (SE +/- 0.10171, N = 15; MIN: 3.08)
Run 2: 3.17683 (SE +/- 0.00055, N = 3; MIN: 3.03)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 5

rav1e 0.4 (Frames Per Second, More Is Better)
Run 1: 1.093 (SE +/- 0.001, N = 3)
Run 3: 1.093 (SE +/- 0.002, N = 3)
Run 2: 1.092 (SE +/- 0.001, N = 3)

rav1e

Speed: 1

rav1e 0.4 (Frames Per Second, More Is Better)
Run 1: 0.394 (SE +/- 0.000, N = 3)
Run 3: 0.394 (SE +/- 0.000, N = 3)
Run 2: 0.394 (SE +/- 0.000, N = 3)

FinanceBench

Benchmark: Repo OpenMP

FinanceBench 2016-07-25 (ms, Fewer Is Better)
Run 1: 37075.50 (SE +/- 163.40, N = 3)
Run 3: 36914.42 (SE +/- 56.51, N = 3)
Run 2: 36848.84 (SE +/- 15.63, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

rav1e

Speed: 6

rav1e 0.4 (Frames Per Second, More Is Better)
Run 1: 1.437 (SE +/- 0.002, N = 3)
Run 3: 1.439 (SE +/- 0.000, N = 3)
Run 2: 1.438 (SE +/- 0.002, N = 3)

7-Zip Compression

Compress Speed Test

7-Zip Compression 16.02 (MIPS, More Is Better)
Run 1: 60523 (SE +/- 160.37, N = 3)
Run 3: 60267 (SE +/- 543.97, N = 3)
Run 2: 60395 (SE +/- 519.08, N = 3)
1. (CXX) g++ options: -pipe -lpthread

Cryptsetup

Twofish-XTS 512b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 455.9 (SE +/- 0.20, N = 3)
Run 3: 455.7 (SE +/- 0.12, N = 3)
Run 2: 454.8 (SE +/- 0.85, N = 2)

Cryptsetup

Twofish-XTS 512b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 454.1 (SE +/- 0.39, N = 3)
Run 3: 452.8 (SE +/- 0.79, N = 3)
Run 2: 453.8 (SE +/- 0.21, N = 3)

Cryptsetup

Serpent-XTS 512b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 836.3 (SE +/- 1.94, N = 3)
Run 3: 837.5 (SE +/- 0.32, N = 3)
Run 2: 836.8 (SE +/- 0.93, N = 3)

Cryptsetup

Serpent-XTS 512b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 818.1 (SE +/- 0.09, N = 3)
Run 3: 816.7 (SE +/- 0.57, N = 3)
Run 2: 817.9 (SE +/- 0.91, N = 3)

Cryptsetup

AES-XTS 512b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 2274.4 (SE +/- 4.20, N = 3)
Run 3: 2266.1 (SE +/- 1.94, N = 3)
Run 2: 2264.7 (SE +/- 5.64, N = 3)

Cryptsetup

AES-XTS 512b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 2312.5 (SE +/- 7.46, N = 3)
Run 3: 2307.5 (SE +/- 6.30, N = 3)
Run 2: 2306.8 (SE +/- 5.67, N = 3)

Cryptsetup

Twofish-XTS 256b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 455.9 (SE +/- 0.42, N = 3)
Run 3: 455.9 (SE +/- 0.22, N = 3)
Run 2: 456.1 (SE +/- 0.26, N = 3)

Cryptsetup

Twofish-XTS 256b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 452.0 (SE +/- 2.45, N = 3)
Run 3: 453.7 (SE +/- 0.82, N = 3)
Run 2: 454.0 (SE +/- 0.67, N = 3)

Cryptsetup

Serpent-XTS 256b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 838.8 (SE +/- 0.65, N = 3)
Run 3: 837.4 (SE +/- 0.09, N = 3)
Run 2: 837.7 (SE +/- 0.95, N = 3)

Cryptsetup

Serpent-XTS 256b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 813.9 (SE +/- 5.12, N = 3)
Run 3: 817.0 (SE +/- 0.99, N = 3)
Run 2: 817.8 (SE +/- 0.15, N = 3)

Cryptsetup

AES-XTS 256b Decryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 2415.0 (SE +/- 5.28, N = 3)
Run 3: 2410.0 (SE +/- 3.03, N = 3)
Run 2: 2407.6 (SE +/- 3.87, N = 3)

Cryptsetup

AES-XTS 256b Encryption

Cryptsetup (MiB/s, More Is Better)
Run 1: 2488.3 (SE +/- 13.15, N = 3)
Run 3: 2492.4 (SE +/- 1.44, N = 3)
Run 2: 2494.4 (SE +/- 6.84, N = 3)

Cryptsetup

PBKDF2-whirlpool

Cryptsetup (Iterations Per Second, More Is Better)
Run 1: 769880 (SE +/- 652.69, N = 3)
Run 3: 769505 (SE +/- 995.18, N = 3)
Run 2: 769505 (SE +/- 995.18, N = 3)

Cryptsetup

PBKDF2-sha512

Cryptsetup (Iterations Per Second, More Is Better)
Run 1: 1802731 (SE +/- 4507.19, N = 3)
Run 3: 1805829 (SE +/- 3732.03, N = 3)
Run 2: 1801708 (SE +/- 5362.16, N = 3)

lzbench

Test: XZ 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 124
Run 3: 124
Run 2: 124
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 46
Run 3: 45
Run 2: 46
SE (where reported): +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
Run 1: 393486.16 (SE +/- 324.29, N = 3)
Run 3: 393426.97 (SE +/- 551.11, N = 3)
Run 2: 392945.70 (SE +/- 493.57, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

lzbench

Test: Crush 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 532
Run 3: 532
Run 2: 532
SE (where reported): +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 110
Run 3: 110
Run 2: 109
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Google SynthMark

Test: VoiceMark_100

Google SynthMark 20201109 (Voices, More Is Better)
Run 1: 617.58 (SE +/- 2.52, N = 3)
Run 3: 616.74 (SE +/- 2.92, N = 3)
Run 2: 618.25 (SE +/- 3.42, N = 3)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

QMCPACK

Input: simple-H2O

QMCPACK 3.10 (Total Execution Time - Seconds, Fewer Is Better)
Run 1: 29.22 (SE +/- 0.05, N = 3)
Run 3: 29.03 (SE +/- 0.12, N = 3)
Run 2: 29.07 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -lm -pthread

rav1e

Speed: 10

rav1e 0.4 (Frames Per Second, More Is Better)
Run 1: 3.112 (SE +/- 0.005, N = 3)
Run 3: 3.110 (SE +/- 0.008, N = 3)
Run 2: 3.119 (SE +/- 0.013, N = 3)

QuantLib

QuantLib 1.21 (MFLOPS, More Is Better)
Run 1: 2765.1 (SE +/- 3.07, N = 3)
Run 3: 2744.6 (SE +/- 29.12, N = 3)
Run 2: 2737.5 (SE +/- 32.26, N = 3)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lboost_timer -lboost_system -lboost_chrono

Zstd Compression

Compression Level: 3

Zstd Compression 1.4.5 (MB/s, More Is Better)
Run 1: 4359.7 (SE +/- 1.62, N = 3)
Run 3: 4364.8 (SE +/- 10.23, N = 3)
Run 2: 4365.4 (SE +/- 2.12, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

lzbench

Test: Brotli 2 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 766
Run 3: 766
Run 2: 766
SE (where reported): +/- 0.67, N = 3; +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 199
Run 3: 199
Run 2: 199
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 661
Run 3: 660
Run 2: 662
SE (where reported): +/- 2.00, N = 3; +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 483 (SE +/- 1.67, N = 3)
Run 3: 484 (SE +/- 0.58, N = 3)
Run 2: 484 (SE +/- 1.00, N = 3)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 1709
Run 3: 1709
Run 2: 1707
SE (where reported): +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 534 (SE +/- 3.71, N = 3)
Run 3: 535 (SE +/- 2.33, N = 3)
Run 2: 532 (SE +/- 3.33, N = 3)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 1723
Run 3: 1721
Run 2: 1720
SE (where reported): +/- 1.20, N = 3; +/- 1.15, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
Run 1: 90
Run 3: 89
Run 2: 89
SE (where reported): +/- 0.33, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 (ms, Fewer Is Better)
Run 1: 314.95 (SE +/- 0.08, N = 3; MIN: 314.32 / MAX: 315.9)
Run 3: 315.07 (SE +/- 0.04, N = 3; MIN: 314.61 / MAX: 315.97)
Run 2: 315.02 (SE +/- 0.04, N = 3; MIN: 314.45 / MAX: 316.38)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 15.74 (SE +/- 0.01, N = 3; MIN: 15.5)
Run 3: 15.72 (SE +/- 0.01, N = 3; MIN: 15.52)
Run 2: 15.74 (SE +/- 0.00, N = 3; MIN: 15.5)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 1.46755 (SE +/- 0.00099, N = 3; MIN: 1.45)
Run 3: 1.48004 (SE +/- 0.00459, N = 3; MIN: 1.46)
Run 2: 1.46542 (SE +/- 0.00133, N = 3; MIN: 1.45)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
Run 1: 3.27351 (SE +/- 0.01902, N = 3; MIN: 3.19)
Run 3: 3.36635 (SE +/- 0.00338, N = 3; MIN: 3.32)
Run 2: 3.23948 (SE +/- 0.01514, N = 3; MIN: 3.17)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.org: TNN 0.2.3, Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 296.12, 296.09, 296.02
SE: +/- 0.10 (N = 3), +/- 0.05 (N = 3), +/- 0.03 (N = 3)
MIN / MAX: 295.65 / 297.87, 295.68 / 296.99, 295.7 / 297.1
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Algebraic Multi-Grid Benchmark

OpenBenchmarking.org: Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
Runs (chart order): 1, 3, 2
Results: 429666875, 435619400, 435646267
SE: +/- 4426948.99 (N = 8), +/- 61971.85 (N = 3), +/- 112596.07 (N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 1.23217, 1.24672, 1.19520
SE: +/- 0.01498 (N = 6), +/- 0.00521 (N = 3), +/- 0.00136 (N = 3)
MIN: 1.17, 1.2, 1.16
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: LPUSH

OpenBenchmarking.org: Redis 6.0.9, Test: LPUSH (Requests Per Second, More Is Better)
Runs (chart order): 1, 3, 2
Results: 1889307.21, 1927407.21, 1905897.88
SE: +/- 14490.39 (N = 3), +/- 9875.33 (N = 3), +/- 6380.74 (N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 8.07550, 8.09521, 8.07076
SE: +/- 0.00411 (N = 3), +/- 0.00816 (N = 3), +/- 0.00140 (N = 3)
MIN: 8.01, 8.03, 8.01
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: SET

OpenBenchmarking.org: Redis 6.0.9, Test: SET (Requests Per Second, More Is Better)
Runs (chart order): 1, 3, 2
Results: 2149992.08, 2174653.00, 2181880.08
SE: +/- 14882.04 (N = 3), +/- 20349.76 (N = 3), +/- 6079.59 (N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP

OpenBenchmarking.org: Redis 6.0.9, Test: LPOP (Requests Per Second, More Is Better)
Runs (chart order): 1, 3, 2
Results: 2987638.08, 2013855.59, 1984735.79
SE: +/- 32160.04 (N = 3), +/- 3188.51 (N = 3), +/- 8420.44 (N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.org: Redis 6.0.9, Test: SADD (Requests Per Second, More Is Better)
Runs (chart order): 1, 3, 2
Results: 2420991.58, 2433452.33, 2410939.67
SE: +/- 7288.90 (N = 3), +/- 12483.84 (N = 3), +/- 18189.17 (N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.org: Redis 6.0.9, Test: GET (Requests Per Second, More Is Better)
Runs (chart order): 1, 3, 2
Results: 2861475.42, 2765324.33, 2750396.00
SE: +/- 25840.54 (N = 3), +/- 5857.53 (N = 3), +/- 9485.22 (N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.org: lzbench 1.8, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
Runs (chart order): 1, 3, 2
Results: 268, 268, 268
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 2.06308, 2.03981, 2.06901
SE: +/- 0.00176 (N = 3), +/- 0.00179 (N = 3), +/- 0.00205 (N = 3)
MIN: 2.03, 2.01, 2.04
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 0.864136, 0.844465, 0.894479
SE: +/- 0.001667 (N = 3), +/- 0.004799 (N = 3), +/- 0.001069 (N = 3)
MIN: 0.83, 0.81, 0.86
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 2.86817, 2.83983, 2.89487
SE: +/- 0.00245 (N = 3), +/- 0.01096 (N = 3), +/- 0.00187 (N = 3)
MIN: 2.7, 2.67, 2.74
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 3.63392, 3.76017, 3.60667
SE: +/- 0.00378 (N = 3), +/- 0.03347 (N = 3), +/- 0.00372 (N = 3)
MIN: 3.58, 3.64, 3.55
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 1.67560, 1.81677, 1.64785
SE: +/- 0.00470 (N = 3), +/- 0.01894 (N = 3), +/- 0.00048 (N = 3)
MIN: 1.6, 1.73, 1.59
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 6.20063, 6.20432, 5.80972
SE: +/- 0.02631 (N = 3), +/- 0.06455 (N = 3), +/- 0.00225 (N = 3)
MIN: 6.13, 6.07, 5.78
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LULESH

OpenBenchmarking.org: LULESH 2.0.3 (z/s, More Is Better)
Runs (chart order): 1, 3, 2
Results: 6332.80, 6401.29, 6440.08
SE: +/- 71.32 (N = 3), +/- 44.49 (N = 3), +/- 14.52 (N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 12.58, 12.57, 12.59
SE: +/- 0.00 (N = 3), +/- 0.00 (N = 3), +/- 0.00 (N = 3)
MIN: 12.38, 12.4, 12.37
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 2.20381, 2.40359, 2.16313
SE: +/- 0.00646 (N = 3), +/- 0.02317 (N = 15), +/- 0.00487 (N = 3)
MIN: 2.17, 2.25, 2.15
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 9.99665, 9.88730, 9.98915
SE: +/- 0.00034 (N = 3), +/- 0.00661 (N = 3), +/- 0.00261 (N = 3)
MIN: 9.95, 9.84, 9.94
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 10.61, 10.52, 10.61
SE: +/- 0.00 (N = 3), +/- 0.01 (N = 3), +/- 0.00 (N = 3)
MIN: 10.56, 10.47, 10.56
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 16.40, 16.87, 16.35
SE: +/- 0.02 (N = 3), +/- 0.04 (N = 3), +/- 0.01 (N = 3)
MIN: 16.32, 16.71, 16.27
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.org: oneDNN 2.0, Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Runs (chart order): 1, 3, 2
Results: 3.50970, 3.69979, 3.50573
SE: +/- 0.00098 (N = 3), +/- 0.04752 (N = 3), +/- 0.00461 (N = 3)
MIN: 3.48, 3.59, 3.47
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.org: LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: Rhodopsin Protein (ns/day, More Is Better)
Runs (chart order): 1, 3, 2
Results: 7.425, 7.420, 7.323
SE: +/- 0.032 (N = 3), +/- 0.013 (N = 3), +/- 0.113 (N = 3)
1. (CXX) g++ options: -O3 -pthread -lm


Phoronix Test Suite v10.8.4