10600K pre RKL

Intel Core i5-10600K testing with a ASUS PRIME Z490M-PLUS (1001 BIOS) and ASUS Intel UHD 630 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103183-IB-10600KPRE35&sor.

10600K pre RKLProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i5-10600K @ 4.80GHz (6 Cores / 12 Threads)ASUS PRIME Z490M-PLUS (1001 BIOS)Intel Comet Lake PCH32GBSamsung SSD 970 EVO 500GBASUS Intel UHD 630 3GB (1200MHz)Realtek ALC887-VDLG Ultra HDIntelUbuntu 20.045.9.0-050900daily20201012-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.94.6 Mesa 20.0.81.2.131GCC 9.3.0ext43840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe0Python Details- Python 3.8.5Security Details- itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

10600K pre RKLincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDaom-av1: Speed 0 Two-Passaom-av1: Speed 4 Two-Passaom-av1: Speed 6 Realtimeaom-av1: Speed 6 Two-Passaom-av1: Speed 8 Realtimesvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pbuild-mesa: Time To Compilebuild-nodejs: Time To Compileonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3sysbench: RAM / Memorysysbench: CPU12337.2945633131.9759323.061.14.304.920.235.9620.8317.0596.505.4386.08185.05149.76152.32119.9569.654596.4914.321207.662382.041542.3920015.111710.047148.5108913.72792.668765.366003990.012379.623992.292381.103.384134002.182382.713.460106.012818.3302144.634025.5197.60041.22579.6674.51428.5452.5463.26033.77026132.7614374.2638.2099012131.8184813.051.14.294.920.235.9720.6717.0996.225.4486.61185.25149.49151.96120.0369.689596.9704.314667.735292.036992.3852115.120710.08078.5192713.81422.659835.316713998.992403.363996.072394.463.378493995.942397.833.474946.024018.3402144.591825.5147.60141.21079.6794.50828.5762.5493.24033.98025903.8914373.4737.7690824132.9972133.051.14.294.920.235.9720.7517.0696.595.4486.69184.69149.77152.34120.0069.808597.2164.280257.476842.013972.3927915.07839.981488.5228213.84212.659065.363983986.562374.753987.232372.733.395583987.152372.193.464596.015218.2889143.531925.3657.60341.22179.6624.45228.4712.5593.23133.92826135.3114375.35OpenBenchmarking.org

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction132918273645SE +/- 0.36, N = 3SE +/- 0.35, N = 3SE +/- 0.07, N = 337.2937.7738.211. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction213306090120150SE +/- 1.01, N = 3SE +/- 1.07, N = 3SE +/- 0.15, N = 3131.82131.98133.001. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya1320.68851.3772.06552.7543.4425SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.063.053.051. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom3210.24750.4950.74250.991.2375SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.11.11.11. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets1320.96751.9352.90253.874.8375SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 34.304.294.291. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID3211.1072.2143.3214.4285.535SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.924.924.921. (CXX) g++ options: -O3 -pthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 0 Two-Pass3210.05180.10360.15540.20720.259SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.230.230.231. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 4 Two-Pass3211.34332.68664.02995.37326.7165SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 35.975.975.961. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Realtime132510152025SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.15, N = 320.8320.7520.671. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Two-Pass23148121620SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 317.0917.0617.051. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 8 Realtime31220406080100SE +/- 0.08, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 396.5996.5096.221. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p3211.2242.4483.6724.8966.12SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.445.445.431. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p32120406080100SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 386.6986.6186.081. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p2134080120160200SE +/- 0.38, N = 3SE +/- 0.05, N = 3SE +/- 0.38, N = 3185.25185.05184.691. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p312306090120150SE +/- 0.12, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3149.77149.76149.491. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p312306090120150SE +/- 0.29, N = 3SE +/- 0.28, N = 3SE +/- 0.11, N = 3152.34152.32151.961. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p231306090120150SE +/- 0.14, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3120.03120.00119.951. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile1231632486480SE +/- 0.18, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 369.6569.6969.81

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile123130260390520650SE +/- 0.12, N = 3SE +/- 0.25, N = 3SE +/- 0.12, N = 3596.49596.97597.22

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU3210.97231.94462.91693.88924.8615SE +/- 0.01808, N = 3SE +/- 0.01586, N = 3SE +/- 0.00877, N = 34.280254.314664.32120MIN: 4.19MIN: 4.18MIN: 4.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU312246810SE +/- 0.01643, N = 3SE +/- 0.01852, N = 3SE +/- 0.01377, N = 37.476847.662387.73529MIN: 7.35MIN: 7.54MIN: 7.591. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3210.45930.91861.37791.83722.2965SE +/- 0.00640, N = 3SE +/- 0.01080, N = 3SE +/- 0.00826, N = 32.013972.036992.04154MIN: 1.98MIN: 1.99MIN: 1.981. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU2130.53841.07681.61522.15362.692SE +/- 0.00590, N = 3SE +/- 0.00743, N = 3SE +/- 0.01182, N = 32.385212.392002.39279MIN: 2.31MIN: 2.34MIN: 2.331. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU31248121620SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 315.0815.1115.12MIN: 14.97MIN: 15.01MIN: 15.011. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU3123691215SE +/- 0.01007, N = 3SE +/- 0.04868, N = 3SE +/- 0.02946, N = 39.9814810.0471410.08070MIN: 6.26MIN: 6.15MIN: 6.131. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU123246810SE +/- 0.00717, N = 3SE +/- 0.00726, N = 3SE +/- 0.00862, N = 38.510898.519278.52282MIN: 8.48MIN: 8.48MIN: 8.481. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU12348121620SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 313.7313.8113.84MIN: 13.46MIN: 13.46MIN: 13.461. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU3210.60051.2011.80152.4023.0025SE +/- 0.00456, N = 3SE +/- 0.00545, N = 3SE +/- 0.01467, N = 32.659062.659832.66876MIN: 2.63MIN: 2.63MIN: 2.631. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU2311.20742.41483.62224.82966.037SE +/- 0.01565, N = 3SE +/- 0.00867, N = 3SE +/- 0.01215, N = 35.316715.363985.36600MIN: 5.27MIN: 5.32MIN: 5.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU3129001800270036004500SE +/- 1.26, N = 3SE +/- 0.55, N = 3SE +/- 2.93, N = 33986.563990.013998.99MIN: 3980.86MIN: 3986.86MIN: 3992.81. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU3125001000150020002500SE +/- 1.89, N = 3SE +/- 2.92, N = 3SE +/- 4.31, N = 32374.752379.622403.36MIN: 2367.87MIN: 2374.18MIN: 2396.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU3129001800270036004500SE +/- 1.71, N = 3SE +/- 0.57, N = 3SE +/- 1.10, N = 33987.233992.293996.07MIN: 3981.34MIN: 3987.88MIN: 3991.931. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU3125001000150020002500SE +/- 1.29, N = 3SE +/- 2.74, N = 3SE +/- 0.97, N = 32372.732381.102394.46MIN: 2366.08MIN: 2375.91MIN: 2391.281. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU2130.7641.5282.2923.0563.82SE +/- 0.00806, N = 3SE +/- 0.00246, N = 3SE +/- 0.00477, N = 33.378493.384133.39558MIN: 3.31MIN: 3.33MIN: 3.331. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU3219001800270036004500SE +/- 1.18, N = 3SE +/- 1.20, N = 3SE +/- 9.00, N = 33987.153995.944002.18MIN: 3982.67MIN: 3991.84MIN: 3990.271. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU3125001000150020002500SE +/- 1.46, N = 3SE +/- 1.67, N = 3SE +/- 1.16, N = 32372.192382.712397.83MIN: 2367.54MIN: 2377.54MIN: 2393.331. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1320.78191.56382.34573.12763.9095SE +/- 0.01319, N = 3SE +/- 0.00212, N = 3SE +/- 0.00087, N = 33.460103.464593.47494MIN: 3.26MIN: 3.27MIN: 3.281. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium132246810SE +/- 0.0064, N = 3SE +/- 0.0018, N = 3SE +/- 0.0013, N = 36.01286.01526.02401. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough312510152025SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 318.2918.3318.341. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive321306090120150SE +/- 0.30, N = 3SE +/- 0.26, N = 3SE +/- 0.22, N = 3143.53144.59144.631. (CXX) g++ options: -O3 -flto -pthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1S321612182430SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 325.3725.5125.521. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0123246810SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 37.6007.6017.6031. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2231918273645SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 341.2141.2241.231. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 331220406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 379.6679.6779.681. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.03211.01572.03143.04714.06285.0785SE +/- 0.004, N = 3SE +/- 0.007, N = 3SE +/- 0.010, N = 34.4524.5084.514MIN: 4.37 / MAX: 7.17MIN: 4.45 / MAX: 5.33MIN: 4.43 / MAX: 7.131. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50312714212835SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 328.4728.5528.58MIN: 28.25 / MAX: 40.28MIN: 28.18 / MAX: 40.29MIN: 28.26 / MAX: 40.341. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_2241230.57581.15161.72742.30322.879SE +/- 0.012, N = 3SE +/- 0.011, N = 3SE +/- 0.018, N = 32.5462.5492.559MIN: 2.51 / MAX: 3.35MIN: 2.51 / MAX: 5.08MIN: 2.5 / MAX: 33.311. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.03210.73351.4672.20052.9343.6675SE +/- 0.017, N = 3SE +/- 0.002, N = 3SE +/- 0.011, N = 33.2313.2403.260MIN: 3.19 / MAX: 3.46MIN: 3.2 / MAX: 4.08MIN: 3.22 / MAX: 4.011. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3132816243240SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 0.16, N = 333.7733.9333.98MIN: 33.53 / MAX: 46.55MIN: 33.53 / MAX: 46.76MIN: 33.58 / MAX: 47.61. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory3126K12K18K24K30KSE +/- 185.77, N = 3SE +/- 182.62, N = 3SE +/- 44.57, N = 326135.3126132.7625903.891. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU3123K6K9K12K15KSE +/- 0.21, N = 3SE +/- 1.10, N = 3SE +/- 1.54, N = 314375.3514374.2614373.471. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm


Phoronix Test Suite v10.8.4