Xeon E3 1245 December v5 Intel Xeon E3-1245 v5 testing with a MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS) and MSI Intel HD P530 3GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012171-HA-XEONE312400&grs .
Xeon E3 1245 December v5 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution 1 2 3 Intel Xeon E3-1245 v5 @ 3.90GHz (4 Cores / 8 Threads) MSI C236A WORKSTATION (MS-7998) v1.0 (2.90 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB 120GB Samsung SSD 850 MSI Intel HD P530 3GB (1150MHz) Realtek ALC1150 LG Ultra HD Intel I219-V Ubuntu 20.04 5.9.0-050900rc7daily20201003-generic (x86_64) 20201002 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 4.6 Mesa 20.0.4 1.2.131 GCC 9.3.0 ext4 3840x2160 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 1.9.1 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable
Xeon E3 1245 December v5 onednn: IP Shapes 3D - f32 - CPU apache-siege: 500 build2: Time To Compile apache-siege: 250 apache-siege: 100 simdjson: Kostya coremark: CoreMark Size 666 - Iterations Per Second apache-siege: 200 apache-siege: 10 onednn: Recurrent Neural Network Inference - u8s8f32 - CPU apache-siege: 50 onednn: Convolution Batch Shapes Auto - f32 - CPU node-web-tooling: onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU mafft: Multiple Sequence Alignment - LSU RNA onednn: Deconvolution Batch shapes_3d - f32 - CPU phpbench: PHP Benchmark Suite onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU build-clash: Time To Compile vkresample: 2x - Single hmmer: Pfam Database Search sqlite-speedtest: Timed Time - Size 1,000 vkresample: 2x - Double onednn: Recurrent Neural Network Training - f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU vkfft: brl-cad: VGR Performance Metric onednn: IP Shapes 1D - u8s8f32 - CPU build-ffmpeg: Time To Compile onednn: IP Shapes 1D - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU simdjson: DistinctUserID simdjson: PartialTweets simdjson: LargeRand betsy: ETC2 RGB - Highest onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU betsy: ETC1 - Highest 1 2 3 12.0505 35425.87 278.207 35071.79 36590.03 0.62 139999.666879 35475.13 34682.17 3936.93 37184.02 19.9273 10.43 10.8654 3935.48 12.292 14.7863 651452 5.16070 3936.66 503.135 431.800 125.602 73.623 998.698 7475.48 11.5053 3.12413 7.55654 7.06309 19.7494 1403 48474 3.74565 128.177 8.19338 7465.28 0.58 0.57 0.38 7.187 8018.55 11.286 12.3985 34441.37 277.889 34308.01 36203.37 0.61 139563.390363 34911.49 34482.76 3957.02 36795.60 19.9243 10.52 10.9581 3968.98 12.389 14.6893 648537 5.14219 3939.66 502.741 430.887 125.928 73.725 1001.138 7493.45 11.4852 3.13122 7.56196 7.05721 19.7207 1404 48425 3.74740 128.292 8.19008 7469.16 0.58 0.57 0.38 7485.17 11.052 12.1411 34499.88 284.266 34647.10 35988.65 0.61 141836.103733 34952.97 34134.69 3997.52 37055.53 20.1018 10.44 10.8714 3951.34 12.322 14.7021 651248 5.15452 3950.78 504.478 430.666 125.734 73.805 998.819 7484.24 11.5125 3.13129 7.56918 7.06753 19.7427 1405 48489 3.74365 128.211 8.19661 7470.61 0.58 0.57 0.38 7471.28 10.996 OpenBenchmarking.org
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.11, N = 15 SE +/- 0.05, N = 3 12.05 12.40 12.14 MIN: 11.82 MIN: 11.95 MIN: 11.88 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Apache Siege Concurrent Users: 500 OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 500 1 2 3 8K 16K 24K 32K 40K SE +/- 133.32, N = 3 SE +/- 188.90, N = 3 SE +/- 164.35, N = 3 35425.87 34441.37 34499.88 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 1 2 3 60 120 180 240 300 SE +/- 0.34, N = 3 SE +/- 0.42, N = 3 SE +/- 3.50, N = 3 278.21 277.89 284.27
Apache Siege Concurrent Users: 250 OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 250 1 2 3 8K 16K 24K 32K 40K SE +/- 192.35, N = 3 SE +/- 75.75, N = 3 SE +/- 68.02, N = 3 35071.79 34308.01 34647.10 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
Apache Siege Concurrent Users: 100 OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 100 1 2 3 8K 16K 24K 32K 40K SE +/- 60.27, N = 3 SE +/- 194.83, N = 3 SE +/- 54.02, N = 3 36590.03 36203.37 35988.65 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: Kostya 1 2 3 0.1395 0.279 0.4185 0.558 0.6975 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.62 0.61 0.61 1. (CXX) g++ options: -O3 -pthread
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 1 2 3 30K 60K 90K 120K 150K SE +/- 241.75, N = 3 SE +/- 759.23, N = 3 SE +/- 1229.10, N = 3 139999.67 139563.39 141836.10 1. (CC) gcc options: -O2 -lrt" -lrt
Apache Siege Concurrent Users: 200 OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 200 1 2 3 8K 16K 24K 32K 40K SE +/- 319.00, N = 3 SE +/- 155.19, N = 3 SE +/- 52.60, N = 3 35475.13 34911.49 34952.97 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
Apache Siege Concurrent Users: 10 OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 10 1 2 3 7K 14K 21K 28K 35K SE +/- 40.05, N = 3 SE +/- 0.00, N = 3 SE +/- 290.84, N = 3 34682.17 34482.76 34134.69 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1 2 3 900 1800 2700 3600 4500 SE +/- 2.29, N = 3 SE +/- 5.75, N = 3 SE +/- 51.44, N = 5 3936.93 3957.02 3997.52 MIN: 3929.59 MIN: 3939.37 MIN: 3936.37 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Apache Siege Concurrent Users: 50 OpenBenchmarking.org Transactions Per Second, More Is Better Apache Siege 2.4.29 Concurrent Users: 50 1 2 3 8K 16K 24K 32K 40K SE +/- 40.19, N = 3 SE +/- 265.33, N = 3 SE +/- 60.12, N = 3 37184.02 36795.60 37055.53 1. (CC) gcc options: -O2 -lpthread -ldl -lssl -lcrypto
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 19.93 19.92 20.10 MIN: 19.79 MIN: 19.81 MIN: 19.8 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark 1 2 3 3 6 9 12 15 SE +/- 0.13, N = 5 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 10.43 10.52 10.44 1. Nodejs
v10.19.0
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 10.87 10.96 10.87 MIN: 10.77 MIN: 10.77 MIN: 10.77 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1 2 3 900 1800 2700 3600 4500 SE +/- 1.87, N = 3 SE +/- 26.09, N = 3 SE +/- 10.45, N = 3 3935.48 3968.98 3951.34 MIN: 3930.35 MIN: 3938.23 MIN: 3936.36 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA 1 2 3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 12.29 12.39 12.32 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1 2 3 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 14.79 14.69 14.70 MIN: 14.53 MIN: 14.48 MIN: 14.49 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 1 2 3 140K 280K 420K 560K 700K SE +/- 430.55, N = 3 SE +/- 2761.65, N = 3 SE +/- 834.21, N = 3 651452 648537 651248
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1 2 3 1.1612 2.3224 3.4836 4.6448 5.806 SE +/- 0.02207, N = 3 SE +/- 0.00982, N = 3 SE +/- 0.00478, N = 3 5.16070 5.14219 5.15452 MIN: 5.06 MIN: 5.07 MIN: 5.07 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 800 1600 2400 3200 4000 SE +/- 2.64, N = 3 SE +/- 0.62, N = 3 SE +/- 10.37, N = 3 3936.66 3939.66 3950.78 MIN: 3928.39 MIN: 3934.75 MIN: 3936.36 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Timed Clash Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Clash Compilation Time To Compile 1 2 3 110 220 330 440 550 SE +/- 0.42, N = 3 SE +/- 0.87, N = 3 SE +/- 0.30, N = 3 503.14 502.74 504.48
VkResample Upscale: 2x - Precision: Single OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Single 1 2 3 90 180 270 360 450 SE +/- 0.04, N = 3 SE +/- 0.12, N = 3 SE +/- 0.46, N = 3 431.80 430.89 430.67 1. (CXX) g++ options: -O3 -pthread
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.1 Pfam Database Search 1 2 3 30 60 90 120 150 SE +/- 0.05, N = 3 SE +/- 0.18, N = 3 SE +/- 0.10, N = 3 125.60 125.93 125.73 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 1 2 3 16 32 48 64 80 SE +/- 0.25, N = 3 SE +/- 0.08, N = 3 SE +/- 0.22, N = 3 73.62 73.73 73.81 1. (CC) gcc options: -O2 -ldl -lz -lpthread
VkResample Upscale: 2x - Precision: Double OpenBenchmarking.org ms, Fewer Is Better VkResample 1.0 Upscale: 2x - Precision: Double 1 2 3 200 400 600 800 1000 SE +/- 0.33, N = 3 SE +/- 0.56, N = 3 SE +/- 0.34, N = 3 998.70 1001.14 998.82 1. (CXX) g++ options: -O3 -pthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1 2 3 1600 3200 4800 6400 8000 SE +/- 2.11, N = 3 SE +/- 17.04, N = 3 SE +/- 7.55, N = 3 7475.48 7493.45 7484.24 MIN: 7466.93 MIN: 7459.2 MIN: 7462.09 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1 2 3 3 6 9 12 15 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 11.51 11.49 11.51 MIN: 11.29 MIN: 11.29 MIN: 11.28 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.7045 1.409 2.1135 2.818 3.5225 SE +/- 0.01369, N = 3 SE +/- 0.01563, N = 3 SE +/- 0.01203, N = 3 3.12413 3.13122 3.13129 MIN: 3.05 MIN: 3.05 MIN: 3.05 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.01345, N = 3 SE +/- 0.01576, N = 3 SE +/- 0.03000, N = 3 7.55654 7.56196 7.56918 MIN: 7.49 MIN: 7.48 MIN: 7.48 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.00794, N = 3 SE +/- 0.00555, N = 3 SE +/- 0.00309, N = 3 7.06309 7.05721 7.06753 MIN: 7.01 MIN: 7.01 MIN: 7.02 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1 2 3 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 19.75 19.72 19.74 MIN: 19.58 MIN: 19.53 MIN: 19.59 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
VkFFT OpenBenchmarking.org Benchmark Score, More Is Better VkFFT 1.1.1 1 2 3 300 600 900 1200 1500 SE +/- 2.03, N = 3 1403 1404 1405 1. (CXX) g++ options: -O3 -pthread
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.30.8 VGR Performance Metric 1 2 3 10K 20K 30K 40K 50K 48474 48425 48489 1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1 2 3 0.8432 1.6864 2.5296 3.3728 4.216 SE +/- 0.00518, N = 3 SE +/- 0.00660, N = 3 SE +/- 0.00576, N = 3 3.74565 3.74740 3.74365 MIN: 3.7 MIN: 3.7 MIN: 3.7 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.2.2 Time To Compile 1 2 3 30 60 90 120 150 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 128.18 128.29 128.21
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1 2 3 2 4 6 8 10 SE +/- 0.02381, N = 3 SE +/- 0.02857, N = 3 SE +/- 0.02221, N = 3 8.19338 8.19008 8.19661 MIN: 7.88 MIN: 7.88 MIN: 7.87 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1 2 3 1600 3200 4800 6400 8000 SE +/- 3.96, N = 3 SE +/- 3.45, N = 3 SE +/- 1.75, N = 3 7465.28 7469.16 7470.61 MIN: 7456.1 MIN: 7457.9 MIN: 7463.02 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: DistinctUserID 1 2 3 0.1305 0.261 0.3915 0.522 0.6525 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.58 0.58 0.58 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: PartialTweets 1 2 3 0.1283 0.2566 0.3849 0.5132 0.6415 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.57 0.57 0.57 1. (CXX) g++ options: -O3 -pthread
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 0.7.1 Throughput Test: LargeRandom 1 2 3 0.0855 0.171 0.2565 0.342 0.4275 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.38 0.38 0.38 1. (CXX) g++ options: -O3 -pthread
Betsy GPU Compressor Codec: ETC2 RGB - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC2 RGB - Quality: Highest 1 2 4 6 8 10 7.187 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1 2 3 2K 4K 6K 8K 10K SE +/- 471.82, N = 15 SE +/- 16.81, N = 3 SE +/- 1.86, N = 3 8018.55 7485.17 7471.28 MIN: 7458.49 MIN: 7458.88 MIN: 7460.83 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
Betsy GPU Compressor Codec: ETC1 - Quality: Highest OpenBenchmarking.org Seconds, Fewer Is Better Betsy GPU Compressor 1.1 Beta Codec: ETC1 - Quality: Highest 1 2 3 3 6 9 12 15 SE +/- 0.29, N = 15 SE +/- 0.10, N = 3 SE +/- 0.07, N = 3 11.29 11.05 11.00 1. (CXX) g++ options: -O3 -O2 -lpthread -ldl
Phoronix Test Suite v10.8.4