bergamo extra Tests for a future article. AMD EPYC 9384X 32-Core testing with a AMD Titanite_4G (RTI1007B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2308292-NE-BERGAMOEX01&grs&rdt .
bergamo extra Processor Motherboard Chipset Memory Disk Graphics Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution E F AMD EPYC 9384X 32-Core @ 3.91GHz (32 Cores / 64 Threads) AMD Titanite_4G (RTI1007B BIOS) AMD Device 14a4 768GB 3841GB Micron_9300_MTFDHAL3T8TDP ASPEED Broadcom NetXtreme BCM5720 PCIe Ubuntu 22.04 5.15.0-47-generic (x86_64) GNOME Shell 42.4 X Server 1.21.1.3 1.2.204 GCC 11.2.0 ext4 1024x768 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa101121 Java Details - OpenJDK Runtime Environment (build 11.0.16+8-post-Ubuntu-0ubuntu122.04) Python Details - Python 3.10.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
bergamo extra svt-av1: Preset 8 - Bosphorus 4K stress-ng: AVX-512 VNNI npb: EP.D stress-ng: Cloning apache-iotdb: 200 - 1 - 500 stress-ng: Pipe apache-iotdb: 100 - 1 - 500 apache-iotdb: 200 - 1 - 500 apache-iotdb: 500 - 100 - 500 askap: tConvolve OpenMP - Gridding apache-iotdb: 200 - 100 - 200 apache-iotdb: 200 - 1 - 200 apache-iotdb: 100 - 1 - 500 apache-iotdb: 200 - 100 - 200 apache-iotdb: 500 - 100 - 500 npb: MG.C stress-ng: Futex apache-iotdb: 200 - 1 - 200 apache-iotdb: 100 - 1 - 200 npb: CG.C stress-ng: SENDFILE apache-iotdb: 100 - 100 - 500 npb: LU.C stress-ng: Semaphores apache-iotdb: 100 - 100 - 500 apache-iotdb: 100 - 100 - 200 apache-iotdb: 100 - 100 - 200 apache-iotdb: 500 - 1 - 500 stress-ng: MMAP apache-iotdb: 100 - 1 - 200 ncnn: CPU - blazeface svt-av1: Preset 12 - Bosphorus 1080p apache-iotdb: 200 - 100 - 500 apache-iotdb: 500 - 1 - 500 ncnn: CPU - vgg16 svt-av1: Preset 13 - Bosphorus 1080p apache-iotdb: 500 - 1 - 200 stress-ng: Poll incompact3d: input.i3d 129 Cells Per Direction libxsmm: 32 stress-ng: Mutex ncnn: CPU - vision_transformer apache-iotdb: 500 - 1 - 200 npb: IS.D svt-av1: Preset 4 - Bosphorus 4K apache-iotdb: 200 - 100 - 500 kvazaar: Bosphorus 1080p - Ultra Fast apache-iotdb: 500 - 100 - 200 kvazaar: Bosphorus 4K - Slow svt-av1: Preset 12 - Bosphorus 4K incompact3d: X3D-benchmarking input.i3d svt-av1: Preset 4 - Bosphorus 1080p stress-ng: IO_uring ncnn: CPU - regnety_400m svt-av1: Preset 8 - Bosphorus 1080p stress-ng: MEMFD ncnn: CPU - shufflenet-v2 compress-7zip: Decompression Rating incompact3d: input.i3d 193 Cells Per Direction ncnn: CPU - squeezenet_ssd stress-ng: Glibc C String Functions ncnn: CPU - mobilenet kvazaar: Bosphorus 1080p - Medium avifenc: 6, Lossless ncnn: CPU - alexnet npb: EP.C ncnn: CPU - efficientnet-b0 ncnn: CPU - FastestDet ncnn: CPU - resnet50 askap: tConvolve MT - Gridding stress-ng: Matrix 3D Math ncnn: CPU - googlenet stress-ng: Function Call ncnn: CPU - resnet18 kvazaar: Bosphorus 1080p - Very Fast stress-ng: x86_64 RdRand npb: FT.C avifenc: 2 ncnn: CPU-v2-v2 - mobilenet-v2 stress-ng: Floating Point kvazaar: Bosphorus 1080p - Super Fast stress-ng: CPU Stress avifenc: 0 svt-av1: Preset 13 - Bosphorus 4K stress-ng: Wide Vector Math blender: Pabellon Barcelona - CPU-Only build-linux-kernel: defconfig compress-7zip: Compression Rating stress-ng: Pthread askap: tConvolve MT - Degridding stress-ng: Malloc npb: SP.B build-nodejs: Time To Compile stress-ng: Crypto avifenc: 6 npb: SP.C openssl: RSA4096 stress-ng: Atomic kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Super Fast stress-ng: NUMA stress-ng: AVL Tree blender: Fishy Cat - CPU-Only stress-ng: Vector Floating Point libxsmm: 64 stress-ng: Forking build-linux-kernel: allmodconfig openssl: AES-128-GCM stress-ng: Hash blender: Classroom - CPU-Only stress-ng: System V Message Passing stress-ng: Fused Multiply-Add ncnn: CPU - mnasnet minife: Small ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - yolov4-tiny stress-ng: Zlib openssl: SHA256 stress-ng: Mixed Scheduler blender: BMW27 - CPU-Only avifenc: 10, Lossless build-godot: Time To Compile kvazaar: Bosphorus 4K - Ultra Fast openssl: RSA4096 build-gem5: Time To Compile stress-ng: Socket Activity stress-ng: Context Switching blender: Barbershop - CPU-Only npb: BT.C build-mesa: Time To Compile openssl: ChaCha20 openssl: SHA512 stress-ng: CPU Cache stress-ng: Memory Copying apache-iotdb: 500 - 100 - 200 kvazaar: Bosphorus 1080p - Slow openssl: AES-256-GCM stress-ng: Vector Math stress-ng: Vector Shuffle stress-ng: Glibc Qsort Data Sorting stress-ng: Matrix Math openssl: ChaCha20-Poly1305 kvazaar: Bosphorus 4K - Medium askap: Hogbom Clean OpenMP askap: tConvolve OpenMP - Degridding askap: tConvolve MPI - Gridding askap: tConvolve MPI - Degridding E F 68.736 4093266.57 3555.93 6870.08 33.33 18207317.96 35.56 1244290.54 70.79 22188 35.05 15.3 1016068.17 47100474.32 67181470.11 121134.91 2852066.71 902188.66 20.42 47029.42 649786.78 52556675.96 195117.13 81406001.02 78.92 37.95 38111412.39 1408314.28 625.09 588587.22 4.13 515.851 92.91 32.07 26.04 615.958 13.94 4343727.88 3.76278901 981.1 23255699.79 45 1178643.97 4202.57 5.285 49539921.59 263.45 29.07 22.7 161.629 426.737732 13.535 5064110.48 24.53 141.111 415.36 10.01 248227 12.3650045 16.55 31352208.83 17.4 73.29 5.65 5.91 4107.38 10.74 12.67 17.9 11262.6 28914.23 20.67 25044.49 9.07 162.16 11942773.88 91806.3 38.121 8.31 11457.56 224.67 76099.17 72.293 161.793 1415448.85 124.16 31.94 301517 104885.48 15576.2 159403243.38 119825.49 171.749 79338.51 2.867 105770.9 17152.6 247.05 48.9 61.87 647.21 693.33 49.68 101972.69 1246.6 26368.36 334.195 350368000140 7179982.15 101.62 7967662.12 30155491.73 7.43 57367.1 8.49 26.35 3889.71 48327010260 33179 39.98 4.082 112.549 72.57 549644.5 155.586 31093.74 14202532.49 355.8 130842.77 17.533 190340976640 15424545380 1117509.74 12746.29 60821535.47 69.99 301537503100 214246.51 24688.71 818.41 166863.57 133613467510 23.49 1149.43 33282 30870.1 23854.1 86.174 3473026.51 4177.87 7924.08 36.74 19946044.78 38.71 1144274.29 65.16 20481.2 32.66 16.3 955839.49 50016758.44 71000860.09 114790.76 2717400.09 862346.98 19.58 49017.63 675353.65 54498343.71 188290.63 78775153.73 76.47 36.79 39129510.65 1445916.13 609.72 603375.23 4.03 505.005 91.03 31.43 26.57 605.256 13.7 4271308.93 3.82253599 967.1 22935233.04 45.62 1193616.95 4255.49 5.348 50107256.21 266.37 29.37 22.93 163.265 422.494141 13.668 5015105.12 24.3 139.795 411.49 9.92 246009 12.255085 16.41 31613431.85 17.26 72.72 5.611 5.87 4134.38 10.67 12.59 18.01 11330 28742.76 20.55 25183.91 9.12 161.28 11878217.33 91355.37 37.935 8.27 11511.71 223.65 76433.87 72.602 162.457 1409726.96 123.67 32.065 302670 104500.93 15633.4 158836969.43 119401.29 171.149 79616.41 2.857 105402.56 17202.9 246.37 48.77 61.71 645.72 694.87 49.79 101749.29 1244 26317.3 333.555 351016588900 7192739.29 101.45 7980751.91 30114608.52 7.44 57438.2 8.48 26.32 3893.79 48277353550 33150.73 40.01 4.085 112.619 72.53 549345.8 155.667 31108.14 14196798.48 355.94 130890.96 17.527 190276666710 15419482790 1117723.88 12744.05 60811785.77 70 301560675530 214233.97 24689.76 818.44 166859.04 133611469760 23.49 1149.43 33282 30870.1 23854.1 OpenBenchmarking.org
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K E F 20 40 60 80 100 68.74 86.17 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Stress-NG Test: AVX-512 VNNI OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: AVX-512 VNNI E F 900K 1800K 2700K 3600K 4500K 4093266.57 3473026.51 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D E F 900 1800 2700 3600 4500 3555.93 4177.87 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Stress-NG Test: Cloning OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Cloning E F 2K 4K 6K 8K 10K 6870.08 7924.08 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 E F 8 16 24 32 40 33.33 36.74 MAX: 1296.87 MAX: 1226.4
Stress-NG Test: Pipe OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pipe E F 4M 8M 12M 16M 20M 18207317.96 19946044.78 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 E F 9 18 27 36 45 35.56 38.71 MAX: 1236.32 MAX: 1166.9
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 E F 300K 600K 900K 1200K 1500K 1244290.54 1144274.29
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 E F 16 32 48 64 80 70.79 65.16 MAX: 1678.56 MAX: 1866.71
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding E F 5K 10K 15K 20K 25K 22188.0 20481.2 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 E F 8 16 24 32 40 35.05 32.66 MAX: 1388.2 MAX: 1288.46
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 E F 4 8 12 16 20 15.3 16.3 MAX: 1053.48 MAX: 1045.81
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 E F 200K 400K 600K 800K 1000K 1016068.17 955839.49
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 E F 11M 22M 33M 44M 55M 47100474.32 50016758.44
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 E F 15M 30M 45M 60M 75M 67181470.11 71000860.09
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C E F 30K 60K 90K 120K 150K 121134.91 114790.76 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Stress-NG Test: Futex OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Futex E F 600K 1200K 1800K 2400K 3000K 2852066.71 2717400.09 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 E F 200K 400K 600K 800K 1000K 902188.66 862346.98
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 E F 5 10 15 20 25 20.42 19.58 MAX: 1084.24 MAX: 1044.13
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C E F 10K 20K 30K 40K 50K 47029.42 49017.63 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Stress-NG Test: SENDFILE OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: SENDFILE E F 140K 280K 420K 560K 700K 649786.78 675353.65 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 E F 12M 24M 36M 48M 60M 52556675.96 54498343.71
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C E F 40K 80K 120K 160K 200K 195117.13 188290.63 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Stress-NG Test: Semaphores OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Semaphores E F 20M 40M 60M 80M 100M 81406001.02 78775153.73 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 E F 20 40 60 80 100 78.92 76.47 MAX: 1478.84 MAX: 1695.4
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 E F 9 18 27 36 45 37.95 36.79 MAX: 1192.97 MAX: 1268.75
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 E F 8M 16M 24M 32M 40M 38111412.39 39129510.65
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 E F 300K 600K 900K 1200K 1500K 1408314.28 1445916.13
Stress-NG Test: MMAP OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MMAP E F 130 260 390 520 650 625.09 609.72 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 E F 130K 260K 390K 520K 650K 588587.22 603375.23
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: blazeface E F 0.9293 1.8586 2.7879 3.7172 4.6465 4.13 4.03 MIN: 4.07 / MAX: 6.23 MIN: 3.96 / MAX: 4.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 1080p E F 110 220 330 440 550 515.85 505.01 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 E F 20 40 60 80 100 92.91 91.03 MAX: 3035.17 MAX: 2893.97
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 E F 7 14 21 28 35 32.07 31.43 MAX: 1415.49 MAX: 1407.82
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vgg16 E F 6 12 18 24 30 26.04 26.57 MIN: 25.76 / MAX: 26.78 MIN: 26.31 / MAX: 28.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 1080p E F 130 260 390 520 650 615.96 605.26 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 E F 4 8 12 16 20 13.94 13.70 MAX: 1249.85 MAX: 1226.66
Stress-NG Test: Poll OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Poll E F 900K 1800K 2700K 3600K 4500K 4343727.88 4271308.93 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction E F 0.8601 1.7202 2.5803 3.4404 4.3005 3.76278901 3.82253599 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 E F 200 400 600 800 1000 981.1 967.1 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Stress-NG Test: Mutex OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mutex E F 5M 10M 15M 20M 25M 23255699.79 22935233.04 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NCNN Target: CPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vision_transformer E F 10 20 30 40 50 45.00 45.62 MIN: 44.46 / MAX: 52.79 MIN: 45.16 / MAX: 54.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 E F 300K 600K 900K 1200K 1500K 1178643.97 1193616.95
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D E F 900 1800 2700 3600 4500 4202.57 4255.49 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K E F 1.2033 2.4066 3.6099 4.8132 6.0165 5.285 5.348 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 E F 11M 22M 33M 44M 55M 49539921.59 50107256.21
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast E F 60 120 180 240 300 263.45 266.37 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 E F 7 14 21 28 35 29.07 29.37 MAX: 1395.01 MAX: 1468.72
Kvazaar Video Input: Bosphorus 4K - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Slow E F 5 10 15 20 25 22.70 22.93 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K E F 40 80 120 160 200 161.63 163.27 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d E F 90 180 270 360 450 426.74 422.49 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 1080p E F 4 8 12 16 20 13.54 13.67 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Stress-NG Test: IO_uring OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: IO_uring E F 1.1M 2.2M 3.3M 4.4M 5.5M 5064110.48 5015105.12 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: regnety_400m E F 6 12 18 24 30 24.53 24.30 MIN: 24.4 / MAX: 26.12 MIN: 23.83 / MAX: 52.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 1080p E F 30 60 90 120 150 141.11 139.80 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Stress-NG Test: MEMFD OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MEMFD E F 90 180 270 360 450 415.36 411.49 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: shufflenet-v2 E F 3 6 9 12 15 10.01 9.92 MIN: 9.89 / MAX: 11.99 MIN: 9.83 / MAX: 10.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating E F 50K 100K 150K 200K 250K 248227 246009 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction E F 3 6 9 12 15 12.37 12.26 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: squeezenet_ssd E F 4 8 12 16 20 16.55 16.41 MIN: 16.46 / MAX: 17.02 MIN: 16.32 / MAX: 18.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Stress-NG Test: Glibc C String Functions OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc C String Functions E F 7M 14M 21M 28M 35M 31352208.83 31613431.85 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mobilenet E F 4 8 12 16 20 17.40 17.26 MIN: 17.29 / MAX: 19.5 MIN: 17.18 / MAX: 17.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Medium E F 16 32 48 64 80 73.29 72.72 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless E F 1.2713 2.5426 3.8139 5.0852 6.3565 5.650 5.611 1. (CXX) g++ options: -O3 -fPIC -lm
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: alexnet E F 1.3298 2.6596 3.9894 5.3192 6.649 5.91 5.87 MIN: 5.74 / MAX: 13.78 MIN: 5.81 / MAX: 6.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C E F 900 1800 2700 3600 4500 4107.38 4134.38 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: efficientnet-b0 E F 3 6 9 12 15 10.74 10.67 MIN: 10.65 / MAX: 11.26 MIN: 10.57 / MAX: 11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: FastestDet E F 3 6 9 12 15 12.67 12.59 MIN: 12.55 / MAX: 13.14 MIN: 12.44 / MAX: 14.79 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet50 E F 4 8 12 16 20 17.90 18.01 MIN: 17.78 / MAX: 19.74 MIN: 17.88 / MAX: 18.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding E F 2K 4K 6K 8K 10K 11262.6 11330.0 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Stress-NG Test: Matrix 3D Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix 3D Math E F 6K 12K 18K 24K 30K 28914.23 28742.76 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: googlenet E F 5 10 15 20 25 20.67 20.55 MIN: 20.55 / MAX: 21.11 MIN: 20.36 / MAX: 28.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Stress-NG Test: Function Call OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Function Call E F 5K 10K 15K 20K 25K 25044.49 25183.91 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet18 E F 3 6 9 12 15 9.07 9.12 MIN: 9 / MAX: 9.53 MIN: 9.04 / MAX: 9.54 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Very Fast E F 40 80 120 160 200 162.16 161.28 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Stress-NG Test: x86_64 RdRand OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: x86_64 RdRand E F 3M 6M 9M 12M 15M 11942773.88 11878217.33 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C E F 20K 40K 60K 80K 100K 91806.30 91355.37 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 E F 9 18 27 36 45 38.12 37.94 1. (CXX) g++ options: -O3 -fPIC -lm
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 E F 2 4 6 8 10 8.31 8.27 MIN: 7.9 / MAX: 8.98 MIN: 7.89 / MAX: 9.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Stress-NG Test: Floating Point OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Floating Point E F 2K 4K 6K 8K 10K 11457.56 11511.71 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Super Fast E F 50 100 150 200 250 224.67 223.65 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Stress-NG Test: CPU Stress OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Stress E F 16K 32K 48K 64K 80K 76099.17 76433.87 1. (CXX) g++ options: -O2 -std=gnu99 -lc
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 E F 16 32 48 64 80 72.29 72.60 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K E F 40 80 120 160 200 161.79 162.46 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Stress-NG Test: Wide Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Wide Vector Math E F 300K 600K 900K 1200K 1500K 1415448.85 1409726.96 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only E F 30 60 90 120 150 124.16 123.67
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig E F 7 14 21 28 35 31.94 32.07
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating E F 60K 120K 180K 240K 300K 301517 302670 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Stress-NG Test: Pthread OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pthread E F 20K 40K 60K 80K 100K 104885.48 104500.93 1. (CXX) g++ options: -O2 -std=gnu99 -lc
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding E F 3K 6K 9K 12K 15K 15576.2 15633.4 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Stress-NG Test: Malloc OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Malloc E F 30M 60M 90M 120M 150M 159403243.38 158836969.43 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B E F 30K 60K 90K 120K 150K 119825.49 119401.29 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile E F 40 80 120 160 200 171.75 171.15
Stress-NG Test: Crypto OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Crypto E F 20K 40K 60K 80K 100K 79338.51 79616.41 1. (CXX) g++ options: -O2 -std=gnu99 -lc
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 E F 0.6451 1.2902 1.9353 2.5804 3.2255 2.867 2.857 1. (CXX) g++ options: -O3 -fPIC -lm
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C E F 20K 40K 60K 80K 100K 105770.90 105402.56 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 E F 4K 8K 12K 16K 20K 17152.6 17202.9 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stress-NG Test: Atomic OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Atomic E F 50 100 150 200 250 247.05 246.37 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast E F 11 22 33 44 55 48.90 48.77 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast E F 14 28 42 56 70 61.87 61.71 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Stress-NG Test: NUMA OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: NUMA E F 140 280 420 560 700 647.21 645.72 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: AVL Tree OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: AVL Tree E F 150 300 450 600 750 693.33 694.87 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only E F 11 22 33 44 55 49.68 49.79
Stress-NG Test: Vector Floating Point OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Floating Point E F 20K 40K 60K 80K 100K 101972.69 101749.29 1. (CXX) g++ options: -O2 -std=gnu99 -lc
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 E F 300 600 900 1200 1500 1246.6 1244.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Stress-NG Test: Forking OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Forking E F 6K 12K 18K 24K 30K 26368.36 26317.30 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig E F 70 140 210 280 350 334.20 333.56
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM E F 80000M 160000M 240000M 320000M 400000M 350368000140 351016588900 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stress-NG Test: Hash OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Hash E F 1.5M 3M 4.5M 6M 7.5M 7179982.15 7192739.29 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only E F 20 40 60 80 100 101.62 101.45
Stress-NG Test: System V Message Passing OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: System V Message Passing E F 2M 4M 6M 8M 10M 7967662.12 7980751.91 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Fused Multiply-Add OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Fused Multiply-Add E F 6M 12M 18M 24M 30M 30155491.73 30114608.52 1. (CXX) g++ options: -O2 -std=gnu99 -lc
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mnasnet E F 2 4 6 8 10 7.43 7.44 MIN: 7.36 / MAX: 7.87 MIN: 7.37 / MAX: 7.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small E F 12K 24K 36K 48K 60K 57367.1 57438.2 1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 E F 2 4 6 8 10 8.49 8.48 MIN: 8.38 / MAX: 8.91 MIN: 8.35 / MAX: 10.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: yolov4-tiny E F 6 12 18 24 30 26.35 26.32 MIN: 26.14 / MAX: 26.85 MIN: 26.16 / MAX: 27.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Stress-NG Test: Zlib OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Zlib E F 800 1600 2400 3200 4000 3889.71 3893.79 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 E F 10000M 20000M 30000M 40000M 50000M 48327010260 48277353550 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stress-NG Test: Mixed Scheduler OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mixed Scheduler E F 7K 14K 21K 28K 35K 33179.00 33150.73 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only E F 9 18 27 36 45 39.98 40.01
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless E F 0.9191 1.8382 2.7573 3.6764 4.5955 4.082 4.085 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 4.0 Time To Compile E F 30 60 90 120 150 112.55 112.62
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast E F 16 32 48 64 80 72.57 72.53 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 E F 120K 240K 360K 480K 600K 549644.5 549345.8 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 21.2 Time To Compile E F 30 60 90 120 150 155.59 155.67
Stress-NG Test: Socket Activity OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Socket Activity E F 7K 14K 21K 28K 35K 31093.74 31108.14 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Context Switching OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Context Switching E F 3M 6M 9M 12M 15M 14202532.49 14196798.48 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only E F 80 160 240 320 400 355.80 355.94
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C E F 30K 60K 90K 120K 150K 130842.77 130890.96 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile E F 4 8 12 16 20 17.53 17.53
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 E F 40000M 80000M 120000M 160000M 200000M 190340976640 190276666710 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 E F 3000M 6000M 9000M 12000M 15000M 15424545380 15419482790 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stress-NG Test: CPU Cache OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Cache E F 200K 400K 600K 800K 1000K 1117509.74 1117723.88 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Memory Copying OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Memory Copying E F 3K 6K 9K 12K 15K 12746.29 12744.05 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 E F 13M 26M 39M 52M 65M 60821535.47 60811785.77
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Slow E F 16 32 48 64 80 69.99 70.00 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM E F 60000M 120000M 180000M 240000M 300000M 301537503100 301560675530 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stress-NG Test: Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Math E F 50K 100K 150K 200K 250K 214246.51 214233.97 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Vector Shuffle OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Shuffle E F 5K 10K 15K 20K 25K 24688.71 24689.76 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Glibc Qsort Data Sorting OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc Qsort Data Sorting E F 200 400 600 800 1000 818.41 818.44 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Matrix Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix Math E F 40K 80K 120K 160K 200K 166863.57 166859.04 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 E F 30000M 60000M 90000M 120000M 150000M 133613467510 133611469760 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium E F 6 12 18 24 30 23.49 23.49 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP E F 200 400 600 800 1000 1149.43 1149.43 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding E F 7K 14K 21K 28K 35K 33282 33282 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding E F 7K 14K 21K 28K 35K 30870.1 30870.1 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding E F 5K 10K 15K 20K 25K 23854.1 23854.1 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Phoronix Test Suite v10.8.5