bergamo extra Tests for a future article. AMD EPYC 9384X 32-Core testing with a AMD Titanite_4G (RTI1007B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2308292-NE-BERGAMOEX01&grr .
bergamo extra Processor Motherboard Chipset Memory Disk Graphics Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution E F AMD EPYC 9384X 32-Core @ 3.91GHz (32 Cores / 64 Threads) AMD Titanite_4G (RTI1007B BIOS) AMD Device 14a4 768GB 3841GB Micron_9300_MTFDHAL3T8TDP ASPEED Broadcom NetXtreme BCM5720 PCIe Ubuntu 22.04 5.15.0-47-generic (x86_64) GNOME Shell 42.4 X Server 1.21.1.3 1.2.204 GCC 11.2.0 ext4 1024x768 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: amd-pstate performance (Boost: Enabled) - CPU Microcode: 0xa101121 Java Details - OpenJDK Runtime Environment (build 11.0.16+8-post-Ubuntu-0ubuntu122.04) Python Details - Python 3.10.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
bergamo extra incompact3d: X3D-benchmarking input.i3d blender: Barbershop - CPU-Only build-linux-kernel: allmodconfig openssl: ChaCha20-Poly1305 openssl: ChaCha20 openssl: AES-256-GCM openssl: AES-128-GCM openssl: SHA512 openssl: SHA256 build-nodejs: Time To Compile build-gem5: Time To Compile blender: Pabellon Barcelona - CPU-Only build-godot: Time To Compile ncnn: CPU - FastestDet ncnn: CPU - vision_transformer ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet blender: Classroom - CPU-Only askap: tConvolve MT - Degridding askap: tConvolve MT - Gridding avifenc: 0 apache-iotdb: 500 - 100 - 500 apache-iotdb: 500 - 100 - 500 openssl: RSA4096 openssl: RSA4096 blender: Fishy Cat - CPU-Only apache-iotdb: 200 - 100 - 500 apache-iotdb: 200 - 100 - 500 blender: BMW27 - CPU-Only avifenc: 2 apache-iotdb: 500 - 1 - 500 apache-iotdb: 500 - 1 - 500 apache-iotdb: 500 - 100 - 200 apache-iotdb: 500 - 100 - 200 npb: EP.D stress-ng: IO_uring stress-ng: Futex build-linux-kernel: defconfig svt-av1: Preset 4 - Bosphorus 4K stress-ng: Malloc stress-ng: MMAP stress-ng: Cloning stress-ng: MEMFD stress-ng: Atomic stress-ng: CPU Cache stress-ng: Zlib stress-ng: Pthread stress-ng: AVX-512 VNNI stress-ng: Vector Floating Point stress-ng: AVL Tree stress-ng: x86_64 RdRand stress-ng: Vector Math stress-ng: Matrix Math stress-ng: Forking stress-ng: Crypto stress-ng: Hash stress-ng: Function Call stress-ng: NUMA stress-ng: Matrix 3D Math stress-ng: Floating Point stress-ng: Vector Shuffle stress-ng: Memory Copying stress-ng: Mutex stress-ng: System V Message Passing stress-ng: CPU Stress stress-ng: Glibc Qsort Data Sorting stress-ng: Glibc C String Functions stress-ng: Fused Multiply-Add stress-ng: Context Switching stress-ng: Wide Vector Math stress-ng: Socket Activity stress-ng: Mixed Scheduler stress-ng: Semaphores stress-ng: SENDFILE stress-ng: Poll stress-ng: Pipe compress-7zip: Decompression Rating compress-7zip: Compression Rating kvazaar: Bosphorus 4K - Slow kvazaar: Bosphorus 4K - Medium apache-iotdb: 100 - 100 - 500 apache-iotdb: 100 - 100 - 500 apache-iotdb: 500 - 1 - 200 apache-iotdb: 500 - 1 - 200 apache-iotdb: 200 - 1 - 500 apache-iotdb: 200 - 1 - 500 apache-iotdb: 200 - 100 - 200 apache-iotdb: 200 - 100 - 200 npb: BT.C apache-iotdb: 100 - 100 - 200 apache-iotdb: 100 - 100 - 200 apache-iotdb: 100 - 1 - 500 apache-iotdb: 100 - 1 - 500 apache-iotdb: 200 - 1 - 200 apache-iotdb: 200 - 1 - 200 build-mesa: Time To Compile apache-iotdb: 100 - 1 - 200 apache-iotdb: 100 - 1 - 200 npb: SP.C askap: tConvolve MPI - Gridding askap: tConvolve MPI - Degridding incompact3d: input.i3d 193 Cells Per Direction npb: IS.D svt-av1: Preset 4 - Bosphorus 1080p askap: Hogbom Clean OpenMP kvazaar: Bosphorus 4K - Very Fast npb: LU.C libxsmm: 64 kvazaar: Bosphorus 4K - Super Fast minife: Small svt-av1: Preset 8 - Bosphorus 4K kvazaar: Bosphorus 1080p - Slow kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Medium libxsmm: 32 askap: tConvolve OpenMP - Degridding askap: tConvolve OpenMP - Gridding avifenc: 6, Lossless npb: FT.C svt-av1: Preset 8 - Bosphorus 1080p svt-av1: Preset 13 - Bosphorus 4K svt-av1: Preset 12 - Bosphorus 4K incompact3d: input.i3d 129 Cells Per Direction avifenc: 10, Lossless kvazaar: Bosphorus 1080p - Very Fast npb: CG.C npb: SP.B avifenc: 6 kvazaar: Bosphorus 1080p - Super Fast npb: EP.C kvazaar: Bosphorus 1080p - Ultra Fast npb: MG.C svt-av1: Preset 12 - Bosphorus 1080p svt-av1: Preset 13 - Bosphorus 1080p E F 426.737732 355.8 334.195 133613467510 190340976640 301537503100 350368000140 15424545380 48327010260 171.749 155.586 124.16 112.549 12.67 45 24.53 16.55 26.35 17.9 5.91 9.07 26.04 20.67 4.13 10.74 7.43 10.01 8.49 8.31 17.4 101.62 15576.2 11262.6 72.293 70.79 67181470.11 549644.5 17152.6 49.68 92.91 49539921.59 39.98 38.121 32.07 1408314.28 29.07 60821535.47 3555.93 5064110.48 2852066.71 31.94 5.285 159403243.38 625.09 6870.08 415.36 247.05 1117509.74 3889.71 104885.48 4093266.57 101972.69 693.33 11942773.88 214246.51 166863.57 26368.36 79338.51 7179982.15 25044.49 647.21 28914.23 11457.56 24688.71 12746.29 23255699.79 7967662.12 76099.17 818.41 31352208.83 30155491.73 14202532.49 1415448.85 31093.74 33179 81406001.02 649786.78 4343727.88 18207317.96 248227 301517 22.7 23.49 78.92 52556675.96 13.94 1178643.97 33.33 1244290.54 35.05 47100474.32 130842.77 37.95 38111412.39 35.56 1016068.17 15.3 902188.66 17.533 20.42 588587.22 105770.9 30870.1 23854.1 12.3650045 4202.57 13.535 1149.43 48.9 195117.13 1246.6 61.87 57367.1 68.736 69.99 72.57 73.29 981.1 33282 22188 5.65 91806.3 141.111 161.793 161.629 3.76278901 4.082 162.16 47029.42 119825.49 2.867 224.67 4107.38 263.45 121134.91 515.851 615.958 422.494141 355.94 333.555 133611469760 190276666710 301560675530 351016588900 15419482790 48277353550 171.149 155.667 123.67 112.619 12.59 45.62 24.3 16.41 26.32 18.01 5.87 9.12 26.57 20.55 4.03 10.67 7.44 9.92 8.48 8.27 17.26 101.45 15633.4 11330 72.602 65.16 71000860.09 549345.8 17202.9 49.79 91.03 50107256.21 40.01 37.935 31.43 1445916.13 29.37 60811785.77 4177.87 5015105.12 2717400.09 32.065 5.348 158836969.43 609.72 7924.08 411.49 246.37 1117723.88 3893.79 104500.93 3473026.51 101749.29 694.87 11878217.33 214233.97 166859.04 26317.3 79616.41 7192739.29 25183.91 645.72 28742.76 11511.71 24689.76 12744.05 22935233.04 7980751.91 76433.87 818.44 31613431.85 30114608.52 14196798.48 1409726.96 31108.14 33150.73 78775153.73 675353.65 4271308.93 19946044.78 246009 302670 22.93 23.49 76.47 54498343.71 13.7 1193616.95 36.74 1144274.29 32.66 50016758.44 130890.96 36.79 39129510.65 38.71 955839.49 16.3 862346.98 17.527 19.58 603375.23 105402.56 30870.1 23854.1 12.255085 4255.49 13.668 1149.43 48.77 188290.63 1244 61.71 57438.2 86.174 70 72.53 72.72 967.1 33282 20481.2 5.611 91355.37 139.795 162.457 163.265 3.82253599 4.085 161.28 49017.63 119401.29 2.857 223.65 4134.38 266.37 114790.76 505.005 605.256 OpenBenchmarking.org
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d E F 90 180 270 360 450 426.74 422.49 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only E F 80 160 240 320 400 355.80 355.94
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig E F 70 140 210 280 350 334.20 333.56
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 E F 30000M 60000M 90000M 120000M 150000M 133613467510 133611469760 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 E F 40000M 80000M 120000M 160000M 200000M 190340976640 190276666710 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM E F 60000M 120000M 180000M 240000M 300000M 301537503100 301560675530 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM E F 80000M 160000M 240000M 320000M 400000M 350368000140 351016588900 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 E F 3000M 6000M 9000M 12000M 15000M 15424545380 15419482790 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 E F 10000M 20000M 30000M 40000M 50000M 48327010260 48277353550 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile E F 40 80 120 160 200 171.75 171.15
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 21.2 Time To Compile E F 30 60 90 120 150 155.59 155.67
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only E F 30 60 90 120 150 124.16 123.67
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 4.0 Time To Compile E F 30 60 90 120 150 112.55 112.62
NCNN Target: CPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: FastestDet E F 3 6 9 12 15 12.67 12.59 MIN: 12.55 / MAX: 13.14 MIN: 12.44 / MAX: 14.79 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vision_transformer E F 10 20 30 40 50 45.00 45.62 MIN: 44.46 / MAX: 52.79 MIN: 45.16 / MAX: 54.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: regnety_400m E F 6 12 18 24 30 24.53 24.30 MIN: 24.4 / MAX: 26.12 MIN: 23.83 / MAX: 52.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: squeezenet_ssd E F 4 8 12 16 20 16.55 16.41 MIN: 16.46 / MAX: 17.02 MIN: 16.32 / MAX: 18.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: yolov4-tiny E F 6 12 18 24 30 26.35 26.32 MIN: 26.14 / MAX: 26.85 MIN: 26.16 / MAX: 27.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet50 E F 4 8 12 16 20 17.90 18.01 MIN: 17.78 / MAX: 19.74 MIN: 17.88 / MAX: 18.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: alexnet E F 1.3298 2.6596 3.9894 5.3192 6.649 5.91 5.87 MIN: 5.74 / MAX: 13.78 MIN: 5.81 / MAX: 6.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet18 E F 3 6 9 12 15 9.07 9.12 MIN: 9 / MAX: 9.53 MIN: 9.04 / MAX: 9.54 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vgg16 E F 6 12 18 24 30 26.04 26.57 MIN: 25.76 / MAX: 26.78 MIN: 26.31 / MAX: 28.59 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: googlenet E F 5 10 15 20 25 20.67 20.55 MIN: 20.55 / MAX: 21.11 MIN: 20.36 / MAX: 28.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: blazeface E F 0.9293 1.8586 2.7879 3.7172 4.6465 4.13 4.03 MIN: 4.07 / MAX: 6.23 MIN: 3.96 / MAX: 4.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: efficientnet-b0 E F 3 6 9 12 15 10.74 10.67 MIN: 10.65 / MAX: 11.26 MIN: 10.57 / MAX: 11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mnasnet E F 2 4 6 8 10 7.43 7.44 MIN: 7.36 / MAX: 7.87 MIN: 7.37 / MAX: 7.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: shufflenet-v2 E F 3 6 9 12 15 10.01 9.92 MIN: 9.89 / MAX: 11.99 MIN: 9.83 / MAX: 10.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 E F 2 4 6 8 10 8.49 8.48 MIN: 8.38 / MAX: 8.91 MIN: 8.35 / MAX: 10.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 E F 2 4 6 8 10 8.31 8.27 MIN: 7.9 / MAX: 8.98 MIN: 7.89 / MAX: 9.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mobilenet E F 4 8 12 16 20 17.40 17.26 MIN: 17.29 / MAX: 19.5 MIN: 17.18 / MAX: 17.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only E F 20 40 60 80 100 101.62 101.45
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding E F 3K 6K 9K 12K 15K 15576.2 15633.4 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding E F 2K 4K 6K 8K 10K 11262.6 11330.0 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 E F 16 32 48 64 80 72.29 72.60 1. (CXX) g++ options: -O3 -fPIC -lm
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 E F 16 32 48 64 80 70.79 65.16 MAX: 1678.56 MAX: 1866.71
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 E F 15M 30M 45M 60M 75M 67181470.11 71000860.09
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 E F 120K 240K 360K 480K 600K 549644.5 549345.8 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 E F 4K 8K 12K 16K 20K 17152.6 17202.9 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only E F 11 22 33 44 55 49.68 49.79
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 E F 20 40 60 80 100 92.91 91.03 MAX: 3035.17 MAX: 2893.97
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 500 E F 11M 22M 33M 44M 55M 49539921.59 50107256.21
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only E F 9 18 27 36 45 39.98 40.01
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 E F 9 18 27 36 45 38.12 37.94 1. (CXX) g++ options: -O3 -fPIC -lm
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 E F 7 14 21 28 35 32.07 31.43 MAX: 1415.49 MAX: 1407.82
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500 E F 300K 600K 900K 1200K 1500K 1408314.28 1445916.13
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 E F 7 14 21 28 35 29.07 29.37 MAX: 1395.01 MAX: 1468.72
Apache IoTDB Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 200 E F 13M 26M 39M 52M 65M 60821535.47 60811785.77
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D E F 900 1800 2700 3600 4500 3555.93 4177.87 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Stress-NG Test: IO_uring OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: IO_uring E F 1.1M 2.2M 3.3M 4.4M 5.5M 5064110.48 5015105.12 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Futex OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Futex E F 600K 1200K 1800K 2400K 3000K 2852066.71 2717400.09 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig E F 7 14 21 28 35 31.94 32.07
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K E F 1.2033 2.4066 3.6099 4.8132 6.0165 5.285 5.348 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Stress-NG Test: Malloc OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Malloc E F 30M 60M 90M 120M 150M 159403243.38 158836969.43 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: MMAP OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MMAP E F 130 260 390 520 650 625.09 609.72 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Cloning OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Cloning E F 2K 4K 6K 8K 10K 6870.08 7924.08 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: MEMFD OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: MEMFD E F 90 180 270 360 450 415.36 411.49 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Atomic OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Atomic E F 50 100 150 200 250 247.05 246.37 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: CPU Cache OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Cache E F 200K 400K 600K 800K 1000K 1117509.74 1117723.88 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Zlib OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Zlib E F 800 1600 2400 3200 4000 3889.71 3893.79 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Pthread OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pthread E F 20K 40K 60K 80K 100K 104885.48 104500.93 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: AVX-512 VNNI OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: AVX-512 VNNI E F 900K 1800K 2700K 3600K 4500K 4093266.57 3473026.51 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Vector Floating Point OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Floating Point E F 20K 40K 60K 80K 100K 101972.69 101749.29 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: AVL Tree OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: AVL Tree E F 150 300 450 600 750 693.33 694.87 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: x86_64 RdRand OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: x86_64 RdRand E F 3M 6M 9M 12M 15M 11942773.88 11878217.33 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Math E F 50K 100K 150K 200K 250K 214246.51 214233.97 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Matrix Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix Math E F 40K 80K 120K 160K 200K 166863.57 166859.04 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Forking OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Forking E F 6K 12K 18K 24K 30K 26368.36 26317.30 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Crypto OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Crypto E F 20K 40K 60K 80K 100K 79338.51 79616.41 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Hash OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Hash E F 1.5M 3M 4.5M 6M 7.5M 7179982.15 7192739.29 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Function Call OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Function Call E F 5K 10K 15K 20K 25K 25044.49 25183.91 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: NUMA OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: NUMA E F 140 280 420 560 700 647.21 645.72 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Matrix 3D Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Matrix 3D Math E F 6K 12K 18K 24K 30K 28914.23 28742.76 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Floating Point OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Floating Point E F 2K 4K 6K 8K 10K 11457.56 11511.71 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Vector Shuffle OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Shuffle E F 5K 10K 15K 20K 25K 24688.71 24689.76 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Memory Copying OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Memory Copying E F 3K 6K 9K 12K 15K 12746.29 12744.05 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Mutex OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mutex E F 5M 10M 15M 20M 25M 23255699.79 22935233.04 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: System V Message Passing OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: System V Message Passing E F 2M 4M 6M 8M 10M 7967662.12 7980751.91 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: CPU Stress OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: CPU Stress E F 16K 32K 48K 64K 80K 76099.17 76433.87 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Glibc Qsort Data Sorting OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc Qsort Data Sorting E F 200 400 600 800 1000 818.41 818.44 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Glibc C String Functions OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Glibc C String Functions E F 7M 14M 21M 28M 35M 31352208.83 31613431.85 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Fused Multiply-Add OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Fused Multiply-Add E F 6M 12M 18M 24M 30M 30155491.73 30114608.52 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Context Switching OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Context Switching E F 3M 6M 9M 12M 15M 14202532.49 14196798.48 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Wide Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Wide Vector Math E F 300K 600K 900K 1200K 1500K 1415448.85 1409726.96 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Socket Activity OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Socket Activity E F 7K 14K 21K 28K 35K 31093.74 31108.14 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Mixed Scheduler OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Mixed Scheduler E F 7K 14K 21K 28K 35K 33179.00 33150.73 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Semaphores OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Semaphores E F 20M 40M 60M 80M 100M 81406001.02 78775153.73 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: SENDFILE OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: SENDFILE E F 140K 280K 420K 560K 700K 649786.78 675353.65 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Poll OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Poll E F 900K 1800K 2700K 3600K 4500K 4343727.88 4271308.93 1. (CXX) g++ options: -O2 -std=gnu99 -lc
Stress-NG Test: Pipe OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Pipe E F 4M 8M 12M 16M 20M 18207317.96 19946044.78 1. (CXX) g++ options: -O2 -std=gnu99 -lc
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating E F 50K 100K 150K 200K 250K 248227 246009 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating E F 60K 120K 180K 240K 300K 301517 302670 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Kvazaar Video Input: Bosphorus 4K - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Slow E F 5 10 15 20 25 22.70 22.93 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium E F 6 12 18 24 30 23.49 23.49 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 E F 20 40 60 80 100 78.92 76.47 MAX: 1478.84 MAX: 1695.4
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500 E F 12M 24M 36M 48M 60M 52556675.96 54498343.71
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 E F 4 8 12 16 20 13.94 13.70 MAX: 1249.85 MAX: 1226.66
Apache IoTDB Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200 E F 300K 600K 900K 1200K 1500K 1178643.97 1193616.95
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 E F 8 16 24 32 40 33.33 36.74 MAX: 1296.87 MAX: 1226.4
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500 E F 300K 600K 900K 1200K 1500K 1244290.54 1144274.29
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 E F 8 16 24 32 40 35.05 32.66 MAX: 1388.2 MAX: 1288.46
Apache IoTDB Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 100 - Sensor Count: 200 E F 11M 22M 33M 44M 55M 47100474.32 50016758.44
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C E F 30K 60K 90K 120K 150K 130842.77 130890.96 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 E F 9 18 27 36 45 37.95 36.79 MAX: 1192.97 MAX: 1268.75
Apache IoTDB Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200 E F 8M 16M 24M 32M 40M 38111412.39 39129510.65
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 E F 9 18 27 36 45 35.56 38.71 MAX: 1236.32 MAX: 1166.9
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500 E F 200K 400K 600K 800K 1000K 1016068.17 955839.49
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 E F 4 8 12 16 20 15.3 16.3 MAX: 1053.48 MAX: 1045.81
Apache IoTDB Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200 E F 200K 400K 600K 800K 1000K 902188.66 862346.98
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile E F 4 8 12 16 20 17.53 17.53
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 E F 5 10 15 20 25 20.42 19.58 MAX: 1084.24 MAX: 1044.13
Apache IoTDB Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.1.2 Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200 E F 130K 260K 390K 520K 650K 588587.22 603375.23
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C E F 20K 40K 60K 80K 100K 105770.90 105402.56 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding E F 7K 14K 21K 28K 35K 30870.1 30870.1 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding E F 5K 10K 15K 20K 25K 23854.1 23854.1 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction E F 3 6 9 12 15 12.37 12.26 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D E F 900 1800 2700 3600 4500 4202.57 4255.49 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 1080p E F 4 8 12 16 20 13.54 13.67 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP E F 200 400 600 800 1000 1149.43 1149.43 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast E F 11 22 33 44 55 48.90 48.77 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C E F 40K 80K 120K 160K 200K 195117.13 188290.63 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 E F 300 600 900 1200 1500 1246.6 1244.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Kvazaar Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast E F 14 28 42 56 70 61.87 61.71 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small E F 12K 24K 36K 48K 60K 57367.1 57438.2 1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K E F 20 40 60 80 100 68.74 86.17 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Slow OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Slow E F 16 32 48 64 80 69.99 70.00 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast E F 16 32 48 64 80 72.57 72.53 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Medium E F 16 32 48 64 80 73.29 72.72 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 E F 200 400 600 800 1000 981.1 967.1 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding E F 7K 14K 21K 28K 35K 33282 33282 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding E F 5K 10K 15K 20K 25K 22188.0 20481.2 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless E F 1.2713 2.5426 3.8139 5.0852 6.3565 5.650 5.611 1. (CXX) g++ options: -O3 -fPIC -lm
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C E F 20K 40K 60K 80K 100K 91806.30 91355.37 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 1080p E F 30 60 90 120 150 141.11 139.80 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K E F 40 80 120 160 200 161.79 162.46 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K E F 40 80 120 160 200 161.63 163.27 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction E F 0.8601 1.7202 2.5803 3.4404 4.3005 3.76278901 3.82253599 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless E F 0.9191 1.8382 2.7573 3.6764 4.5955 4.082 4.085 1. (CXX) g++ options: -O3 -fPIC -lm
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Very Fast E F 40 80 120 160 200 162.16 161.28 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C E F 10K 20K 30K 40K 50K 47029.42 49017.63 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B E F 30K 60K 90K 120K 150K 119825.49 119401.29 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 E F 0.6451 1.2902 1.9353 2.5804 3.2255 2.867 2.857 1. (CXX) g++ options: -O3 -fPIC -lm
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Super Fast E F 50 100 150 200 250 224.67 223.65 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C E F 900 1800 2700 3600 4500 4107.38 4134.38 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast E F 60 120 180 240 300 263.45 266.37 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C E F 30K 60K 90K 120K 150K 121134.91 114790.76 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 1080p E F 110 220 330 440 550 515.85 505.01 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 1080p E F 130 260 390 520 650 615.96 605.26 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Phoronix Test Suite v10.8.5