3960x april AMD Ryzen Threadripper 3960X 24-Core testing with a MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) and Gigabyte AMD Radeon RX 5500 XT 8GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2204252-NE-3960XAPRI99&grs .
3960x april Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution A B C AMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads) MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) AMD Starship/Matisse 32GB 1000GB Sabrent Rocket 4.0 1TB Gigabyte AMD Radeon RX 5500 XT 8GB (1900/875MHz) AMD Navi 10 HDMI Audio DELL S2409W Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.12.0-051200rc2daily20210307-generic (x86_64) 20210306 GNOME Shell 3.36.4 X Server 1.20.8 4.6 Mesa 20.0.8 (LLVM 10.0.0) 1.2.128 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301025 Graphics Details - BAR1 / Visible vRAM Size: 256 MB Java Details - OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04) Python Details - Python 3.8.10 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3960x april build-gem5: Time To Compile onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: IP Shapes 3D - f32 - CPU svt-av1: Preset 10 - Bosphorus 4K memtier-benchmark: Redis aom-av1: Speed 6 Two-Pass - Bosphorus 1080p onednn: Deconvolution Batch shapes_1d - f32 - CPU aom-av1: Speed 10 Realtime - Bosphorus 1080p compress-pbzip2: FreeBSD-13.0-RELEASE-amd64-memstick.img Compression onednn: IP Shapes 3D - u8s8f32 - CPU svt-av1: Preset 10 - Bosphorus 1080p aom-av1: Speed 9 Realtime - Bosphorus 4K rocksdb: Update Rand aom-av1: Speed 8 Realtime - Bosphorus 1080p aom-av1: Speed 9 Realtime - Bosphorus 1080p avifenc: 10, Lossless svt-av1: Preset 4 - Bosphorus 4K onednn: Convolution Batch Shapes Auto - f32 - CPU avifenc: 6, Lossless rocksdb: Rand Read build-mplayer: Time To Compile svt-av1: Preset 12 - Bosphorus 1080p aom-av1: Speed 6 Realtime - Bosphorus 4K onednn: Recurrent Neural Network Training - u8s8f32 - CPU aom-av1: Speed 8 Realtime - Bosphorus 4K onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU avifenc: 6 svt-av1: Preset 12 - Bosphorus 4K onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU ospray: particle_volume/pathtracer/real_time onednn: Recurrent Neural Network Training - f32 - CPU rocksdb: Read Rand Write Rand onednn: IP Shapes 1D - f32 - CPU aom-av1: Speed 6 Two-Pass - Bosphorus 4K onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU aom-av1: Speed 4 Two-Pass - Bosphorus 4K qe: AUSURF112 ospray: gravity_spheres_volume/dim_512/pathtracer/real_time aom-av1: Speed 6 Realtime - Bosphorus 1080p onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU svt-av1: Preset 8 - Bosphorus 1080p aom-av1: Speed 4 Two-Pass - Bosphorus 1080p avifenc: 2 rocksdb: Read While Writing aom-av1: Speed 10 Realtime - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K ospray-studio: 3 - 1080p - 16 - Path Tracer ospray-studio: 1 - 1080p - 1 - Path Tracer onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU avifenc: 0 ospray-studio: 2 - 1080p - 32 - Path Tracer ospray-studio: 3 - 1080p - 32 - Path Tracer onednn: Deconvolution Batch shapes_3d - f32 - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer svt-av1: Preset 4 - Bosphorus 1080p gravitymark: 1920 x 1080 - Vulkan ospray: particle_volume/ao/real_time onednn: IP Shapes 1D - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU gpaw: Carbon Nanotube ospray-studio: 1 - 1080p - 16 - Path Tracer ospray-studio: 2 - 1080p - 16 - Path Tracer ospray-studio: 3 - 1080p - 1 - Path Tracer ospray: gravity_spheres_volume/dim_512/ao/real_time ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray: particle_volume/scivis/real_time java-jmh: Throughput ospray-studio: 2 - 1080p - 1 - Path Tracer aom-av1: Speed 0 Two-Pass - Bosphorus 1080p aom-av1: Speed 0 Two-Pass - Bosphorus 4K gravitymark: 1920 x 1080 - OpenGL A B C 1847.91 2017.87 8.32265 126.428 1989777.97 24.95 4.17907 122.23 3.893 0.545553 303.974 48.74 715412 89.98 116.11 5.099 3.529 9.0733 8.121 119382548 15.771 493.23 13.19 2383 36.97 1.02861 1115.34 4.804 171.74 2385.2 266.079 2383.44 2730517 1.35631 11.28 2.01303 1.03167 6.45 380.61 5.97152 8.35 9.64774 162.372 10.62 47.091 4435348 51.17 58.196 23664 1226 1.35889 91.735 40446 47337 2.59369 39450 7.288 68.9 35.1453 1.11914 1117.04 152.97 19726 20220 1481 3.82926 3.67018 33.6432 48178706329.367 1269 0.47 0.21 242.295 1119.18 5.81433 110.327 1700127.57 22.71 3.8514 115.49 4.086 0.572034 307.79 47.48 728052 91.93 117.54 5.189 3.487 8.98054 7.991 118455589 15.529 500.464 13.29 2361.24 37.22 1.01652 1128.21 4.769 169.901 2410.82 263.474 2385.63 2756727 1.3443 11.38 2.01579 1.03258 6.5 378.34 5.97841 8.41 9.71357 163.478 10.67 46.786 4462048 51.13 58.411 23581 1230 1.36536 91.531 40503 47194 2.60225 39333 7.269 68.7 35.1218 1.1222 1119.2 152.695 19736 20236 1481 3.82889 3.67119 33.6396 48170212073.131 1269 0.47 0.21 234.828 1117.66 5.81028 100.938 1684834.36 23.85 3.91821 118.4 4.12 0.551255 298.821 48.25 732921 91.84 115.41 5.139 3.469 9.12778 8.08 120330555 15.596 493.841 13.36 2391.65 37.44 1.01572 1120.45 4.823 171.027 2402.17 263.343 2407.12 2734949 1.34548 11.34 2.02902 1.03985 6.48 377.75 6.01581 8.38 9.64625 163.35 10.6 46.898 4440467 50.88 58.088 23707 1232 1.36052 91.964 40629 47382 2.59274 39471 7.291 68.9 35.2229 1.12144 1119.45 152.901 19761 20232 1482 3.82705 3.66954 33.6461 48173519644.261 1269 0.47 0.21 OpenBenchmarking.org
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 21.2 Time To Compile A B C 400 800 1200 1600 2000 1847.91 242.30 234.83
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU A B C 400 800 1200 1600 2000 2017.87 1119.18 1117.66 MIN: 1119.24 MIN: 1108.33 MIN: 1106.34 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU A B C 2 4 6 8 10 8.32265 5.81433 5.81028 MIN: 5.8 MIN: 5.77 MIN: 5.77 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 10 - Input: Bosphorus 4K A B C 30 60 90 120 150 126.43 110.33 100.94 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Memtier_benchmark Protocol: Redis OpenBenchmarking.org Ops/sec, More Is Better Memtier_benchmark 1.3 Protocol: Redis A B C 400K 800K 1200K 1600K 2000K 1989777.97 1700127.57 1684834.36 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p A B C 6 12 18 24 30 24.95 22.71 23.85 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU A B C 0.9403 1.8806 2.8209 3.7612 4.7015 4.17907 3.85140 3.91821 MIN: 3.42 MIN: 3.38 MIN: 3.46 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p A B C 30 60 90 120 150 122.23 115.49 118.40 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Parallel BZIP2 Compression FreeBSD-13.0-RELEASE-amd64-memstick.img Compression OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.1.13 FreeBSD-13.0-RELEASE-amd64-memstick.img Compression A B C 0.927 1.854 2.781 3.708 4.635 3.893 4.086 4.120 1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU A B C 0.1287 0.2574 0.3861 0.5148 0.6435 0.545553 0.572034 0.551255 MIN: 0.51 MIN: 0.54 MIN: 0.51 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 10 - Input: Bosphorus 1080p A B C 70 140 210 280 350 303.97 307.79 298.82 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K A B C 11 22 33 44 55 48.74 47.48 48.25 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Facebook RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Update Random A B C 160K 320K 480K 640K 800K 715412 728052 732921 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p A B C 20 40 60 80 100 89.98 91.93 91.84 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p A B C 30 60 90 120 150 116.11 117.54 115.41 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 10, Lossless A B C 1.1675 2.335 3.5025 4.67 5.8375 5.099 5.189 5.139 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 4 - Input: Bosphorus 4K A B C 0.794 1.588 2.382 3.176 3.97 3.529 3.487 3.469 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU A B C 3 6 9 12 15 9.07330 8.98054 9.12778 MIN: 8.97 MIN: 8.86 MIN: 9 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 6, Lossless A B C 2 4 6 8 10 8.121 7.991 8.080 1. (CXX) g++ options: -O3 -fPIC -lm
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Random Read A B C 30M 60M 90M 120M 150M 119382548 118455589 120330555 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Timed MPlayer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed MPlayer Compilation 1.5 Time To Compile A B C 4 8 12 16 20 15.77 15.53 15.60
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 12 - Input: Bosphorus 1080p A B C 110 220 330 440 550 493.23 500.46 493.84 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K A B C 3 6 9 12 15 13.19 13.29 13.36 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU A B C 500 1000 1500 2000 2500 2383.00 2361.24 2391.65 MIN: 2377.95 MIN: 2357.25 MIN: 2386.39 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K A B C 9 18 27 36 45 36.97 37.22 37.44 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU A B C 0.2314 0.4628 0.6942 0.9256 1.157 1.02861 1.01652 1.01572 MIN: 0.89 MIN: 0.9 MIN: 0.89 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU A B C 200 400 600 800 1000 1115.34 1128.21 1120.45 MIN: 1104.79 MIN: 1118.72 MIN: 1109.68 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 6 A B C 1.0852 2.1704 3.2556 4.3408 5.426 4.804 4.769 4.823 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 12 - Input: Bosphorus 4K A B C 40 80 120 160 200 171.74 169.90 171.03 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU A B C 500 1000 1500 2000 2500 2385.20 2410.82 2402.17 MIN: 2381.46 MIN: 2405.51 MIN: 2395.89 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OSPray Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/pathtracer/real_time A B C 60 120 180 240 300 266.08 263.47 263.34
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU A B C 500 1000 1500 2000 2500 2383.44 2385.63 2407.12 MIN: 2378.89 MIN: 2377.77 MIN: 2403.55 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Facebook RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read Random Write Random A B C 600K 1200K 1800K 2400K 3000K 2730517 2756727 2734949 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU A B C 0.3052 0.6104 0.9156 1.2208 1.526 1.35631 1.34430 1.34548 MIN: 1.31 MIN: 1.3 MIN: 1.31 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K A B C 3 6 9 12 15 11.28 11.38 11.34 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU A B C 0.4565 0.913 1.3695 1.826 2.2825 2.01303 2.01579 2.02902 MIN: 1.94 MIN: 1.94 MIN: 1.96 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU A B C 0.234 0.468 0.702 0.936 1.17 1.03167 1.03258 1.03985 MIN: 0.9 MIN: 0.9 MIN: 0.9 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K A B C 2 4 6 8 10 6.45 6.50 6.48 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 7.0 Input: AUSURF112 A B C 80 160 240 320 400 380.61 378.34 377.75 1. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi
OSPray Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time A B C 2 4 6 8 10 5.97152 5.97841 6.01581
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p A B C 2 4 6 8 10 8.35 8.41 8.38 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU A B C 3 6 9 12 15 9.64774 9.71357 9.64625 MIN: 9.51 MIN: 9.48 MIN: 9.43 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 8 - Input: Bosphorus 1080p A B C 40 80 120 160 200 162.37 163.48 163.35 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p A B C 3 6 9 12 15 10.62 10.67 10.60 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 2 A B C 11 22 33 44 55 47.09 46.79 46.90 1. (CXX) g++ options: -O3 -fPIC -lm
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read While Writing A B C 1000K 2000K 3000K 4000K 5000K 4435348 4462048 4440467 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K A B C 12 24 36 48 60 51.17 51.13 50.88 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 8 - Input: Bosphorus 4K A B C 13 26 39 52 65 58.20 58.41 58.09 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer A B C 5K 10K 15K 20K 25K 23664 23581 23707 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer A B C 300 600 900 1200 1500 1226 1230 1232 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU A B C 0.3072 0.6144 0.9216 1.2288 1.536 1.35889 1.36536 1.36052 MIN: 1.31 MIN: 1.32 MIN: 1.31 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 0 A B C 20 40 60 80 100 91.74 91.53 91.96 1. (CXX) g++ options: -O3 -fPIC -lm
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 9K 18K 27K 36K 45K 40446 40503 40629 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 10K 20K 30K 40K 50K 47337 47194 47382 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU A B C 0.5855 1.171 1.7565 2.342 2.9275 2.59369 2.60225 2.59274 MIN: 2.52 MIN: 2.54 MIN: 2.55 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 8K 16K 24K 32K 40K 39450 39333 39471 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 4 - Input: Bosphorus 1080p A B C 2 4 6 8 10 7.288 7.269 7.291 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
GravityMark Resolution: 1920 x 1080 - Renderer: Vulkan OpenBenchmarking.org Frames Per Second, More Is Better GravityMark 1.53 Resolution: 1920 x 1080 - Renderer: Vulkan A B C 15 30 45 60 75 68.9 68.7 68.9
OSPray Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/ao/real_time A B C 8 16 24 32 40 35.15 35.12 35.22
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU A B C 0.2525 0.505 0.7575 1.01 1.2625 1.11914 1.12220 1.12144 MIN: 1.09 MIN: 1.1 MIN: 1.1 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU A B C 200 400 600 800 1000 1117.04 1119.20 1119.45 MIN: 1107.37 MIN: 1108.94 MIN: 1109.94 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
GPAW Input: Carbon Nanotube OpenBenchmarking.org Seconds, Fewer Is Better GPAW 22.1 Input: Carbon Nanotube A B C 30 60 90 120 150 152.97 152.70 152.90 1. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer A B C 4K 8K 12K 16K 20K 19726 19736 19761 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer A B C 4K 8K 12K 16K 20K 20220 20236 20232 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer A B C 300 600 900 1200 1500 1481 1481 1482 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/ao/real_time A B C 0.8616 1.7232 2.5848 3.4464 4.308 3.82926 3.82889 3.82705
OSPray Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time A B C 0.826 1.652 2.478 3.304 4.13 3.67018 3.67119 3.66954
OSPray Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/scivis/real_time A B C 8 16 24 32 40 33.64 33.64 33.65
Java JMH Throughput OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput A B C 10000M 20000M 30000M 40000M 50000M 48178706329.37 48170212073.13 48173519644.26
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer A B C 300 600 900 1200 1500 1269 1269 1269 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p A B C 0.1058 0.2116 0.3174 0.4232 0.529 0.47 0.47 0.47 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K A B C 0.0473 0.0946 0.1419 0.1892 0.2365 0.21 0.21 0.21 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Phoronix Test Suite v10.8.4