3700x march AMD Ryzen 7 3700X 8-Core testing with a Gigabyte A320M-S2H-CF (F52a BIOS) and HIS AMD Radeon HD 7750/8740 / R7 250E 1GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2204019-NE-3700XMARC38&grs&sor .
3700x march Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution A B C AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads) Gigabyte A320M-S2H-CF (F52a BIOS) AMD Starship/Matisse 8GB 240GB TOSHIBA RC100 HIS AMD Radeon HD 7750/8740 / R7 250E 1GB AMD Oland/Hainan/Cape VA2431 Realtek RTL8111/8168/8411 Ubuntu 20.04 5.11.0-38-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.9 4.5 Mesa 20.0.8 (LLVM 10.0.0) GCC 9.4.0 ext4 1920x1080 4.5 Mesa 21.2.6 (LLVM 12.0.0) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Java Details - OpenJDK Runtime Environment (build 11.0.14.1+1-Ubuntu-0ubuntu1.20.04) Python Details - Python 3.8.10 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3700x march onednn: IP Shapes 1D - f32 - CPU perf-bench: Epoll Wait rodinia: OpenMP Streamcluster aom-av1: Speed 10 Realtime - Bosphorus 1080p toktx: Zstd Compression 9 astcenc: Medium perf-bench: Futex Hash onednn: IP Shapes 3D - f32 - CPU rodinia: OpenMP CFD Solver aom-av1: Speed 9 Realtime - Bosphorus 4K aom-av1: Speed 10 Realtime - Bosphorus 4K onednn: IP Shapes 3D - u8s8f32 - CPU build-gem5: Time To Compile perf-bench: Memset 1MB svt-av1: Preset 12 - Bosphorus 4K rocksdb: Rand Read onednn: Deconvolution Batch shapes_1d - f32 - CPU srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM rodinia: OpenMP HotSpot3D aom-av1: Speed 9 Realtime - Bosphorus 1080p perf-bench: Memcpy 1MB onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU perf-bench: Syscall Basic draco: Church Facade toktx: UASTC 3 onednn: Recurrent Neural Network Inference - u8s8f32 - CPU svt-av1: Preset 12 - Bosphorus 1080p toktx: Zstd Compression 19 svt-av1: Preset 10 - Bosphorus 4K etcpak: DXT1 onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU openjpeg: NASA Curiosity Panorama M34 rocksdb: Read While Writing aom-av1: Speed 8 Realtime - Bosphorus 1080p svt-av1: Preset 10 - Bosphorus 1080p etcpak: ETC1 etcpak: ETC1 + Dithering aom-av1: Speed 8 Realtime - Bosphorus 4K perf-bench: Sched Pipe svt-av1: Preset 4 - Bosphorus 4K aom-av1: Speed 6 Realtime - Bosphorus 1080p svt-av1: Preset 4 - Bosphorus 1080p srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM svt-av1: Preset 8 - Bosphorus 4K srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM rocksdb: Read Rand Write Rand onednn: Recurrent Neural Network Training - f32 - CPU libgav1: Chimera 1080p onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU kvazaar: Bosphorus 1080p - Ultra Fast libgav1: Summer Nature 1080p aom-av1: Speed 6 Two-Pass - Bosphorus 1080p build-linux-kernel: defconfig ospray: particle_volume/pathtracer/real_time ospray: gravity_spheres_volume/dim_512/ao/real_time rocksdb: Update Rand libgav1: Chimera 1080p 10-bit draco: Lion kvazaar: Bosphorus 1080p - Very Fast libgav1: Summer Nature 4K srsran: OFDM_Test svt-av1: Preset 8 - Bosphorus 1080p onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU aom-av1: Speed 6 Realtime - Bosphorus 4K onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM rodinia: OpenMP LavaMD onednn: Convolution Batch Shapes Auto - f32 - CPU kvazaar: Bosphorus 1080p - Medium gcrypt: ospray: particle_volume/scivis/real_time srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM ospray-studio: 1 - 1080p - 16 - Path Tracer rodinia: OpenMP Leukocyte perf-bench: Futex Lock-Pi onednn: Deconvolution Batch shapes_3d - f32 - CPU srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM onednn: IP Shapes 1D - u8s8f32 - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer onednn: Recurrent Neural Network Training - u8s8f32 - CPU srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU toktx: UASTC 3 + Zstd Compression 19 ospray: particle_volume/ao/real_time kvazaar: Bosphorus 4K - Very Fast ospray-studio: 3 - 1080p - 1 - Path Tracer ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray-studio: 3 - 1080p - 16 - Path Tracer java-jmh: Throughput aom-av1: Speed 6 Two-Pass - Bosphorus 4K kvazaar: Bosphorus 4K - Ultra Fast ospray-studio: 2 - 1080p - 16 - Path Tracer ospray: gravity_spheres_volume/dim_512/pathtracer/real_time tachyon: Total Time srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM ospray-studio: 1 - 1080p - 1 - Path Tracer ospray-studio: 2 - 1080p - 1 - Path Tracer kvazaar: Bosphorus 4K - Medium toktx: UASTC 4 + Zstd Compression 19 ospray-studio: 3 - 1080p - 32 - Path Tracer etcpak: ETC2 astcenc: Thorough astcenc: Exhaustive ospray-studio: 2 - 1080p - 32 - Path Tracer build-nodejs: A B C 6.67346 70515 33.878 122.26 2.545 5.0545 4824413 10.248 22.75 51.95 54.8 2.52331 527.795 66.394612 48.365 40413130 6.30423 146.4 153.6 88.953 127.73 13.592719 2780.14 21.3262 20651833 7531 12.56 2768.09 374.172 19.569 42.919 1329.935 1.27343 2793.39 62855 1685693 102.81 209.889 305.018 288.656 38.8 344927 2.211 8.38 7.203 386.6 26.163 167.1 1451343 3960.3 169.41 4.95672 126.81 223.9 23.98 96.139 166.469 1.30538 438705 57.63 4957 70.07 65.55 125000000 94.87 4000.15 13.57 4.61914 105 240.61 23.0464 37.71 203.974 11.209 58.9 65883 124.859 1022 6.73567 387.1 2.50697 125344 3978.05 358.9 158.1 3.63064 18.338 11.8457 16.92 4472 1.25609 77460 15971827325.799 8.29 29.02 67031 2.02104 164.5427 354.4 3721 3815 7.52 340.447 148288 179.383 9.1383 83.4929 128042 5.38239 76990 33.731 132.23 2.582 5.401 5141654 10.4932 23.991 54.5 55.72 2.53097 511.637 69.370794 47.78 38836394 6.55418 146 159.4 87.572 123.79 13.921405 2785.84 21.4534 20221808 7609 12.78 2807.28 370.421 19.719 43.585 1350.363 1.28282 2829.23 62242 1694715 103.63 208.721 306.517 285.141 39.2 349100 2.233 8.47 7.28 383.6 26.309 167.8 1456612 3969.06 168.9 4.96916 127.71 224.06 24.15 95.675 167.46 1.30382 441448 57.95 4980 69.61 65.98 124200000 95.243 3989.49 13.49 4.63708 105.6 239.251 23.1473 37.92 204.773 11.2413 59.2 65777 124.749 1027 6.75419 385.7 2.51867 125761 3995.75 359.5 157.6 3.64345 18.396 11.8107 16.97 4461 1.25248 77272 16016773575.675 8.31 28.97 67130 2.01811 164.4666 354.3 3715 3810 7.53 340.135 148470 179.572 9.1454 83.4223 128064 5.38075 78006 31.155 127.1 2.746 5.1039 5121390 10.8503 22.699 54.86 52.82 2.65863 504.2 66.398269 49.797 40469857 6.35941 141 159 90.817 124.53 13.589469 2842.52 21.7906 20599176 7677 12.765 2814.1 368.321 19.411 43.356 1335.559 1.26382 2834.58 61979 1709393 102.23 211.49 302.705 286.024 39.27 345671 2.235 8.47 7.21 387.7 26.438 168.8 1442100 3997.84 167.87 5.00068 126.62 222.17 24.18 96.405 167.683 1.3133 441787 57.56 4947 69.96 65.69 124600000 95.449 3976.52 13.53 4.61019 105.4 239.986 23.1748 37.78 205.106 11.2682 59.1 65552 124.244 1026 6.76793 385.3 2.51488 125186 3977.63 357.9 158.2 3.64286 18.333 11.8137 16.97 4459 1.25364 77238 15985465912.414 8.29 29.03 66999 2.02204 164.2377 354.9 3718 3809 7.52 340.007 148448 179.367 9.1352 83.402 128171 OpenBenchmarking.org
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU C B A 2 4 6 8 10 5.38075 5.38239 6.67346 MIN: 5.25 MIN: 5.25 MIN: 5.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
perf-bench Benchmark: Epoll Wait OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Epoll Wait C B A 20K 40K 60K 80K 100K 78006 76990 70515 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster C B A 8 16 24 32 40 31.16 33.73 33.88 1. (CXX) g++ options: -O2 -lOpenCL
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p B C A 30 60 90 120 150 132.23 127.10 122.26 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
KTX-Software toktx Settings: Zstd Compression 9 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 9 A B C 0.6179 1.2358 1.8537 2.4716 3.0895 2.545 2.582 2.746
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Medium A C B 1.2152 2.4304 3.6456 4.8608 6.076 5.0545 5.1039 5.4010 1. (CXX) g++ options: -O3 -flto -pthread
perf-bench Benchmark: Futex Hash OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Hash B C A 1.1M 2.2M 3.3M 4.4M 5.5M 5141654 5121390 4824413 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU A B C 3 6 9 12 15 10.25 10.49 10.85 MIN: 9.82 MIN: 10.24 MIN: 10.66 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver C A B 6 12 18 24 30 22.70 22.75 23.99 1. (CXX) g++ options: -O2 -lOpenCL
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K C B A 12 24 36 48 60 54.86 54.50 51.95 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K B A C 13 26 39 52 65 55.72 54.80 52.82 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU A B C 0.5982 1.1964 1.7946 2.3928 2.991 2.52331 2.53097 2.65863 MIN: 2.47 MIN: 2.48 MIN: 2.61 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 21.2 Time To Compile C B A 110 220 330 440 550 504.20 511.64 527.80
perf-bench Benchmark: Memset 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memset 1MB B C A 15 30 45 60 75 69.37 66.40 66.39 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 12 - Input: Bosphorus 4K C A B 11 22 33 44 55 49.80 48.37 47.78 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Random Read C A B 9M 18M 27M 36M 45M 40469857 40413130 38836394 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU A C B 2 4 6 8 10 6.30423 6.35941 6.55418 MIN: 5.99 MIN: 6.03 MIN: 6.02 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM A B C 30 60 90 120 150 146.4 146.0 141.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM B C A 40 80 120 160 200 159.4 159.0 153.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D B A C 20 40 60 80 100 87.57 88.95 90.82 1. (CXX) g++ options: -O2 -lOpenCL
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p A C B 30 60 90 120 150 127.73 124.53 123.79 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
perf-bench Benchmark: Memcpy 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memcpy 1MB B A C 4 8 12 16 20 13.92 13.59 13.59 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU A B C 600 1200 1800 2400 3000 2780.14 2785.84 2842.52 MIN: 2760.3 MIN: 2768.25 MIN: 2827.85 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU A B C 5 10 15 20 25 21.33 21.45 21.79 MIN: 21.17 MIN: 21.31 MIN: 21.31 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
perf-bench Benchmark: Syscall Basic OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Syscall Basic A C B 4M 8M 12M 16M 20M 20651833 20599176 20221808 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.0 Model: Church Facade A B C 1600 3200 4800 6400 8000 7531 7609 7677 1. (CXX) g++ options: -O3
KTX-Software toktx Settings: UASTC 3 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 A C B 3 6 9 12 15 12.56 12.77 12.78
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU A B C 600 1200 1800 2400 3000 2768.09 2807.28 2814.10 MIN: 2742.44 MIN: 2785.2 MIN: 2783.96 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 12 - Input: Bosphorus 1080p A B C 80 160 240 320 400 374.17 370.42 368.32 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
KTX-Software toktx Settings: Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 19 C A B 5 10 15 20 25 19.41 19.57 19.72
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 10 - Input: Bosphorus 4K B C A 10 20 30 40 50 43.59 43.36 42.92 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 B C A 300 600 900 1200 1500 1350.36 1335.56 1329.94 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU C A B 0.2886 0.5772 0.8658 1.1544 1.443 1.26382 1.27343 1.28282 MIN: 1.19 MIN: 1.19 MIN: 1.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU A B C 600 1200 1800 2400 3000 2793.39 2829.23 2834.58 MIN: 2780.33 MIN: 2801.68 MIN: 2818.46 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenJPEG Encode: NASA Curiosity Panorama M34 OpenBenchmarking.org ms, Fewer Is Better OpenJPEG 2.4 Encode: NASA Curiosity Panorama M34 C B A 13K 26K 39K 52K 65K 61979 62242 62855 1. (CXX) g++ options: -rdynamic
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read While Writing C B A 400K 800K 1200K 1600K 2000K 1709393 1694715 1685693 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p B A C 20 40 60 80 100 103.63 102.81 102.23 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 10 - Input: Bosphorus 1080p C A B 50 100 150 200 250 211.49 209.89 208.72 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 B A C 70 140 210 280 350 306.52 305.02 302.71 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering A C B 60 120 180 240 300 288.66 286.02 285.14 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K C B A 9 18 27 36 45 39.27 39.20 38.80 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
perf-bench Benchmark: Sched Pipe OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Sched Pipe B C A 70K 140K 210K 280K 350K 349100 345671 344927 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 4 - Input: Bosphorus 4K C B A 0.5029 1.0058 1.5087 2.0116 2.5145 2.235 2.233 2.211 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p C B A 2 4 6 8 10 8.47 8.47 8.38 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 4 - Input: Bosphorus 1080p B C A 2 4 6 8 10 7.280 7.210 7.203 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM C A B 80 160 240 320 400 387.7 386.6 383.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 8 - Input: Bosphorus 4K C B A 6 12 18 24 30 26.44 26.31 26.16 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM C B A 40 80 120 160 200 168.8 167.8 167.1 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Facebook RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read Random Write Random B A C 300K 600K 900K 1200K 1500K 1456612 1451343 1442100 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU A B C 900 1800 2700 3600 4500 3960.30 3969.06 3997.84 MIN: 3946.69 MIN: 3963.33 MIN: 3986.46 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Chimera 1080p A B C 40 80 120 160 200 169.41 168.90 167.87 1. (CXX) g++ options: -O3 -lpthread -lrt
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU A B C 1.1252 2.2504 3.3756 4.5008 5.626 4.95672 4.96916 5.00068 MIN: 4.84 MIN: 4.87 MIN: 4.92 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast B A C 30 60 90 120 150 127.71 126.81 126.62 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Summer Nature 1080p B A C 50 100 150 200 250 224.06 223.90 222.17 1. (CXX) g++ options: -O3 -lpthread -lrt
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p C B A 6 12 18 24 30 24.18 24.15 23.98 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.16 Build: defconfig B A C 20 40 60 80 100 95.68 96.14 96.41
OSPray Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/pathtracer/real_time C B A 40 80 120 160 200 167.68 167.46 166.47
OSPray Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/ao/real_time C A B 0.2955 0.591 0.8865 1.182 1.4775 1.31330 1.30538 1.30382
Facebook RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Update Random C B A 90K 180K 270K 360K 450K 441787 441448 438705 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Chimera 1080p 10-bit B A C 13 26 39 52 65 57.95 57.63 57.56 1. (CXX) g++ options: -O3 -lpthread -lrt
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.0 Model: Lion C A B 1100 2200 3300 4400 5500 4947 4957 4980 1. (CXX) g++ options: -O3
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Very Fast A C B 16 32 48 64 80 70.07 69.96 69.61 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Summer Nature 4K B C A 15 30 45 60 75 65.98 65.69 65.55 1. (CXX) g++ options: -O3 -lpthread -lrt
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.10 Test: OFDM_Test A C B 30M 60M 90M 120M 150M 125000000 124600000 124200000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 8 - Input: Bosphorus 1080p C B A 20 40 60 80 100 95.45 95.24 94.87 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU C B A 900 1800 2700 3600 4500 3976.52 3989.49 4000.15 MIN: 3968.89 MIN: 3980 MIN: 3984.76 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K A C B 3 6 9 12 15 13.57 13.53 13.49 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU C A B 1.0433 2.0866 3.1299 4.1732 5.2165 4.61019 4.61914 4.63708 MIN: 4.45 MIN: 4.43 MIN: 4.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM B C A 20 40 60 80 100 105.6 105.4 105.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD B C A 50 100 150 200 250 239.25 239.99 240.61 1. (CXX) g++ options: -O2 -lOpenCL
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU A B C 6 12 18 24 30 23.05 23.15 23.17 MIN: 22.76 MIN: 22.87 MIN: 23 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Medium B C A 9 18 27 36 45 37.92 37.78 37.71 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 A B C 50 100 150 200 250 203.97 204.77 205.11 1. (CC) gcc options: -O2 -fvisibility=hidden
OSPray Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/scivis/real_time C B A 3 6 9 12 15 11.27 11.24 11.21
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM B C A 13 26 39 52 65 59.2 59.1 58.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer C B A 14K 28K 42K 56K 70K 65552 65777 65883 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte C B A 30 60 90 120 150 124.24 124.75 124.86 1. (CXX) g++ options: -O2 -lOpenCL
perf-bench Benchmark: Futex Lock-Pi OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Lock-Pi B C A 200 400 600 800 1000 1027 1026 1022 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU A B C 2 4 6 8 10 6.73567 6.75419 6.76793 MIN: 6.64 MIN: 6.66 MIN: 6.63 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM A B C 80 160 240 320 400 387.1 385.7 385.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU A C B 0.5667 1.1334 1.7001 2.2668 2.8335 2.50697 2.51488 2.51867 MIN: 2.47 MIN: 2.48 MIN: 2.47 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer C A B 30K 60K 90K 120K 150K 125186 125344 125761 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU C A B 900 1800 2700 3600 4500 3977.63 3978.05 3995.75 MIN: 3967.61 MIN: 3963.46 MIN: 3983.37 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM B A C 80 160 240 320 400 359.5 358.9 357.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM C A B 30 60 90 120 150 158.2 158.1 157.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU A C B 0.8198 1.6396 2.4594 3.2792 4.099 3.63064 3.64286 3.64345 MIN: 3.47 MIN: 3.46 MIN: 3.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
KTX-Software toktx Settings: UASTC 3 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 + Zstd Compression 19 C A B 5 10 15 20 25 18.33 18.34 18.40
OSPray Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/ao/real_time A C B 3 6 9 12 15 11.85 11.81 11.81
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Very Fast C B A 4 8 12 16 20 16.97 16.97 16.92 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer C B A 1000 2000 3000 4000 5000 4459 4461 4472 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time A C B 0.2826 0.5652 0.8478 1.1304 1.413 1.25609 1.25364 1.25248
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer C B A 17K 34K 51K 68K 85K 77238 77272 77460 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Java JMH Throughput OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput B C A 3000M 6000M 9000M 12000M 15000M 16016773575.68 15985465912.41 15971827325.80
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K B C A 2 4 6 8 10 8.31 8.29 8.29 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast C A B 7 14 21 28 35 29.03 29.02 28.97 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer C A B 14K 28K 42K 56K 70K 66999 67031 67130 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time C A B 0.455 0.91 1.365 1.82 2.275 2.02204 2.02104 2.01811
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99.2 Total Time C B A 40 80 120 160 200 164.24 164.47 164.54 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM C A B 80 160 240 320 400 354.9 354.4 354.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer B C A 800 1600 2400 3200 4000 3715 3718 3721 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer C B A 800 1600 2400 3200 4000 3809 3810 3815 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Medium B C A 2 4 6 8 10 7.53 7.52 7.52 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
KTX-Software toktx Settings: UASTC 4 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 4 + Zstd Compression 19 C B A 70 140 210 280 350 340.01 340.14 340.45
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A C B 30K 60K 90K 120K 150K 148288 148448 148470 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 B A C 40 80 120 160 200 179.57 179.38 179.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Thorough C A B 3 6 9 12 15 9.1352 9.1383 9.1454 1. (CXX) g++ options: -O3 -flto -pthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Exhaustive C B A 20 40 60 80 100 83.40 83.42 83.49 1. (CXX) g++ options: -O3 -flto -pthread
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 30K 60K 90K 120K 150K 128042 128064 128171 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Phoronix Test Suite v10.8.5