2 x Intel Xeon Platinum 8280 testing with a GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) and llvmpipe on Ubuntu 21.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2206050-NE-CASCADELA50 cascade lake summer 2022 - Phoronix Test Suite cascade lake summer 2022 2 x Intel Xeon Platinum 8280 testing with a GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) and llvmpipe on Ubuntu 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2206050-NE-CASCADELA50&grs&rdt .
cascade lake summer 2022 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution A B C 2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads) GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS) Intel Sky Lake-E DMI3 Registers 384GB 280GB INTEL SSDPED1D280GA llvmpipe VE228 2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE Ubuntu 21.04 5.11.0-40-generic (x86_64) GNOME Shell 3.38.4 X Server + Wayland 4.5 Mesa 21.0.1 (LLVM 11.0.1 256 bits) GCC 10.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003102 Java Details - OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.04) Python Details - Python 3.9.5 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
cascade lake summer 2022 stress-ng: CPU Cache onnx: super-resolution-10 - CPU - Standard onnx: fcn-resnet101-11 - CPU - Standard apache: 20 onnx: bertsquad-12 - CPU - Standard perf-bench: Epoll Wait apache: 200 apache: 1000 renaissance: Scala Dotty aom-av1: Speed 8 Realtime - Bosphorus 1080p onnx: GPT-2 - CPU - Standard aom-av1: Speed 9 Realtime - Bosphorus 1080p onednn: Recurrent Neural Network Training - f32 - CPU aom-av1: Speed 6 Realtime - Bosphorus 4K stress-ng: Atomic renaissance: Savina Reactors.IO svt-vp9: VMAF Optimized - Bosphorus 1080p x264: Bosphorus 1080p aom-av1: Speed 6 Two-Pass - Bosphorus 4K svt-vp9: Visual Quality Optimized - Bosphorus 1080p svt-vp9: VMAF Optimized - Bosphorus 4K renaissance: In-Memory Database Shootout svt-vp9: PSNR/SSIM Optimized - Bosphorus 4K stress-ng: Futex avifenc: 6 avifenc: 2 renaissance: Apache Spark ALS svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p renaissance: Rand Forest perf-bench: Memcpy 1MB renaissance: Finagle HTTP Requests aom-av1: Speed 10 Realtime - Bosphorus 4K tensorflow-lite: Inception ResNet V2 svt-vp9: Visual Quality Optimized - Bosphorus 4K perf-bench: Sched Pipe aom-av1: Speed 6 Two-Pass - Bosphorus 1080p tensorflow-lite: Mobilenet Quant stress-ng: NUMA onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU perf-bench: Memset 1MB apache: 100 svt-av1: Preset 8 - Bosphorus 1080p aom-av1: Speed 9 Realtime - Bosphorus 4K aom-av1: Speed 4 Two-Pass - Bosphorus 1080p onednn: Recurrent Neural Network Inference - u8s8f32 - CPU perf-bench: Futex Lock-Pi renaissance: Genetic Algorithm Using Jenetics + Futures aom-av1: Speed 10 Realtime - Bosphorus 1080p renaissance: ALS Movie Lens apache: 500 svt-hevc: 7 - Bosphorus 4K svt-av1: Preset 12 - Bosphorus 4K apache: 1 onednn: IP Shapes 3D - u8s8f32 - CPU nginx: 500 renaissance: Akka Unbalanced Cobwebbed Tree onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU svt-hevc: 7 - Bosphorus 1080p svt-av1: Preset 10 - Bosphorus 4K tensorflow-lite: NASNet Mobile svt-av1: Preset 10 - Bosphorus 1080p onednn: IP Shapes 1D - u8s8f32 - CPU svt-av1: Preset 4 - Bosphorus 4K nettle: aes256 stress-ng: Socket Activity rocksdb: Read While Writing simdjson: Kostya nettle: chacha etcpak: Multi-Threaded - DXT1 svt-av1: Preset 12 - Bosphorus 1080p tensorflow-lite: SqueezeNet aom-av1: Speed 6 Realtime - Bosphorus 1080p stress-ng: Context Switching nettle: sha512 svt-av1: Preset 8 - Bosphorus 4K nginx: 20 avifenc: 6, Lossless stress-ng: Glibc C String Functions tensorflow-lite: Inception V4 onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU renaissance: Apache Spark PageRank renaissance: Apache Spark Bayes perf-bench: Syscall Basic rocksdb: Rand Read glibc-bench: log2 etcpak: Multi-Threaded - ETC2 onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU aom-av1: Speed 8 Realtime - Bosphorus 4K onednn: IP Shapes 1D - f32 - CPU etcpak: Single-Threaded - DXT1 rocksdb: Read Rand Write Rand svt-hevc: 10 - Bosphorus 4K onnx: yolov4 - CPU - Standard onednn: Recurrent Neural Network Training - u8s8f32 - CPU svt-av1: Preset 4 - Bosphorus 1080p java-jmh: Throughput stress-ng: MMAP nettle: poly1305-aes onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU nginx: 100 ospray: gravity_spheres_volume/dim_512/ao/real_time simdjson: LargeRand nginx: 200 aom-av1: Speed 4 Two-Pass - Bosphorus 4K glibc-bench: modf ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time svt-hevc: 10 - Bosphorus 1080p glibc-bench: cos glibc-bench: sin onednn: IP Shapes 3D - bf16bf16bf16 - CPU nginx: 1 glibc-bench: pthread_once tensorflow-lite: Mobilenet Float ospray: particle_volume/pathtracer/real_time x264: Bosphorus 4K simdjson: PartialTweets avifenc: 0 glibc-bench: sinh ospray: gravity_spheres_volume/dim_512/scivis/real_time onednn: Convolution Batch Shapes Auto - f32 - CPU glibc-bench: sqrt build-mplayer: Time To Compile glibc-bench: exp glibc-bench: ffs simdjson: DistinctUserID stress-ng: Vector Math onednn: IP Shapes 3D - f32 - CPU stress-ng: CPU Stress stress-ng: Forking onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onnx: ArcFace ResNet-100 - CPU - Standard ospray: gravity_spheres_volume/dim_512/pathtracer/real_time ospray-studio: 2 - 1080p - 16 - Path Tracer onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU stress-ng: System V Message Passing onednn: IP Shapes 1D - bf16bf16bf16 - CPU nginx: 1000 stress-ng: Memory Copying gromacs: MPI CPU - water_GMX50_bare stress-ng: Malloc onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU avifenc: 10, Lossless svt-hevc: 1 - Bosphorus 4K svt-hevc: 1 - Bosphorus 1080p simdjson: TopTweet onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU stress-ng: Semaphores etcpak: Single-Threaded - ETC2 glibc-bench: ffsll stress-ng: IO_uring ospray-studio: 3 - 1080p - 16 - Path Tracer perf-bench: Futex Hash ospray-studio: 3 - 1080p - 1 - Path Tracer ospray-studio: 1 - 1080p - 16 - Path Tracer onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU glibc-bench: atanh ospray-studio: 1 - 1080p - 32 - Path Tracer glibc-bench: tanh stress-ng: MEMFD ospray-studio: 3 - 1080p - 32 - Path Tracer glibc-bench: sincos onednn: Deconvolution Batch shapes_3d - f32 - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU rocksdb: Update Rand stress-ng: SENDFILE glibc-bench: asinh stress-ng: Glibc Qsort Data Sorting stress-ng: Matrix Math stress-ng: Crypto ospray-studio: 2 - 1080p - 32 - Path Tracer stress-ng: x86_64 RdRand ospray-studio: 2 - 1080p - 1 - Path Tracer aom-av1: Speed 0 Two-Pass - Bosphorus 1080p aom-av1: Speed 0 Two-Pass - Bosphorus 4K A B C 26.3 6359 376 24165.82 691 4782 126245.9 105438.13 1108.2 36.47 10161 54.95 869.415 5.73 148748.4 13994.2 275.3 116.81 5.57 208.91 111.31 11794.7 108.16 908797.5 6.249 55.606 84016.6 257.91 1166.1 16.830448 6808.3 24.91 47840.8 101.74 149542 8.9 4833.58 583.66 0.343941 55.720119 97283.8 94.091 21.45 6.04 458.421 62 8292.8 55.88 50872.9 122830.61 99.19 107.289 2809.72 1.10589 165114.5 21915.1 2.94845 222.88 89.538 65535 143.11 1.40425 2.718 6285.45 27081.43 6680014 2.5 1223.77 2466.422 206.24 5446.51 5.67 4008133.57 498.39 44.109 141460.52 10.316 5765765 37643.7 0.47808 455.061 4107.5 859.3 16452745 222126242 19.8486 2455.861 871.308 13.96 1.22929 191.921 2904862 133.39 529 880.417 5.472 90801514532.112 1686.14 3708.97 0.23244 152697.49 13.5826 0.86 159676.56 3.53 6.1103 108.23 106.42 339.56 68.2399 58.7263 2.08511 29191.38 4.739 3411.68 194.847 46.94 3.69 97.005 25.4906 13.4348 3.92995 6.15346 12.698 15.5737 4.79113 4.28 172699.69 2.99374 105656.32 47213.65 0.307546 1700 16.2289 8564 453.108 3944584.98 3.72981 164799.04 7011.74 5.836 217472149.53 5.03515 6.973 7.92 29.01 4.22 3.69564 6707967.45 189.233 4.5738 4871109.61 10175 2870019 637 8305 19.2236 4.45509 32.295 16607 35.8504 2420.44 20298 41.3711 1.23463 523 3.14788 175746 859564.82 28.2875 690.47 175964.38 59728.27 17084 503080.37 538 0.36 0.16 15.34 10331 256 23084.76 824 4015 137891.33 992.8 34.33 10159 49.99 864.653 5.79 137381.5 13313.1 281.7 123.32 5.66 207.79 104.8 12498.3 114.24 948927.76 5.949 54.768 86541.0 268.4 1217.5 17.393043 7137.8 23.78 50114.1 97.39 156089 9.13 4632.48 607.35 0.349142 56.218949 95724.11 94.551 21.66 5.84 455.132 64 8040.8 57.45 52001.2 96.4 104.292 2879.71 1.12046 160705.05 21598.3 2.90208 224.55 91.833 65890 146.746 1.38839 2.744 6323.88 27478.48 6519441 2.49 1234.72 2462.53 202.25 5459.11 5.69 4091594.76 500.56 44.159 142633.59 10.134 5777754.86 36933.9 0.469105 451.592 4075.1 846.9 16190299 222876704 20.1559 2422.178 869.792 13.86 1.22057 190.727 2865013 131.58 527 871.338 5.431 91846052434.498 1665.99 3740.29 0.230458 150993.71 13.724 0.86 159256.42 3.55 6.1788 109.393 107.32 343.05 68.4246 59.2954 2.1007 28929.04 4.78202 3381.23 194.282 47.02 3.72 97.171 25.692 13.4723 3.9233 6.19897 12.612 15.6309 4.79183 4.3 173550.28 3.01428 105053.55 47180.4 0.306507 1697 16.1643 8533 451.599 3955795.95 3.72794 164214.5 7033.99 5.811 216782885.23 5.06211 6.936 7.96 28.88 4.24 3.67899 6708782.81 189.138 4.55586 4852459.71 10186 2860347 639 8325 19.2204 4.4429 32.2666 16622 35.9424 2414.27 20348 41.4653 1.23372 524 3.1499 175854 859190.38 28.2762 690.73 175988.63 59757.3 17085 503057.22 538 0.36 0.16 21.03 10378 352 18658.18 823 4683 117695.8 121473.73 1127.7 38.97 11218 50.96 943.921 6.22 140720.96 12964.8 262.04 125.16 5.31 220.7 109.89 11920.7 114.38 958464.66 6.074 52.955 82422.8 270.7 1223.1 16.586807 7106.4 24.7 48762.5 97.76 150205 8.75 4770.39 592.78 0.336 54.109331 99205.02 97.422 22.2 5.99 470.305 63 8045.6 55.74 50492.3 119354.61 98.73 104.398 2800.97 1.09022 162250.89 21331.4 2.87069 228.66 89.83 64248.8 144.764 1.36947 2.787 6167.35 26817.55 6563287 2.44 1205.82 2408.789 207.089 5334.38 5.56 3998434.06 489.37 43.229 139702.5 10.345 5882673.17 37165.7 0.474279 460.086 4142.0 860.8 16325616 219474809 20.11 2459.642 882.755 13.76 1.238 189.229 2882009 132.13 522 868.797 5.401 90741800042.538 1671.06 3695.61 0.233227 152791.84 13.744 0.85 157843.09 3.57 6.14095 108.988 106.206 340.52 67.7415 59.0952 2.10487 29196.39 4.77963 3383.49 195.98 47.34 3.7 96.39 25.6932 13.5376 3.95272 6.16877 12.606 15.52 4.8253 4.27 172356.49 3.00023 104938.6 47498.9 0.305485 1689 16.1264 8587 450.298 3968510.86 3.70853 163876.76 7050.81 5.843 217955200.37 5.04983 6.953 7.94 28.87 4.22 3.67879 6736736.94 188.455 4.57007 4859985.91 10148 2860657 638 8330 19.1695 4.44536 32.3506 16650 35.8679 2419.96 20322 41.4239 1.23195 524 3.15372 176062 860581.21 28.2569 690.13 176083.49 59750.12 17092 503062.12 538 0.36 0.16 OpenBenchmarking.org
Stress-NG Test: CPU Cache OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: CPU Cache A B C 6 12 18 24 30 26.30 15.34 21.03 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
ONNX Runtime Model: super-resolution-10 - Device: CPU - Executor: Standard OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.11 Model: super-resolution-10 - Device: CPU - Executor: Standard A B C 2K 4K 6K 8K 10K 6359 10331 10378 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
ONNX Runtime Model: fcn-resnet101-11 - Device: CPU - Executor: Standard OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.11 Model: fcn-resnet101-11 - Device: CPU - Executor: Standard A B C 80 160 240 320 400 376 256 352 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
Apache HTTP Server Concurrent Requests: 20 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 20 A B C 5K 10K 15K 20K 25K 24165.82 23084.76 18658.18 1. (CC) gcc options: -shared -fPIC -O2 -pthread
ONNX Runtime Model: bertsquad-12 - Device: CPU - Executor: Standard OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.11 Model: bertsquad-12 - Device: CPU - Executor: Standard A B C 200 400 600 800 1000 691 824 823 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
perf-bench Benchmark: Epoll Wait OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Epoll Wait A B C 1000 2000 3000 4000 5000 4782 4015 4683 1. (CC) gcc options: -pthread -shared -lunwind-x86_64 -lunwind -llzma -Xlinker -O6 -ggdb3 -funwind-tables -std=gnu99 -lnuma
Apache HTTP Server Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 200 A B C 30K 60K 90K 120K 150K 126245.90 137891.33 117695.80 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Apache HTTP Server Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 1000 A C 30K 60K 90K 120K 150K 105438.13 121473.73 1. (CC) gcc options: -shared -fPIC -O2 -pthread
Renaissance Test: Scala Dotty OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Scala Dotty A B C 200 400 600 800 1000 1108.2 992.8 1127.7 MIN: 823.35 / MAX: 1501.19 MIN: 823.44 / MAX: 1525.49 MIN: 851.96 / MAX: 1499.37
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p A B C 9 18 27 36 45 36.47 34.33 38.97 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
ONNX Runtime Model: GPT-2 - Device: CPU - Executor: Standard OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.11 Model: GPT-2 - Device: CPU - Executor: Standard A B C 2K 4K 6K 8K 10K 10161 10159 11218 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p A B C 12 24 36 48 60 54.95 49.99 50.96 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU A B C 200 400 600 800 1000 869.42 864.65 943.92 MIN: 851.99 MIN: 848.3 MIN: 841.53 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K A B C 2 4 6 8 10 5.73 5.79 6.22 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Stress-NG Test: Atomic OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Atomic A B C 30K 60K 90K 120K 150K 148748.40 137381.50 140720.96 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Renaissance Test: Savina Reactors.IO OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Savina Reactors.IO A B C 3K 6K 9K 12K 15K 13994.2 13313.1 12964.8 MAX: 28224.11 MIN: 13313.09 / MAX: 21731.4 MIN: 12964.76 / MAX: 20215.75
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p A B C 60 120 180 240 300 275.30 281.70 262.04 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
x264 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 1080p A B C 30 60 90 120 150 116.81 123.32 125.16 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -flto
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K A B C 1.2735 2.547 3.8205 5.094 6.3675 5.57 5.66 5.31 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p A B C 50 100 150 200 250 208.91 207.79 220.70 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K A B C 20 40 60 80 100 111.31 104.80 109.89 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Renaissance Test: In-Memory Database Shootout OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: In-Memory Database Shootout A B C 3K 6K 9K 12K 15K 11794.7 12498.3 11920.7 MIN: 11669.17 / MAX: 14233.25 MIN: 12420.19 / MAX: 14927.64 MIN: 11638.75 / MAX: 14335.64
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K A B C 30 60 90 120 150 108.16 114.24 114.38 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Stress-NG Test: Futex OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Futex A B C 200K 400K 600K 800K 1000K 908797.50 948927.76 958464.66 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 6 A B C 2 4 6 8 10 6.249 5.949 6.074 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 2 A B C 12 24 36 48 60 55.61 54.77 52.96 1. (CXX) g++ options: -O3 -fPIC -lm
Renaissance Test: Apache Spark ALS OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Apache Spark ALS A B C 20K 40K 60K 80K 100K 84016.6 86541.0 82422.8 MIN: 80662.94 / MAX: 86134.44 MIN: 84631.44 / MAX: 87601.1 MIN: 75074.38 / MAX: 86258.18
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p A B C 60 120 180 240 300 257.91 268.40 270.70 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Renaissance Test: Random Forest OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Random Forest A B C 300 600 900 1200 1500 1166.1 1217.5 1223.1 MIN: 1051.96 / MAX: 1351.88 MIN: 1052.07 / MAX: 1444.36 MIN: 1084.03 / MAX: 1419.24
perf-bench Benchmark: Memcpy 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memcpy 1MB A B C 4 8 12 16 20 16.83 17.39 16.59 1. (CC) gcc options: -pthread -shared -lunwind-x86_64 -lunwind -llzma -Xlinker -O6 -ggdb3 -funwind-tables -std=gnu99 -lnuma
Renaissance Test: Finagle HTTP Requests OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Finagle HTTP Requests A B C 1500 3000 4500 6000 7500 6808.3 7137.8 7106.4 MIN: 6198.77 MIN: 6576.64 / MAX: 7137.82 MIN: 6637.43
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K A B C 6 12 18 24 30 24.91 23.78 24.70 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2022-05-18 Model: Inception ResNet V2 A B C 11K 22K 33K 44K 55K 47840.8 50114.1 48762.5
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K A B C 20 40 60 80 100 101.74 97.39 97.76 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
perf-bench Benchmark: Sched Pipe OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Sched Pipe A B C 30K 60K 90K 120K 150K 149542 156089 150205 1. (CC) gcc options: -pthread -shared -lunwind-x86_64 -lunwind -llzma -Xlinker -O6 -ggdb3 -funwind-tables -std=gnu99 -lnuma
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p A B C 3 6 9 12 15 8.90 9.13 8.75 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2022-05-18 Model: Mobilenet Quant A B C 1000 2000 3000 4000 5000 4833.58 4632.48 4770.39
Stress-NG Test: NUMA OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: NUMA A B C 130 260 390 520 650 583.66 607.35 592.78 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU A B C 0.0786 0.1572 0.2358 0.3144 0.393 0.343941 0.349142 0.336000 MIN: 0.33 MIN: 0.33 MIN: 0.32 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
perf-bench Benchmark: Memset 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memset 1MB A B C 13 26 39 52 65 55.72 56.22 54.11 1. (CC) gcc options: -pthread -shared -lunwind-x86_64 -lunwind -llzma -Xlinker -O6 -ggdb3 -funwind-tables -std=gnu99 -lnuma
Apache HTTP Server Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 100 A B C 20K 40K 60K 80K 100K 97283.80 95724.11 99205.02 1. (CC) gcc options: -shared -fPIC -O2 -pthread
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 8 - Input: Bosphorus 1080p A B C 20 40 60 80 100 94.09 94.55 97.42 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K A B C 5 10 15 20 25 21.45 21.66 22.20 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p A B C 2 4 6 8 10 6.04 5.84 5.99 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU A B C 100 200 300 400 500 458.42 455.13 470.31 MIN: 444.49 MIN: 438.74 MIN: 441.87 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
perf-bench Benchmark: Futex Lock-Pi OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Lock-Pi A B C 14 28 42 56 70 62 64 63 1. (CC) gcc options: -pthread -shared -lunwind-x86_64 -lunwind -llzma -Xlinker -O6 -ggdb3 -funwind-tables -std=gnu99 -lnuma
Renaissance Test: Genetic Algorithm Using Jenetics + Futures OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Genetic Algorithm Using Jenetics + Futures A B C 2K 4K 6K 8K 10K 8292.8 8040.8 8045.6 MIN: 8151.24 / MAX: 8455.36 MIN: 7541.33 / MAX: 8322.9 MIN: 7714.33 / MAX: 8198.57
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p A B C 13 26 39 52 65 55.88 57.45 55.74 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Renaissance Test: ALS Movie Lens OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: ALS Movie Lens A B C 11K 22K 33K 44K 55K 50872.9 52001.2 50492.3 MAX: 58880.34 MIN: 50554.35 / MAX: 60239.74 MAX: 58284.29
Apache HTTP Server Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 500 A C 30K 60K 90K 120K 150K 122830.61 119354.61 1. (CC) gcc options: -shared -fPIC -O2 -pthread
SVT-HEVC Tuning: 7 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K A B C 20 40 60 80 100 99.19 96.40 98.73 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 12 - Input: Bosphorus 4K A B C 20 40 60 80 100 107.29 104.29 104.40 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Apache HTTP Server Concurrent Requests: 1 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 1 A B C 600 1200 1800 2400 3000 2809.72 2879.71 2800.97 1. (CC) gcc options: -shared -fPIC -O2 -pthread
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU A B C 0.2521 0.5042 0.7563 1.0084 1.2605 1.10589 1.12046 1.09022 MIN: 0.77 MIN: 0.78 MIN: 0.78 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
nginx Concurrent Requests: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 500 A B C 40K 80K 120K 160K 200K 165114.50 160705.05 162250.89 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
Renaissance Test: Akka Unbalanced Cobwebbed Tree OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Akka Unbalanced Cobwebbed Tree A B C 5K 10K 15K 20K 25K 21915.1 21598.3 21331.4 MIN: 16827.41 MIN: 16497.83 / MAX: 21598.31 MIN: 16358.49
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU A B C 0.6634 1.3268 1.9902 2.6536 3.317 2.94845 2.90208 2.87069 MIN: 2.7 MIN: 2.64 MIN: 2.63 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p A B C 50 100 150 200 250 222.88 224.55 228.66 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 10 - Input: Bosphorus 4K A B C 20 40 60 80 100 89.54 91.83 89.83 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2022-05-18 Model: NASNet Mobile A B C 14K 28K 42K 56K 70K 65535.0 65890.0 64248.8
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 10 - Input: Bosphorus 1080p A B C 30 60 90 120 150 143.11 146.75 144.76 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU A B C 0.316 0.632 0.948 1.264 1.58 1.40425 1.38839 1.36947 MIN: 1.22 MIN: 1.09 MIN: 1.19 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 4 - Input: Bosphorus 4K A B C 0.6271 1.2542 1.8813 2.5084 3.1355 2.718 2.744 2.787 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Nettle Test: aes256 OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: aes256 A B C 1400 2800 4200 5600 7000 6285.45 6323.88 6167.35 MIN: 4246.46 / MAX: 10397.63 MIN: 4271.48 / MAX: 10459.4 MIN: 4162.19 / MAX: 10212.16 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
Stress-NG Test: Socket Activity OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Socket Activity A B C 6K 12K 18K 24K 30K 27081.43 27478.48 26817.55 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read While Writing A B C 1.4M 2.8M 4.2M 5.6M 7M 6680014 6519441 6563287 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: Kostya A B C 0.5625 1.125 1.6875 2.25 2.8125 2.50 2.49 2.44 1. (CXX) g++ options: -O3 -pthread
Nettle Test: chacha OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: chacha A B C 300 600 900 1200 1500 1223.77 1234.72 1205.82 MIN: 554.98 / MAX: 3694.93 MIN: 561.17 / MAX: 3732.39 MIN: 547.67 / MAX: 3646.85 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
Etcpak Benchmark: Multi-Threaded - Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 1.0 Benchmark: Multi-Threaded - Configuration: DXT1 A B C 500 1000 1500 2000 2500 2466.42 2462.53 2408.79 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 12 - Input: Bosphorus 1080p A B C 50 100 150 200 250 206.24 202.25 207.09 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2022-05-18 Model: SqueezeNet A B C 1200 2400 3600 4800 6000 5446.51 5459.11 5334.38
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p A B C 1.2803 2.5606 3.8409 5.1212 6.4015 5.67 5.69 5.56 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Stress-NG Test: Context Switching OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Context Switching A B C 900K 1800K 2700K 3600K 4500K 4008133.57 4091594.76 3998434.06 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Nettle Test: sha512 OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: sha512 A B C 110 220 330 440 550 498.39 500.56 489.37 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 8 - Input: Bosphorus 4K A B C 10 20 30 40 50 44.11 44.16 43.23 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
nginx Concurrent Requests: 20 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 20 A B C 30K 60K 90K 120K 150K 141460.52 142633.59 139702.50 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 6, Lossless A B C 3 6 9 12 15 10.32 10.13 10.35 1. (CXX) g++ options: -O3 -fPIC -lm
Stress-NG Test: Glibc C String Functions OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Glibc C String Functions A B C 1.3M 2.6M 3.9M 5.2M 6.5M 5765765.00 5777754.86 5882673.17 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2022-05-18 Model: Inception V4 A B C 8K 16K 24K 32K 40K 37643.7 36933.9 37165.7
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU A B C 0.1076 0.2152 0.3228 0.4304 0.538 0.478080 0.469105 0.474279 MIN: 0.4 MIN: 0.4 MIN: 0.4 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU A B C 100 200 300 400 500 455.06 451.59 460.09 MIN: 445.99 MIN: 441.67 MIN: 445.03 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Renaissance Test: Apache Spark PageRank OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Apache Spark PageRank A B C 900 1800 2700 3600 4500 4107.5 4075.1 4142.0 MIN: 3638.26 / MAX: 4140.32 MIN: 3574.3 / MAX: 4120.57 MIN: 3707.03
Renaissance Test: Apache Spark Bayes OpenBenchmarking.org ms, Fewer Is Better Renaissance 0.14 Test: Apache Spark Bayes A B C 200 400 600 800 1000 859.3 846.9 860.8 MIN: 516.49 / MAX: 1602.06 MIN: 525.2 / MAX: 1088.41 MIN: 530.21 / MAX: 1413.48
perf-bench Benchmark: Syscall Basic OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Syscall Basic A B C 4M 8M 12M 16M 20M 16452745 16190299 16325616 1. (CC) gcc options: -pthread -shared -lunwind-x86_64 -lunwind -llzma -Xlinker -O6 -ggdb3 -funwind-tables -std=gnu99 -lnuma
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Random Read A B C 50M 100M 150M 200M 250M 222126242 222876704 219474809 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Glibc Benchmarks Benchmark: log2 OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: log2 A B C 5 10 15 20 25 19.85 20.16 20.11 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Etcpak Benchmark: Multi-Threaded - Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 1.0 Benchmark: Multi-Threaded - Configuration: ETC2 A B C 500 1000 1500 2000 2500 2455.86 2422.18 2459.64 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU A B C 200 400 600 800 1000 871.31 869.79 882.76 MIN: 852.5 MIN: 850.21 MIN: 861.14 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K A B C 4 8 12 16 20 13.96 13.86 13.76 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU A B C 0.2786 0.5572 0.8358 1.1144 1.393 1.22929 1.22057 1.23800 MIN: 1.17 MIN: 1.16 MIN: 1.17 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Etcpak Benchmark: Single-Threaded - Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 1.0 Benchmark: Single-Threaded - Configuration: DXT1 A B C 40 80 120 160 200 191.92 190.73 189.23 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Facebook RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read Random Write Random A B C 600K 1200K 1800K 2400K 3000K 2904862 2865013 2882009 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
SVT-HEVC Tuning: 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K A B C 30 60 90 120 150 133.39 131.58 132.13 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
ONNX Runtime Model: yolov4 - Device: CPU - Executor: Standard OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.11 Model: yolov4 - Device: CPU - Executor: Standard A B C 110 220 330 440 550 529 527 522 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU A B C 200 400 600 800 1000 880.42 871.34 868.80 MIN: 860.19 MIN: 850.22 MIN: 854.33 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.0 Encoder Mode: Preset 4 - Input: Bosphorus 1080p A B C 1.2312 2.4624 3.6936 4.9248 6.156 5.472 5.431 5.401 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Java JMH Throughput OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput A B C 20000M 40000M 60000M 80000M 100000M 90801514532.11 91846052434.50 90741800042.54
Stress-NG Test: MMAP OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: MMAP A B C 400 800 1200 1600 2000 1686.14 1665.99 1671.06 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Nettle Test: poly1305-aes OpenBenchmarking.org Mbyte/s, More Is Better Nettle 3.8 Test: poly1305-aes A B C 800 1600 2400 3200 4000 3708.97 3740.29 3695.61 1. (CC) gcc options: -O2 -ggdb3 -lnettle -lgmp -lm -lcrypto
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU A B C 0.0525 0.105 0.1575 0.21 0.2625 0.232440 0.230458 0.233227 MIN: 0.19 MIN: 0.19 MIN: 0.2 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
nginx Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 100 A B C 30K 60K 90K 120K 150K 152697.49 150993.71 152791.84 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
OSPray Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/ao/real_time A B C 4 8 12 16 20 13.58 13.72 13.74
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: LargeRandom A B C 0.1935 0.387 0.5805 0.774 0.9675 0.86 0.86 0.85 1. (CXX) g++ options: -O3 -pthread
nginx Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 200 A B C 30K 60K 90K 120K 150K 159676.56 159256.42 157843.09 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
AOM AV1 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K A B C 0.8033 1.6066 2.4099 3.2132 4.0165 3.53 3.55 3.57 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Glibc Benchmarks Benchmark: modf OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: modf A B C 2 4 6 8 10 6.11030 6.17880 6.14095 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
OSPray Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/ao/real_time A B C 20 40 60 80 100 108.23 109.39 108.99
OSPray Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/scivis/real_time A B C 20 40 60 80 100 106.42 107.32 106.21
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p A B C 70 140 210 280 350 339.56 343.05 340.52 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Glibc Benchmarks Benchmark: cos OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: cos A B C 15 30 45 60 75 68.24 68.42 67.74 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: sin OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: sin A B C 13 26 39 52 65 58.73 59.30 59.10 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
oneDNN Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU A B C 0.4736 0.9472 1.4208 1.8944 2.368 2.08511 2.10070 2.10487 MIN: 2.04 MIN: 2.05 MIN: 2.05 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
nginx Concurrent Requests: 1 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 1 A B C 6K 12K 18K 24K 30K 29191.38 28929.04 29196.39 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
Glibc Benchmarks Benchmark: pthread_once OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: pthread_once A B C 1.076 2.152 3.228 4.304 5.38 4.73900 4.78202 4.77963 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2022-05-18 Model: Mobilenet Float A B C 700 1400 2100 2800 3500 3411.68 3381.23 3383.49
OSPray Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/pathtracer/real_time A B C 40 80 120 160 200 194.85 194.28 195.98
x264 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 4K A B C 11 22 33 44 55 46.94 47.02 47.34 1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -flto
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: PartialTweets A B C 0.837 1.674 2.511 3.348 4.185 3.69 3.72 3.70 1. (CXX) g++ options: -O3 -pthread
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 0 A B C 20 40 60 80 100 97.01 97.17 96.39 1. (CXX) g++ options: -O3 -fPIC -lm
Glibc Benchmarks Benchmark: sinh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: sinh A B C 6 12 18 24 30 25.49 25.69 25.69 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
OSPray Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time A B C 3 6 9 12 15 13.43 13.47 13.54
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU A B C 0.8894 1.7788 2.6682 3.5576 4.447 3.92995 3.92330 3.95272 MIN: 3.83 MIN: 3.83 MIN: 3.84 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Glibc Benchmarks Benchmark: sqrt OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: sqrt A B C 2 4 6 8 10 6.15346 6.19897 6.16877 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Timed MPlayer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed MPlayer Compilation 1.5 Time To Compile A B C 3 6 9 12 15 12.70 12.61 12.61
Glibc Benchmarks Benchmark: exp OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: exp A B C 4 8 12 16 20 15.57 15.63 15.52 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: ffs OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: ffs A B C 1.0857 2.1714 3.2571 4.3428 5.4285 4.79113 4.79183 4.82530 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: DistinctUserID A B C 0.9675 1.935 2.9025 3.87 4.8375 4.28 4.30 4.27 1. (CXX) g++ options: -O3 -pthread
Stress-NG Test: Vector Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Vector Math A B C 40K 80K 120K 160K 200K 172699.69 173550.28 172356.49 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU A B C 0.6782 1.3564 2.0346 2.7128 3.391 2.99374 3.01428 3.00023 MIN: 2.94 MIN: 2.96 MIN: 2.95 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Stress-NG Test: CPU Stress OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: CPU Stress A B C 20K 40K 60K 80K 100K 105656.32 105053.55 104938.60 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Stress-NG Test: Forking OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Forking A B C 10K 20K 30K 40K 50K 47213.65 47180.40 47498.90 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU A B C 0.0692 0.1384 0.2076 0.2768 0.346 0.307546 0.306507 0.305485 MIN: 0.28 MIN: 0.28 MIN: 0.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
ONNX Runtime Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.11 Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard A B C 400 800 1200 1600 2000 1700 1697 1689 1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
OSPray Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time A B C 4 8 12 16 20 16.23 16.16 16.13
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer A B C 2K 4K 6K 8K 10K 8564 8533 8587 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU A B C 100 200 300 400 500 453.11 451.60 450.30 MIN: 441.33 MIN: 442.48 MIN: 440.16 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Stress-NG Test: System V Message Passing OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: System V Message Passing A B C 800K 1600K 2400K 3200K 4000K 3944584.98 3955795.95 3968510.86 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
oneDNN Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU A B C 0.8392 1.6784 2.5176 3.3568 4.196 3.72981 3.72794 3.70853 MIN: 3.52 MIN: 3.52 MIN: 3.55 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
nginx Concurrent Requests: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.21.1 Concurrent Requests: 1000 A B C 40K 80K 120K 160K 200K 164799.04 164214.50 163876.76 1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native
Stress-NG Test: Memory Copying OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Memory Copying A B C 1500 3000 4500 6000 7500 7011.74 7033.99 7050.81 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare A B C 1.3147 2.6294 3.9441 5.2588 6.5735 5.836 5.811 5.843 1. (CXX) g++ options: -O3 -pthread
Stress-NG Test: Malloc OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Malloc A B C 50M 100M 150M 200M 250M 217472149.53 216782885.23 217955200.37 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU A B C 1.139 2.278 3.417 4.556 5.695 5.03515 5.06211 5.04983 MIN: 4.89 MIN: 4.91 MIN: 4.91 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.10 Encoder Speed: 10, Lossless A B C 2 4 6 8 10 6.973 6.936 6.953 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-HEVC Tuning: 1 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 4K A B C 2 4 6 8 10 7.92 7.96 7.94 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p A B C 7 14 21 28 35 29.01 28.88 28.87 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
simdjson Throughput Test: TopTweet OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: TopTweet A B C 0.954 1.908 2.862 3.816 4.77 4.22 4.24 4.22 1. (CXX) g++ options: -O3 -pthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU A B C 0.8315 1.663 2.4945 3.326 4.1575 3.69564 3.67899 3.67879 MIN: 3.59 MIN: 3.6 MIN: 3.59 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Stress-NG Test: Semaphores OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Semaphores A B C 1.4M 2.8M 4.2M 5.6M 7M 6707967.45 6708782.81 6736736.94 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Etcpak Benchmark: Single-Threaded - Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 1.0 Benchmark: Single-Threaded - Configuration: ETC2 A B C 40 80 120 160 200 189.23 189.14 188.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Glibc Benchmarks Benchmark: ffsll OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: ffsll A B C 1.0291 2.0582 3.0873 4.1164 5.1455 4.57380 4.55586 4.57007 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Stress-NG Test: IO_uring OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: IO_uring A B C 1000K 2000K 3000K 4000K 5000K 4871109.61 4852459.71 4859985.91 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer A B C 2K 4K 6K 8K 10K 10175 10186 10148 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
perf-bench Benchmark: Futex Hash OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Hash A B C 600K 1200K 1800K 2400K 3000K 2870019 2860347 2860657 1. (CC) gcc options: -pthread -shared -lunwind-x86_64 -lunwind -llzma -Xlinker -O6 -ggdb3 -funwind-tables -std=gnu99 -lnuma
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer A B C 140 280 420 560 700 637 639 638 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer A B C 2K 4K 6K 8K 10K 8305 8325 8330 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU A B C 5 10 15 20 25 19.22 19.22 19.17 MIN: 13.82 MIN: 15.45 MIN: 16.57 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU A B C 1.0024 2.0048 3.0072 4.0096 5.012 4.45509 4.44290 4.44536 MIN: 4.4 MIN: 4.4 MIN: 4.4 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Glibc Benchmarks Benchmark: atanh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: atanh A B C 8 16 24 32 40 32.30 32.27 32.35 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 4K 8K 12K 16K 20K 16607 16622 16650 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Glibc Benchmarks Benchmark: tanh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: tanh A B C 8 16 24 32 40 35.85 35.94 35.87 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Stress-NG Test: MEMFD OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: MEMFD A B C 500 1000 1500 2000 2500 2420.44 2414.27 2419.96 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 4K 8K 12K 16K 20K 20298 20348 20322 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Glibc Benchmarks Benchmark: sincos OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: sincos A B C 9 18 27 36 45 41.37 41.47 41.42 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU A B C 0.2778 0.5556 0.8334 1.1112 1.389 1.23463 1.23372 1.23195 MIN: 1.19 MIN: 1.19 MIN: 1.19 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer A B C 110 220 330 440 550 523 524 524 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU A B C 0.7096 1.4192 2.1288 2.8384 3.548 3.14788 3.14990 3.15372 MIN: 2.96 MIN: 2.94 MIN: 2.94 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Facebook RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Update Random A B C 40K 80K 120K 160K 200K 175746 175854 176062 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Stress-NG Test: SENDFILE OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: SENDFILE A B C 200K 400K 600K 800K 1000K 859564.82 859190.38 860581.21 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Glibc Benchmarks Benchmark: asinh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks Benchmark: asinh A B C 7 14 21 28 35 28.29 28.28 28.26 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Stress-NG Test: Glibc Qsort Data Sorting OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Glibc Qsort Data Sorting A B C 150 300 450 600 750 690.47 690.73 690.13 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Stress-NG Test: Matrix Math OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Matrix Math A B C 40K 80K 120K 160K 200K 175964.38 175988.63 176083.49 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
Stress-NG Test: Crypto OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: Crypto A B C 13K 26K 39K 52K 65K 59728.27 59757.30 59750.12 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 4K 8K 12K 16K 20K 17084 17085 17092 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Stress-NG Test: x86_64 RdRand OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.14 Test: x86_64 RdRand A B C 110K 220K 330K 440K 550K 503080.37 503057.22 503062.12 1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lsctp -lz -pthread
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer A B C 120 240 360 480 600 538 538 538 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p A B C 0.081 0.162 0.243 0.324 0.405 0.36 0.36 0.36 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K A B C 0.036 0.072 0.108 0.144 0.18 0.16 0.16 0.16 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Phoronix Test Suite v10.8.4