Tests for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2012046-HA-GCCZNVER333 GCC Znver3 First Cut Benchmarks - Phoronix Test Suite GCC Znver3 First Cut Benchmarks Tests for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2012046-HA-GCCZNVER333&sgm=1&hgv=znver3&grt&sro .
GCC Znver3 First Cut Benchmarks Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution znver3 x86-64 znver1 znver2 haswell skylake AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (2311 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 + 2000GB NVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz) NVIDIA Device 228b ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-54-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 455.45.01 4.6.0 OpenCL 1.2 CUDA 11.1.114 1.2.142 GCC 11.0.0 20201203 ext4 3840x2160 OpenBenchmarking.org Environment Details - znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3" - x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64" - znver1: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1" - znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2" - haswell: CXXFLAGS="-O3 -march=haswell" CFLAGS="-O3 -march=haswell" - skylake: CXXFLAGS="-O3 -march=skylake" CFLAGS="-O3 -march=skylake" Compiler Details - --disable-multilib --enable-checking=release Disk Details - znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 OpenCL Details - GPU Compute Cores: 4864 Python Details - znver3: Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
GCC Znver3 First Cut Benchmarks mt-dgemm: Sustained Floating-Point Rate aobench: 2048 x 2048 - Total Time c-ray: Total Time - 4K, 16 Rays Per Pixel coremark: CoreMark Size 666 - Iterations Per Second crafty: Elapsed Time daphne: OpenMP - NDT Mapping daphne: OpenMP - Points2Image dav1d: Chimera 1080p 10-bit graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing hint: FLOAT kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Ultra Fast encode-mp3: WAV To MP3 libraw: Post-Processing Benchmark rnnoise: scimark2: Composite scimark2: Sparse Matrix Multiply scimark2: Jacobi Successive Over-Relaxation smallpt: Global Illumination Renderer; 128 Samples sqlite-speedtest: Timed Time - Size 1,000 vpxenc: Speed 0 vpxenc: Speed 5 webp: Quality 100 znver3 x86-64 znver1 znver2 haswell skylake 8.972347 25.258 25.202 837788.084234 12060994 969.38 29625.957802710 263.97 374 447 2025 539663428.10785 55.82 203.08 5.468 76.04 13.989 4463.42 5091.12 3072.04 4.627 41.905 10.76 39.13 1.670 8.054180 29.521 31.212 783951.867250 11828707 932.29 29502.084524288 189.62 225 382 1809 521237587.67053 55.07 202.43 5.822 65.48 14.633 3662.75 4977.77 2440.10 5.059 42.186 10.17 36.03 1.717 8.778316 25.028 26.073 835605.873975 11865359 950.30 29918.699671060 251.11 367 446 2084 523840766.06954 56.22 205.71 5.546 74.54 14.095 3304.21 4392.61 2913.83 4.790 42.250 10.82 38.95 1.692 8.922133 25.254 25.321 834342.440189 11910299 963.53 29824.340387406 265.16 373 446 2050 529343739.09606 55.85 201.84 5.478 76.01 13.854 4310.95 4918.89 2931.78 4.664 42.086 10.77 38.97 1.689 7.829319 25.328 25.455 776752.217528 11935854 969.60 29785.613813708 175.36 299 524 2023 519167343.77856 57.53 209.19 5.520 74.13 15.512 4332.88 5062.07 2934.46 4.906 42.229 10.81 39.18 1.690 8.111486 25.369 25.085 823512.886406 11943425 962.73 30279.771134316 262.40 377 436 2023 520771613.78862 55.75 203.69 5.565 75.04 15.575 4314.57 4955.38 2936.04 4.872 42.783 10.77 38.87 1.684 OpenBenchmarking.org
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate haswell skylake x86-64 znver1 znver2 znver3 3 6 9 12 15 SE +/- 0.062576, N = 15 SE +/- 0.117999, N = 4 SE +/- 0.072954, N = 3 SE +/- 0.028561, N = 3 SE +/- 0.094255, N = 3 SE +/- 0.135072, N = 3 7.829319 8.111486 8.054180 8.778316 8.922133 8.972347 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -march=native -fopenmp
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time haswell skylake x86-64 znver1 znver2 znver3 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.68, N = 15 SE +/- 0.24, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 25.33 25.37 29.52 25.03 25.25 25.26 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -lm -O3
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel haswell skylake x86-64 znver1 znver2 znver3 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 25.46 25.09 31.21 26.07 25.32 25.20 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -lm -lpthread -O3
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second haswell skylake x86-64 znver1 znver2 znver3 200K 400K 600K 800K 1000K SE +/- 563.60, N = 3 SE +/- 1894.03, N = 3 SE +/- 651.07, N = 3 SE +/- 2226.29, N = 3 SE +/- 7614.04, N = 3 SE +/- 5296.50, N = 3 776752.22 823512.89 783951.87 835605.87 834342.44 837788.08 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O2 -O3 -lrt" -lrt
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time haswell skylake x86-64 znver1 znver2 znver3 3M 6M 9M 12M 15M SE +/- 22104.24, N = 3 SE +/- 69563.68, N = 3 SE +/- 18592.10, N = 3 SE +/- 33532.81, N = 3 SE +/- 44223.61, N = 3 SE +/- 121191.73, N = 3 11935854 11943425 11828707 11865359 11910299 12060994 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping haswell skylake x86-64 znver1 znver2 znver3 200 400 600 800 1000 SE +/- 1.90, N = 3 SE +/- 1.88, N = 3 SE +/- 2.69, N = 3 SE +/- 0.66, N = 3 SE +/- 2.52, N = 3 SE +/- 3.90, N = 3 969.60 962.73 932.29 950.30 963.53 969.38 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image haswell skylake x86-64 znver1 znver2 znver3 6K 12K 18K 24K 30K SE +/- 289.82, N = 3 SE +/- 113.76, N = 3 SE +/- 108.90, N = 3 SE +/- 159.95, N = 3 SE +/- 240.50, N = 3 SE +/- 109.65, N = 3 29785.61 30279.77 29502.08 29918.70 29824.34 29625.96 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Chimera 1080p 10-bit haswell skylake x86-64 znver1 znver2 znver3 60 120 180 240 300 SE +/- 0.22, N = 3 SE +/- 0.56, N = 3 SE +/- 0.23, N = 3 SE +/- 0.45, N = 3 SE +/- 0.73, N = 3 SE +/- 0.85, N = 3 175.36 262.40 189.62 251.11 265.16 263.97 -march=haswell - MIN: 93.55 / MAX: 459.71 -march=skylake - MIN: 173.7 / MAX: 495.13 -march=x86-64 - MIN: 124.1 / MAX: 372.85 -march=znver1 - MIN: 167.21 / MAX: 471.36 -march=znver2 - MIN: 175.81 / MAX: 502.41 -march=znver3 - MIN: 175.34 / MAX: 478.58 1. (CC) gcc options: -O3 -pthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen haswell skylake x86-64 znver1 znver2 znver3 80 160 240 320 400 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 1.00, N = 3 299 377 225 367 373 374 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced haswell skylake x86-64 znver1 znver2 znver3 110 220 330 440 550 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 524 436 382 446 446 447 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing haswell skylake x86-64 znver1 znver2 znver3 400 800 1200 1600 2000 SE +/- 1.00, N = 3 SE +/- 4.10, N = 3 SE +/- 3.79, N = 3 SE +/- 0.88, N = 3 SE +/- 4.91, N = 3 SE +/- 4.33, N = 3 2023 2023 1809 2084 2050 2025 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT haswell skylake x86-64 znver1 znver2 znver3 120M 240M 360M 480M 600M SE +/- 2024395.89, N = 3 SE +/- 4152069.13, N = 3 SE +/- 1444160.80, N = 3 SE +/- 114637.70, N = 3 SE +/- 5908890.92, N = 3 SE +/- 963540.42, N = 3 519167343.78 520771613.79 521237587.67 523840766.07 529343739.10 539663428.11 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -march=native -lm
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast haswell skylake x86-64 znver1 znver2 znver3 13 26 39 52 65 SE +/- 0.01, N = 3 SE +/- 0.22, N = 3 SE +/- 0.10, N = 3 SE +/- 0.20, N = 3 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 57.53 55.75 55.07 56.22 55.85 55.82 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast haswell skylake x86-64 znver1 znver2 znver3 50 100 150 200 250 SE +/- 1.14, N = 3 SE +/- 0.72, N = 3 SE +/- 0.29, N = 3 SE +/- 0.58, N = 3 SE +/- 1.10, N = 3 SE +/- 0.81, N = 3 209.19 203.69 202.43 205.71 201.84 203.08 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 haswell skylake x86-64 znver1 znver2 znver3 1.31 2.62 3.93 5.24 6.55 SE +/- 0.020, N = 3 SE +/- 0.007, N = 3 SE +/- 0.015, N = 3 SE +/- 0.009, N = 3 SE +/- 0.010, N = 3 SE +/- 0.006, N = 3 5.520 5.565 5.822 5.546 5.478 5.468 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark haswell skylake x86-64 znver1 znver2 znver3 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.39, N = 3 SE +/- 0.02, N = 3 SE +/- 0.16, N = 3 SE +/- 0.20, N = 3 SE +/- 0.06, N = 3 74.13 75.04 65.48 74.54 76.01 76.04 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -O3 -fopenmp -ljpeg -lz -lm
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 haswell skylake x86-64 znver1 znver2 znver3 4 8 12 16 20 SE +/- 0.16, N = 3 SE +/- 0.05, N = 3 SE +/- 0.22, N = 3 SE +/- 0.10, N = 3 SE +/- 0.19, N = 3 SE +/- 0.13, N = 3 15.51 15.58 14.63 14.10 13.85 13.99 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -pedantic -fvisibility=hidden
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite haswell skylake x86-64 znver1 znver2 znver3 1000 2000 3000 4000 5000 SE +/- 11.48, N = 3 SE +/- 4.37, N = 3 SE +/- 5.93, N = 3 SE +/- 13.80, N = 3 SE +/- 1.02, N = 3 SE +/- 0.19, N = 3 4332.88 4314.57 3662.75 3304.21 4310.95 4463.42 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply haswell skylake x86-64 znver1 znver2 znver3 1100 2200 3300 4400 5500 SE +/- 44.33, N = 3 SE +/- 10.56, N = 3 SE +/- 10.14, N = 3 SE +/- 1.53, N = 3 SE +/- 1.75, N = 3 SE +/- 2.08, N = 3 5062.07 4955.38 4977.77 4392.61 4918.89 5091.12 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation haswell skylake x86-64 znver1 znver2 znver3 700 1400 2100 2800 3500 SE +/- 2.02, N = 3 SE +/- 0.45, N = 3 SE +/- 0.05, N = 3 SE +/- 3.35, N = 3 SE +/- 3.83, N = 3 SE +/- 0.78, N = 3 2934.46 2936.04 2440.10 2913.83 2931.78 3072.04 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -lm
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples haswell skylake x86-64 znver1 znver2 znver3 1.1383 2.2766 3.4149 4.5532 5.6915 SE +/- 0.014, N = 3 SE +/- 0.005, N = 3 SE +/- 0.010, N = 3 SE +/- 0.007, N = 3 SE +/- 0.007, N = 3 SE +/- 0.005, N = 3 4.906 4.872 5.059 4.790 4.664 4.627 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -fopenmp -O3
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 haswell skylake x86-64 znver1 znver2 znver3 10 20 30 40 50 SE +/- 0.23, N = 3 SE +/- 0.15, N = 3 SE +/- 0.18, N = 3 SE +/- 0.19, N = 3 SE +/- 0.61, N = 3 SE +/- 0.25, N = 3 42.23 42.78 42.19 42.25 42.09 41.91 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -ldl -lz -lpthread
VP9 libvpx Encoding Speed: Speed 0 OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 0 haswell skylake x86-64 znver1 znver2 znver3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 10.81 10.77 10.17 10.82 10.77 10.76 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
VP9 libvpx Encoding Speed: Speed 5 OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 5 haswell skylake x86-64 znver1 znver2 znver3 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.27, N = 3 SE +/- 0.09, N = 3 SE +/- 0.33, N = 3 SE +/- 0.15, N = 3 SE +/- 0.03, N = 3 39.18 38.87 36.03 38.95 38.97 39.13 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 haswell skylake x86-64 znver1 znver2 znver3 0.3863 0.7726 1.1589 1.5452 1.9315 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.020, N = 3 1.690 1.684 1.717 1.692 1.689 1.670 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fvisibility=hidden -O3 -pthread -lm -ljpeg -lpng16 -ltiff
Geometric Mean Of All Test Results Result Composite - GCC Znver3 First Cut Benchmarks OpenBenchmarking.org Geometric Mean, More Is Better Geometric Mean Of All Test Results Result Composite - GCC Znver3 First Cut Benchmarks haswell skylake x86-64 znver1 znver2 znver3 110 220 330 440 550 473.43 482.49 441.20 478.18 488.84 492.26
Phoronix Test Suite v10.8.4