Tests for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2012046-HA-GCCZNVER333 GCC Znver3 First Cut Benchmarks - Phoronix Test Suite GCC Znver3 First Cut Benchmarks Tests for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2012046-HA-GCCZNVER333&sgm=1&hgv=znver3&gru&sro .
GCC Znver3 First Cut Benchmarks Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution znver3 x86-64 znver1 znver2 haswell skylake AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads) ASUS ROG CROSSHAIR VIII HERO (WI-FI) (2311 BIOS) AMD Starship/Matisse 16GB 2000GB Corsair Force MP600 + 2000GB NVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz) NVIDIA Device 228b ASUS MG28U Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200 Ubuntu 20.04 5.4.0-54-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.8 NVIDIA 455.45.01 4.6.0 OpenCL 1.2 CUDA 11.1.114 1.2.142 GCC 11.0.0 20201203 ext4 3840x2160 OpenBenchmarking.org Environment Details - znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3" - x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64" - znver1: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1" - znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2" - haswell: CXXFLAGS="-O3 -march=haswell" CFLAGS="-O3 -march=haswell" - skylake: CXXFLAGS="-O3 -march=skylake" CFLAGS="-O3 -march=skylake" Compiler Details - --disable-multilib --enable-checking=release Disk Details - znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096 Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 OpenCL Details - GPU Compute Cores: 4864 Python Details - znver3: Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
GCC Znver3 First Cut Benchmarks dav1d: Chimera 1080p 10-bit kvazaar: Bosphorus 4K - Ultra Fast kvazaar: Bosphorus 1080p - Ultra Fast vpxenc: Speed 0 vpxenc: Speed 5 mt-dgemm: Sustained Floating-Point Rate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing coremark: CoreMark Size 666 - Iterations Per Second scimark2: Composite scimark2: Sparse Matrix Multiply scimark2: Jacobi Successive Over-Relaxation libraw: Post-Processing Benchmark crafty: Elapsed Time hint: FLOAT daphne: OpenMP - NDT Mapping daphne: OpenMP - Points2Image webp: Quality 100 c-ray: Total Time - 4K, 16 Rays Per Pixel smallpt: Global Illumination Renderer; 128 Samples aobench: 2048 x 2048 - Total Time encode-mp3: WAV To MP3 rnnoise: sqlite-speedtest: Timed Time - Size 1,000 znver3 x86-64 znver1 znver2 haswell skylake 263.97 55.82 203.08 10.76 39.13 8.972347 374 447 2025 837788.084234 4463.42 5091.12 3072.04 76.04 12060994 539663428.10785 969.38 29625.957802710 1.670 25.202 4.627 25.258 5.468 13.989 41.905 189.62 55.07 202.43 10.17 36.03 8.054180 225 382 1809 783951.867250 3662.75 4977.77 2440.10 65.48 11828707 521237587.67053 932.29 29502.084524288 1.717 31.212 5.059 29.521 5.822 14.633 42.186 251.11 56.22 205.71 10.82 38.95 8.778316 367 446 2084 835605.873975 3304.21 4392.61 2913.83 74.54 11865359 523840766.06954 950.30 29918.699671060 1.692 26.073 4.790 25.028 5.546 14.095 42.250 265.16 55.85 201.84 10.77 38.97 8.922133 373 446 2050 834342.440189 4310.95 4918.89 2931.78 76.01 11910299 529343739.09606 963.53 29824.340387406 1.689 25.321 4.664 25.254 5.478 13.854 42.086 175.36 57.53 209.19 10.81 39.18 7.829319 299 524 2023 776752.217528 4332.88 5062.07 2934.46 74.13 11935854 519167343.77856 969.60 29785.613813708 1.690 25.455 4.906 25.328 5.520 15.512 42.229 262.40 55.75 203.69 10.77 38.87 8.111486 377 436 2023 823512.886406 4314.57 4955.38 2936.04 75.04 11943425 520771613.78862 962.73 30279.771134316 1.684 25.085 4.872 25.369 5.565 15.575 42.783 OpenBenchmarking.org
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.7.0 Video Input: Chimera 1080p 10-bit haswell skylake x86-64 znver1 znver2 znver3 60 120 180 240 300 SE +/- 0.22, N = 3 SE +/- 0.56, N = 3 SE +/- 0.23, N = 3 SE +/- 0.45, N = 3 SE +/- 0.73, N = 3 SE +/- 0.85, N = 3 175.36 262.40 189.62 251.11 265.16 263.97 -march=haswell - MIN: 93.55 / MAX: 459.71 -march=skylake - MIN: 173.7 / MAX: 495.13 -march=x86-64 - MIN: 124.1 / MAX: 372.85 -march=znver1 - MIN: 167.21 / MAX: 471.36 -march=znver2 - MIN: 175.81 / MAX: 502.41 -march=znver3 - MIN: 175.34 / MAX: 478.58 1. (CC) gcc options: -O3 -pthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast haswell skylake x86-64 znver1 znver2 znver3 13 26 39 52 65 SE +/- 0.01, N = 3 SE +/- 0.22, N = 3 SE +/- 0.10, N = 3 SE +/- 0.20, N = 3 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 57.53 55.75 55.07 56.22 55.85 55.82 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast haswell skylake x86-64 znver1 znver2 znver3 50 100 150 200 250 SE +/- 1.14, N = 3 SE +/- 0.72, N = 3 SE +/- 0.29, N = 3 SE +/- 0.58, N = 3 SE +/- 1.10, N = 3 SE +/- 0.81, N = 3 209.19 203.69 202.43 205.71 201.84 203.08 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt
VP9 libvpx Encoding Speed: Speed 0 OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 0 haswell skylake x86-64 znver1 znver2 znver3 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 10.81 10.77 10.17 10.82 10.77 10.76 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
VP9 libvpx Encoding Speed: Speed 5 OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.8.2 Speed: Speed 5 haswell skylake x86-64 znver1 znver2 znver3 9 18 27 36 45 SE +/- 0.02, N = 3 SE +/- 0.27, N = 3 SE +/- 0.09, N = 3 SE +/- 0.33, N = 3 SE +/- 0.15, N = 3 SE +/- 0.03, N = 3 39.18 38.87 36.03 38.95 38.97 39.13 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate haswell skylake x86-64 znver1 znver2 znver3 3 6 9 12 15 SE +/- 0.062576, N = 15 SE +/- 0.117999, N = 4 SE +/- 0.072954, N = 3 SE +/- 0.028561, N = 3 SE +/- 0.094255, N = 3 SE +/- 0.135072, N = 3 7.829319 8.111486 8.054180 8.778316 8.922133 8.972347 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -march=native -fopenmp
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen haswell skylake x86-64 znver1 znver2 znver3 80 160 240 320 400 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 1.00, N = 3 299 377 225 367 373 374 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced haswell skylake x86-64 znver1 znver2 znver3 110 220 330 440 550 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 524 436 382 446 446 447 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing haswell skylake x86-64 znver1 znver2 znver3 400 800 1200 1600 2000 SE +/- 1.00, N = 3 SE +/- 4.10, N = 3 SE +/- 3.79, N = 3 SE +/- 0.88, N = 3 SE +/- 4.91, N = 3 SE +/- 4.33, N = 3 2023 2023 1809 2084 2050 2025 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second haswell skylake x86-64 znver1 znver2 znver3 200K 400K 600K 800K 1000K SE +/- 563.60, N = 3 SE +/- 1894.03, N = 3 SE +/- 651.07, N = 3 SE +/- 2226.29, N = 3 SE +/- 7614.04, N = 3 SE +/- 5296.50, N = 3 776752.22 823512.89 783951.87 835605.87 834342.44 837788.08 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O2 -O3 -lrt" -lrt
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite haswell skylake x86-64 znver1 znver2 znver3 1000 2000 3000 4000 5000 SE +/- 11.48, N = 3 SE +/- 4.37, N = 3 SE +/- 5.93, N = 3 SE +/- 13.80, N = 3 SE +/- 1.02, N = 3 SE +/- 0.19, N = 3 4332.88 4314.57 3662.75 3304.21 4310.95 4463.42 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply haswell skylake x86-64 znver1 znver2 znver3 1100 2200 3300 4400 5500 SE +/- 44.33, N = 3 SE +/- 10.56, N = 3 SE +/- 10.14, N = 3 SE +/- 1.53, N = 3 SE +/- 1.75, N = 3 SE +/- 2.08, N = 3 5062.07 4955.38 4977.77 4392.61 4918.89 5091.12 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation haswell skylake x86-64 znver1 znver2 znver3 700 1400 2100 2800 3500 SE +/- 2.02, N = 3 SE +/- 0.45, N = 3 SE +/- 0.05, N = 3 SE +/- 3.35, N = 3 SE +/- 3.83, N = 3 SE +/- 0.78, N = 3 2934.46 2936.04 2440.10 2913.83 2931.78 3072.04 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -lm
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark haswell skylake x86-64 znver1 znver2 znver3 20 40 60 80 100 SE +/- 0.11, N = 3 SE +/- 0.39, N = 3 SE +/- 0.02, N = 3 SE +/- 0.16, N = 3 SE +/- 0.20, N = 3 SE +/- 0.06, N = 3 74.13 75.04 65.48 74.54 76.01 76.04 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -O3 -fopenmp -ljpeg -lz -lm
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time haswell skylake x86-64 znver1 znver2 znver3 3M 6M 9M 12M 15M SE +/- 22104.24, N = 3 SE +/- 69563.68, N = 3 SE +/- 18592.10, N = 3 SE +/- 33532.81, N = 3 SE +/- 44223.61, N = 3 SE +/- 121191.73, N = 3 11935854 11943425 11828707 11865359 11910299 12060994 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
Hierarchical INTegration Test: FLOAT OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT haswell skylake x86-64 znver1 znver2 znver3 120M 240M 360M 480M 600M SE +/- 2024395.89, N = 3 SE +/- 4152069.13, N = 3 SE +/- 1444160.80, N = 3 SE +/- 114637.70, N = 3 SE +/- 5908890.92, N = 3 SE +/- 963540.42, N = 3 519167343.78 520771613.79 521237587.67 523840766.07 529343739.10 539663428.11 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -march=native -lm
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping haswell skylake x86-64 znver1 znver2 znver3 200 400 600 800 1000 SE +/- 1.90, N = 3 SE +/- 1.88, N = 3 SE +/- 2.69, N = 3 SE +/- 0.66, N = 3 SE +/- 2.52, N = 3 SE +/- 3.90, N = 3 969.60 962.73 932.29 950.30 963.53 969.38 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image haswell skylake x86-64 znver1 znver2 znver3 6K 12K 18K 24K 30K SE +/- 289.82, N = 3 SE +/- 113.76, N = 3 SE +/- 108.90, N = 3 SE +/- 159.95, N = 3 SE +/- 240.50, N = 3 SE +/- 109.65, N = 3 29785.61 30279.77 29502.08 29918.70 29824.34 29625.96 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100 haswell skylake x86-64 znver1 znver2 znver3 0.3863 0.7726 1.1589 1.5452 1.9315 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 SE +/- 0.003, N = 3 SE +/- 0.020, N = 3 1.690 1.684 1.717 1.692 1.689 1.670 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -fvisibility=hidden -O3 -pthread -lm -ljpeg -lpng16 -ltiff
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel haswell skylake x86-64 znver1 znver2 znver3 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 25.46 25.09 31.21 26.07 25.32 25.20 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -lm -lpthread -O3
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples haswell skylake x86-64 znver1 znver2 znver3 1.1383 2.2766 3.4149 4.5532 5.6915 SE +/- 0.014, N = 3 SE +/- 0.005, N = 3 SE +/- 0.010, N = 3 SE +/- 0.007, N = 3 SE +/- 0.007, N = 3 SE +/- 0.005, N = 3 4.906 4.872 5.059 4.790 4.664 4.627 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CXX) g++ options: -fopenmp -O3
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time haswell skylake x86-64 znver1 znver2 znver3 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.68, N = 15 SE +/- 0.24, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 25.33 25.37 29.52 25.03 25.25 25.26 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -lm -O3
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 haswell skylake x86-64 znver1 znver2 znver3 1.31 2.62 3.93 5.24 6.55 SE +/- 0.020, N = 3 SE +/- 0.007, N = 3 SE +/- 0.015, N = 3 SE +/- 0.009, N = 3 SE +/- 0.010, N = 3 SE +/- 0.006, N = 3 5.520 5.565 5.822 5.546 5.478 5.468 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 haswell skylake x86-64 znver1 znver2 znver3 4 8 12 16 20 SE +/- 0.16, N = 3 SE +/- 0.05, N = 3 SE +/- 0.22, N = 3 SE +/- 0.10, N = 3 SE +/- 0.19, N = 3 SE +/- 0.13, N = 3 15.51 15.58 14.63 14.10 13.85 13.99 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -pedantic -fvisibility=hidden
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 haswell skylake x86-64 znver1 znver2 znver3 10 20 30 40 50 SE +/- 0.23, N = 3 SE +/- 0.15, N = 3 SE +/- 0.18, N = 3 SE +/- 0.19, N = 3 SE +/- 0.61, N = 3 SE +/- 0.25, N = 3 42.23 42.78 42.19 42.25 42.09 41.91 -march=haswell -march=skylake -march=x86-64 -march=znver1 -march=znver2 -march=znver3 1. (CC) gcc options: -O3 -ldl -lz -lpthread
Geometric Mean Of All Test Results Result Composite - GCC Znver3 First Cut Benchmarks OpenBenchmarking.org Geometric Mean, More Is Better Geometric Mean Of All Test Results Result Composite - GCC Znver3 First Cut Benchmarks haswell skylake x86-64 znver1 znver2 znver3 110 220 330 440 550 473.43 482.49 441.20 478.18 488.84 492.26
Phoronix Test Suite v10.8.4