GCC Znver3 First Cut Benchmarks

Tests for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012046-HA-GCCZNVER333
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

C/C++ Compiler Tests 9 Tests
CPU Massive 7 Tests
Creator Workloads 11 Tests
Encoding 4 Tests
HPC - High Performance Computing 3 Tests
Imaging 3 Tests
Multi-Core 9 Tests
Programmer / Developer System Benchmarks 2 Tests
Renderers 3 Tests
Server CPU Tests 2 Tests
Single-Threaded 3 Tests
Video Encoding 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
znver3
December 03 2020
  56 Minutes
x86-64
December 04 2020
  1 Hour, 1 Minute
znver1
December 04 2020
  51 Minutes
znver2
December 04 2020
  54 Minutes
haswell
December 04 2020
  1 Hour, 4 Minutes
skylake
December 04 2020
  56 Minutes
Invert Hiding All Results Option
  57 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


GCC Znver3 First Cut Benchmarks - Phoronix Test Suite

GCC Znver3 First Cut Benchmarks

Tests for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2012046-HA-GCCZNVER333&sgm=1&hgv=znver3&gru&export=txt&sor.

GCC Znver3 First Cut BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolutionznver3x86-64znver1znver2haswellskylakeAMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (2311 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBNVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz)NVIDIA Device 228bASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-54-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 455.45.014.6.0OpenCL 1.2 CUDA 11.1.1141.2.142GCC 11.0.0 20201203ext43840x2160OpenBenchmarking.orgEnvironment Details- znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"- x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"- znver1: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1"- znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"- haswell: CXXFLAGS="-O3 -march=haswell" CFLAGS="-O3 -march=haswell"- skylake: CXXFLAGS="-O3 -march=skylake" CFLAGS="-O3 -march=skylake"Compiler Details- --disable-multilib --enable-checking=releaseDisk Details- znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 OpenCL Details- GPU Compute Cores: 4864Python Details- znver3: Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

GCC Znver3 First Cut Benchmarksdav1d: Chimera 1080p 10-bitkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Ultra Fastvpxenc: Speed 0vpxenc: Speed 5mt-dgemm: Sustained Floating-Point Rategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizingcoremark: CoreMark Size 666 - Iterations Per Secondscimark2: Compositescimark2: Sparse Matrix Multiplyscimark2: Jacobi Successive Over-Relaxationlibraw: Post-Processing Benchmarkcrafty: Elapsed Timehint: FLOATdaphne: OpenMP - NDT Mappingdaphne: OpenMP - Points2Imagewebp: Quality 100c-ray: Total Time - 4K, 16 Rays Per Pixelsmallpt: Global Illumination Renderer; 128 Samplesaobench: 2048 x 2048 - Total Timeencode-mp3: WAV To MP3rnnoise: sqlite-speedtest: Timed Time - Size 1,000znver3x86-64znver1znver2haswellskylake263.9755.82203.0810.7639.138.9723473744472025837788.0842344463.425091.123072.0476.0412060994539663428.10785969.3829625.9578027101.67025.2024.62725.2585.46813.98941.905189.6255.07202.4310.1736.038.0541802253821809783951.8672503662.754977.772440.1065.4811828707521237587.67053932.2929502.0845242881.71731.2125.05929.5215.82214.63342.186251.1156.22205.7110.8238.958.7783163674462084835605.8739753304.214392.612913.8374.5411865359523840766.06954950.3029918.6996710601.69226.0734.79025.0285.54614.09542.250265.1655.85201.8410.7738.978.9221333734462050834342.4401894310.954918.892931.7876.0111910299529343739.09606963.5329824.3403874061.68925.3214.66425.2545.47813.85442.086175.3657.53209.1910.8139.187.8293192995242023776752.2175284332.885062.072934.4674.1311935854519167343.77856969.6029785.6138137081.69025.4554.90625.3285.52015.51242.229262.4055.75203.6910.7738.878.1114863774362023823512.8864064314.574955.382936.0475.0411943425520771613.78862962.7330279.7711343161.68425.0854.87225.3695.56515.57542.783OpenBenchmarking.org

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bitznver2znver3skylakeznver1x86-64haswell60120180240300SE +/- 0.73, N = 3SE +/- 0.85, N = 3SE +/- 0.56, N = 3SE +/- 0.45, N = 3SE +/- 0.23, N = 3SE +/- 0.22, N = 3265.16263.97262.40251.11189.62175.36-march=znver2 - MIN: 175.81 / MAX: 502.41-march=znver3 - MIN: 175.34 / MAX: 478.58-march=skylake - MIN: 173.7 / MAX: 495.13-march=znver1 - MIN: 167.21 / MAX: 471.36-march=x86-64 - MIN: 124.1 / MAX: 372.85-march=haswell - MIN: 93.55 / MAX: 459.711. (CC) gcc options: -O3 -pthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fasthaswellznver1znver2znver3skylakex86-641326395265SE +/- 0.01, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.22, N = 3SE +/- 0.10, N = 357.5356.2255.8555.8255.7555.07-march=haswell-march=znver1-march=znver2-march=znver3-march=skylake-march=x86-641. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fasthaswellznver1skylakeznver3x86-64znver250100150200250SE +/- 1.14, N = 3SE +/- 0.58, N = 3SE +/- 0.72, N = 3SE +/- 0.81, N = 3SE +/- 0.29, N = 3SE +/- 1.10, N = 3209.19205.71203.69203.08202.43201.84-march=haswell-march=znver1-march=skylake-march=znver3-march=x86-64-march=znver21. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt

VP9 libvpx Encoding

Speed: Speed 0

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 0znver1haswellskylakeznver2znver3x86-643691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 310.8210.8110.7710.7710.7610.17-march=znver1-march=haswell-march=skylake-march=znver2-march=znver3-march=x86-641. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

VP9 libvpx Encoding

Speed: Speed 5

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 5haswellznver3znver2znver1skylakex86-64918273645SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 3SE +/- 0.33, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 339.1839.1338.9738.9538.8736.03-march=haswell-march=znver3-march=znver2-march=znver1-march=skylake-march=x86-641. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rateznver3znver2znver1skylakex86-64haswell3691215SE +/- 0.135072, N = 3SE +/- 0.094255, N = 3SE +/- 0.028561, N = 3SE +/- 0.117999, N = 4SE +/- 0.072954, N = 3SE +/- 0.062576, N = 158.9723478.9221338.7783168.1114868.0541807.829319-march=znver3-march=znver2-march=znver1-march=skylake-march=x86-64-march=haswell1. (CC) gcc options: -O3 -march=native -fopenmp

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpenskylakeznver3znver2znver1haswellx86-6480160240320400SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.67, N = 3377374373367299225-march=skylake-march=znver3-march=znver2-march=znver1-march=haswell-march=x86-641. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhancedhaswellznver3znver2znver1skylakex86-64110220330440550SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3524447446446436382-march=haswell-march=znver3-march=znver2-march=znver1-march=skylake-march=x86-641. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizingznver1znver2znver3skylakehaswellx86-64400800120016002000SE +/- 0.88, N = 3SE +/- 4.91, N = 3SE +/- 4.33, N = 3SE +/- 4.10, N = 3SE +/- 1.00, N = 3SE +/- 3.79, N = 3208420502025202320231809-march=znver1-march=znver2-march=znver3-march=skylake-march=haswell-march=x86-641. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondznver3znver1znver2skylakex86-64haswell200K400K600K800K1000KSE +/- 5296.50, N = 3SE +/- 2226.29, N = 3SE +/- 7614.04, N = 3SE +/- 1894.03, N = 3SE +/- 651.07, N = 3SE +/- 563.60, N = 3837788.08835605.87834342.44823512.89783951.87776752.22-march=znver3-march=znver1-march=znver2-march=skylake-march=x86-64-march=haswell1. (CC) gcc options: -O2 -O3 -lrt" -lrt

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositeznver3haswellskylakeznver2x86-64znver110002000300040005000SE +/- 0.19, N = 3SE +/- 11.48, N = 3SE +/- 4.37, N = 3SE +/- 1.02, N = 3SE +/- 5.93, N = 3SE +/- 13.80, N = 34463.424332.884314.574310.953662.753304.21-march=znver3-march=haswell-march=skylake-march=znver2-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiplyznver3haswellx86-64skylakeznver2znver111002200330044005500SE +/- 2.08, N = 3SE +/- 44.33, N = 3SE +/- 10.14, N = 3SE +/- 10.56, N = 3SE +/- 1.75, N = 3SE +/- 1.53, N = 35091.125062.074977.774955.384918.894392.61-march=znver3-march=haswell-march=x86-64-march=skylake-march=znver2-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxationznver3skylakehaswellznver2znver1x86-647001400210028003500SE +/- 0.78, N = 3SE +/- 0.45, N = 3SE +/- 2.02, N = 3SE +/- 3.83, N = 3SE +/- 3.35, N = 3SE +/- 0.05, N = 33072.042936.042934.462931.782913.832440.10-march=znver3-march=skylake-march=haswell-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmarkznver3znver2skylakeznver1haswellx86-6420406080100SE +/- 0.06, N = 3SE +/- 0.20, N = 3SE +/- 0.39, N = 3SE +/- 0.16, N = 3SE +/- 0.11, N = 3SE +/- 0.02, N = 376.0476.0175.0474.5474.1365.48-march=znver3-march=znver2-march=skylake-march=znver1-march=haswell-march=x86-641. (CXX) g++ options: -O3 -fopenmp -ljpeg -lz -lm

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Timeznver3skylakehaswellznver2znver1x86-643M6M9M12M15MSE +/- 121191.73, N = 3SE +/- 69563.68, N = 3SE +/- 22104.24, N = 3SE +/- 44223.61, N = 3SE +/- 33532.81, N = 3SE +/- 18592.10, N = 31206099411943425119358541191029911865359118287071. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATznver3znver2znver1x86-64skylakehaswell120M240M360M480M600MSE +/- 963540.42, N = 3SE +/- 5908890.92, N = 3SE +/- 114637.70, N = 3SE +/- 1444160.80, N = 3SE +/- 4152069.13, N = 3SE +/- 2024395.89, N = 3539663428.11529343739.10523840766.07521237587.67520771613.79519167343.78-march=znver3-march=znver2-march=znver1-march=x86-64-march=skylake-march=haswell1. (CC) gcc options: -O3 -march=native -lm

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT Mappinghaswellznver3znver2skylakeznver1x86-642004006008001000SE +/- 1.90, N = 3SE +/- 3.90, N = 3SE +/- 2.52, N = 3SE +/- 1.88, N = 3SE +/- 0.66, N = 3SE +/- 2.69, N = 3969.60969.38963.53962.73950.30932.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2Imageskylakeznver1znver2haswellznver3x86-646K12K18K24K30KSE +/- 113.76, N = 3SE +/- 159.95, N = 3SE +/- 240.50, N = 3SE +/- 289.82, N = 3SE +/- 109.65, N = 3SE +/- 108.90, N = 330279.7729918.7029824.3429785.6129625.9629502.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100znver3skylakeznver2haswellznver1x86-640.38630.77261.15891.54521.9315SE +/- 0.020, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 31.6701.6841.6891.6901.6921.717-march=znver3-march=skylake-march=znver2-march=haswell-march=znver1-march=x86-641. (CC) gcc options: -fvisibility=hidden -O3 -pthread -lm -ljpeg -lpng16 -ltiff

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelskylakeznver3znver2haswellznver1x86-64714212835SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 325.0925.2025.3225.4626.0731.21-march=skylake-march=znver3-march=znver2-march=haswell-march=znver1-march=x86-641. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samplesznver3znver2znver1skylakehaswellx86-641.13832.27663.41494.55325.6915SE +/- 0.005, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 3SE +/- 0.005, N = 3SE +/- 0.014, N = 3SE +/- 0.010, N = 34.6274.6644.7904.8724.9065.059-march=znver3-march=znver2-march=znver1-march=skylake-march=haswell-march=x86-641. (CXX) g++ options: -fopenmp -O3

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Timeznver1znver2znver3haswellskylakex86-64714212835SE +/- 0.24, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.68, N = 1525.0325.2525.2625.3325.3729.52-march=znver1-march=znver2-march=znver3-march=haswell-march=skylake-march=x86-641. (CC) gcc options: -lm -O3

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3znver3znver2haswellznver1skylakex86-641.312.623.935.246.55SE +/- 0.006, N = 3SE +/- 0.010, N = 3SE +/- 0.020, N = 3SE +/- 0.009, N = 3SE +/- 0.007, N = 3SE +/- 0.015, N = 35.4685.4785.5205.5465.5655.822-march=znver3-march=znver2-march=haswell-march=znver1-march=skylake-march=x86-641. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm

RNNoise

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28znver2znver3znver1x86-64haswellskylake48121620SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.22, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 313.8513.9914.1014.6315.5115.58-march=znver2-march=znver3-march=znver1-march=x86-64-march=haswell-march=skylake1. (CC) gcc options: -O3 -pedantic -fvisibility=hidden

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000znver3znver2x86-64haswellznver1skylake1020304050SE +/- 0.25, N = 3SE +/- 0.61, N = 3SE +/- 0.18, N = 3SE +/- 0.23, N = 3SE +/- 0.19, N = 3SE +/- 0.15, N = 341.9142.0942.1942.2342.2542.78-march=znver3-march=znver2-march=x86-64-march=haswell-march=znver1-march=skylake1. (CC) gcc options: -O3 -ldl -lz -lpthread

Geometric Mean Of All Test Results

Result Composite - GCC Znver3 First Cut Benchmarks

OpenBenchmarking.orgGeometric Mean, More Is BetterGeometric Mean Of All Test ResultsResult Composite - GCC Znver3 First Cut Benchmarksznver3znver2skylakeznver1haswellx86-64110220330440550492.26488.84482.49478.18473.43441.20


Phoronix Test Suite v10.8.4