GCC Znver3 First Cut Benchmarks

Tests for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012046-HA-GCCZNVER333
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

C/C++ Compiler Tests 9 Tests
CPU Massive 7 Tests
Creator Workloads 11 Tests
Encoding 4 Tests
HPC - High Performance Computing 3 Tests
Imaging 3 Tests
Multi-Core 9 Tests
Programmer / Developer System Benchmarks 2 Tests
Renderers 3 Tests
Server CPU Tests 2 Tests
Single-Threaded 3 Tests
Video Encoding 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
znver3
December 03 2020
  56 Minutes
x86-64
December 04 2020
  1 Hour, 1 Minute
znver1
December 04 2020
  51 Minutes
znver2
December 04 2020
  54 Minutes
haswell
December 04 2020
  1 Hour, 4 Minutes
skylake
December 04 2020
  56 Minutes
Invert Hiding All Results Option
  57 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


GCC Znver3 First Cut BenchmarksOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (2311 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBNVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz)NVIDIA Device 228bASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-54-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 455.45.014.6.0OpenCL 1.2 CUDA 11.1.1141.2.142GCC 11.0.0 20201203ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionGCC Znver3 First Cut Benchmarks PerformanceSystem Logs- znver3: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"- x86-64: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"- znver1: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1"- znver2: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"- haswell: CXXFLAGS="-O3 -march=haswell" CFLAGS="-O3 -march=haswell"- skylake: CXXFLAGS="-O3 -march=skylake" CFLAGS="-O3 -march=skylake"- --disable-multilib --enable-checking=release- znver3: NONE / errors=remount-ro,relatime,rw / Block Size: 4096- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009 - GPU Compute Cores: 4864- znver3: Python 2.7.18 + Python 3.8.5- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

znver3x86-64znver1znver2haswellskylakeResult OverviewPhoronix Test Suite100%113%126%138%dav1dGraphicsMagickC-RaySciMarkAOBenchLibRawACES DGEMMRNNoiseSmallptCoremarkVP9 libvpx EncodingLAME MP3 EncodingHierarchical INTegrationKvazaarDarmstadt Automotive Parallel Heterogeneous SuiteWebP Image EncodeSQLite SpeedtestCrafty

GCC Znver3 First Cut Benchmarkswebp: Quality 100scimark2: Compositescimark2: Sparse Matrix Multiplyscimark2: Jacobi Successive Over-Relaxationlibraw: Post-Processing Benchmarkcrafty: Elapsed Timegraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizingdav1d: Chimera 1080p 10-bitkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Ultra Fastvpxenc: Speed 0vpxenc: Speed 5mt-dgemm: Sustained Floating-Point Ratecoremark: CoreMark Size 666 - Iterations Per Secondc-ray: Total Time - 4K, 16 Rays Per Pixelsmallpt: Global Illumination Renderer; 128 Samplesaobench: 2048 x 2048 - Total Timeencode-mp3: WAV To MP3rnnoise: daphne: OpenMP - NDT Mappingdaphne: OpenMP - Points2Imagesqlite-speedtest: Timed Time - Size 1,000hint: FLOATznver3x86-64znver1znver2haswellskylake1.6704463.425091.123072.0476.04120609943744472025263.9755.82203.0810.7639.138.972347837788.08423425.2024.62725.2585.46813.989969.3829625.95780271041.905539663428.107851.7173662.754977.772440.1065.48118287072253821809189.6255.07202.4310.1736.038.054180783951.86725031.2125.05929.5215.82214.633932.2929502.08452428842.186521237587.670531.6923304.214392.612913.8374.54118653593674462084251.1156.22205.7110.8238.958.778316835605.87397526.0734.79025.0285.54614.095950.3029918.69967106042.250523840766.069541.6894310.954918.892931.7876.01119102993734462050265.1655.85201.8410.7738.978.922133834342.44018925.3214.66425.2545.47813.854963.5329824.34038740642.086529343739.096061.6904332.885062.072934.4674.13119358542995242023175.3657.53209.1910.8139.187.829319776752.21752825.4554.90625.3285.52015.512969.6029785.61381370842.229519167343.778561.6844314.574955.382936.0475.04119434253774362023262.4055.75203.6910.7738.878.111486823512.88640625.0854.87225.3695.56515.575962.7330279.77113431642.783520771613.78862OpenBenchmarking.org

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100znver3x86-64znver1znver2haswellskylake0.38630.77261.15891.54521.9315SE +/- 0.020, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 31.6701.7171.6921.6891.6901.684-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -fvisibility=hidden -O3 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100znver3x86-64znver1znver2haswellskylake246810Min: 1.63 / Avg: 1.67 / Max: 1.7Min: 1.71 / Avg: 1.72 / Max: 1.73Min: 1.69 / Avg: 1.69 / Max: 1.7Min: 1.68 / Avg: 1.69 / Max: 1.7Min: 1.68 / Avg: 1.69 / Max: 1.7Min: 1.68 / Avg: 1.68 / Max: 1.691. (CC) gcc options: -fvisibility=hidden -O3 -pthread -lm -ljpeg -lpng16 -ltiff

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositeznver3x86-64znver1znver2haswellskylake10002000300040005000SE +/- 0.19, N = 3SE +/- 5.93, N = 3SE +/- 13.80, N = 3SE +/- 1.02, N = 3SE +/- 11.48, N = 3SE +/- 4.37, N = 34463.423662.753304.214310.954332.884314.57-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -lm
OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositeznver3x86-64znver1znver2haswellskylake8001600240032004000Min: 4463.11 / Avg: 4463.42 / Max: 4463.76Min: 3651.09 / Avg: 3662.75 / Max: 3670.42Min: 3277.28 / Avg: 3304.21 / Max: 3322.89Min: 4309.73 / Avg: 4310.95 / Max: 4312.98Min: 4314.86 / Avg: 4332.88 / Max: 4354.21Min: 4307.56 / Avg: 4314.57 / Max: 4322.581. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiplyznver3x86-64znver1znver2haswellskylake11002200330044005500SE +/- 2.08, N = 3SE +/- 10.14, N = 3SE +/- 1.53, N = 3SE +/- 1.75, N = 3SE +/- 44.33, N = 3SE +/- 10.56, N = 35091.124977.774392.614918.895062.074955.38-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -lm
OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiplyznver3x86-64znver1znver2haswellskylake9001800270036004500Min: 5087 / Avg: 5091.12 / Max: 5093.61Min: 4959.33 / Avg: 4977.77 / Max: 4994.32Min: 4391.04 / Avg: 4392.61 / Max: 4395.67Min: 4915.61 / Avg: 4918.89 / Max: 4921.6Min: 5015.39 / Avg: 5062.07 / Max: 5150.69Min: 4942.04 / Avg: 4955.38 / Max: 4976.231. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxationznver3x86-64znver1znver2haswellskylake7001400210028003500SE +/- 0.78, N = 3SE +/- 0.05, N = 3SE +/- 3.35, N = 3SE +/- 3.83, N = 3SE +/- 2.02, N = 3SE +/- 0.45, N = 33072.042440.102913.832931.782934.462936.04-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -lm
OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxationznver3x86-64znver1znver2haswellskylake5001000150020002500Min: 3070.49 / Avg: 3072.04 / Max: 3072.9Min: 2440.01 / Avg: 2440.1 / Max: 2440.17Min: 2907.35 / Avg: 2913.83 / Max: 2918.52Min: 2924.26 / Avg: 2931.78 / Max: 2936.81Min: 2930.43 / Avg: 2934.46 / Max: 2936.69Min: 2935.35 / Avg: 2936.04 / Max: 2936.881. (CC) gcc options: -O3 -lm

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmarkznver3x86-64znver1znver2haswellskylake20406080100SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.16, N = 3SE +/- 0.20, N = 3SE +/- 0.11, N = 3SE +/- 0.39, N = 376.0465.4874.5476.0174.1375.04-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CXX) g++ options: -O3 -fopenmp -ljpeg -lz -lm
OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing Benchmarkznver3x86-64znver1znver2haswellskylake1530456075Min: 75.91 / Avg: 76.04 / Max: 76.12Min: 65.46 / Avg: 65.48 / Max: 65.53Min: 74.25 / Avg: 74.54 / Max: 74.78Min: 75.63 / Avg: 76.01 / Max: 76.33Min: 73.91 / Avg: 74.13 / Max: 74.28Min: 74.26 / Avg: 75.04 / Max: 75.461. (CXX) g++ options: -O3 -fopenmp -ljpeg -lz -lm

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Timeznver3x86-64znver1znver2haswellskylake3M6M9M12M15MSE +/- 121191.73, N = 3SE +/- 18592.10, N = 3SE +/- 33532.81, N = 3SE +/- 44223.61, N = 3SE +/- 22104.24, N = 3SE +/- 69563.68, N = 31206099411828707118653591191029911935854119434251. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Timeznver3x86-64znver1znver2haswellskylake2M4M6M8M10MMin: 11859705 / Avg: 12060994.33 / Max: 12278576Min: 11791777 / Avg: 11828706.67 / Max: 11850933Min: 11804954 / Avg: 11865358.67 / Max: 11920796Min: 11826723 / Avg: 11910299.33 / Max: 11977156Min: 11904745 / Avg: 11935854.33 / Max: 11978611Min: 11843663 / Avg: 11943425 / Max: 120772881. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpenznver3x86-64znver1znver2haswellskylake80160240320400SE +/- 1.00, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3374225367373299377-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpenznver3x86-64znver1znver2haswellskylake70140210280350Min: 373 / Avg: 374 / Max: 376Min: 224 / Avg: 224.67 / Max: 226Min: 367 / Avg: 367.33 / Max: 368Min: 373 / Avg: 373.33 / Max: 374Min: 299 / Avg: 299.33 / Max: 300Min: 377 / Avg: 377.33 / Max: 3781. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhancedznver3x86-64znver1znver2haswellskylake110220330440550SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3447382446446524436-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhancedznver3x86-64znver1znver2haswellskylake90180270360450Min: 446 / Avg: 447 / Max: 448Min: 381 / Avg: 382 / Max: 383Min: 446 / Avg: 446.33 / Max: 447Min: 524 / Avg: 524.33 / Max: 525Min: 436 / Avg: 436.33 / Max: 4371. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizingznver3x86-64znver1znver2haswellskylake400800120016002000SE +/- 4.33, N = 3SE +/- 3.79, N = 3SE +/- 0.88, N = 3SE +/- 4.91, N = 3SE +/- 1.00, N = 3SE +/- 4.10, N = 3202518092084205020232023-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizingznver3x86-64znver1znver2haswellskylake400800120016002000Min: 2018 / Avg: 2025.33 / Max: 2033Min: 1802 / Avg: 1809 / Max: 1815Min: 2083 / Avg: 2084.33 / Max: 2086Min: 2041 / Avg: 2049.67 / Max: 2058Min: 2022 / Avg: 2023 / Max: 2025Min: 2015 / Avg: 2022.67 / Max: 20291. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bitznver3x86-64znver1znver2haswellskylake60120180240300SE +/- 0.85, N = 3SE +/- 0.23, N = 3SE +/- 0.45, N = 3SE +/- 0.73, N = 3SE +/- 0.22, N = 3SE +/- 0.56, N = 3263.97189.62251.11265.16175.36262.40-march=znver3 - MIN: 175.34 / MAX: 478.58-march=x86-64 - MIN: 124.1 / MAX: 372.85-march=znver1 - MIN: 167.21 / MAX: 471.36-march=znver2 - MIN: 175.81 / MAX: 502.41-march=haswell - MIN: 93.55 / MAX: 459.71-march=skylake - MIN: 173.7 / MAX: 495.131. (CC) gcc options: -O3 -pthread
OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bitznver3x86-64znver1znver2haswellskylake50100150200250Min: 262.4 / Avg: 263.97 / Max: 265.3Min: 189.31 / Avg: 189.62 / Max: 190.07Min: 250.22 / Avg: 251.11 / Max: 251.61Min: 263.77 / Avg: 265.16 / Max: 266.22Min: 174.98 / Avg: 175.36 / Max: 175.75Min: 261.36 / Avg: 262.4 / Max: 263.261. (CC) gcc options: -O3 -pthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fastznver3x86-64znver1znver2haswellskylake1326395265SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.22, N = 355.8255.0756.2255.8557.5355.75-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fastznver3x86-64znver1znver2haswellskylake1122334455Min: 55.67 / Avg: 55.82 / Max: 55.99Min: 54.87 / Avg: 55.07 / Max: 55.21Min: 55.83 / Avg: 56.22 / Max: 56.43Min: 55.67 / Avg: 55.85 / Max: 56.02Min: 57.52 / Avg: 57.53 / Max: 57.54Min: 55.31 / Avg: 55.75 / Max: 56.021. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fastznver3x86-64znver1znver2haswellskylake50100150200250SE +/- 0.81, N = 3SE +/- 0.29, N = 3SE +/- 0.58, N = 3SE +/- 1.10, N = 3SE +/- 1.14, N = 3SE +/- 0.72, N = 3203.08202.43205.71201.84209.19203.69-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fastznver3x86-64znver1znver2haswellskylake4080120160200Min: 201.71 / Avg: 203.08 / Max: 204.53Min: 201.88 / Avg: 202.43 / Max: 202.86Min: 205.13 / Avg: 205.71 / Max: 206.87Min: 200.38 / Avg: 201.84 / Max: 204Min: 207.24 / Avg: 209.19 / Max: 211.19Min: 202.25 / Avg: 203.69 / Max: 204.481. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -lpthread -lm -lrt

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 0znver3x86-64znver1znver2haswellskylake3691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 310.7610.1710.8210.7710.8110.77-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 0znver3x86-64znver1znver2haswellskylake3691215Min: 10.72 / Avg: 10.76 / Max: 10.78Min: 10.14 / Avg: 10.17 / Max: 10.22Min: 10.79 / Avg: 10.82 / Max: 10.84Min: 10.71 / Avg: 10.77 / Max: 10.8Min: 10.73 / Avg: 10.81 / Max: 10.85Min: 10.73 / Avg: 10.77 / Max: 10.81. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 5znver3x86-64znver1znver2haswellskylake918273645SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.33, N = 3SE +/- 0.15, N = 3SE +/- 0.02, N = 3SE +/- 0.27, N = 339.1336.0338.9538.9739.1838.87-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11
OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.2Speed: Speed 5znver3x86-64znver1znver2haswellskylake816243240Min: 39.07 / Avg: 39.13 / Max: 39.19Min: 35.88 / Avg: 36.03 / Max: 36.18Min: 38.6 / Avg: 38.95 / Max: 39.6Min: 38.67 / Avg: 38.97 / Max: 39.15Min: 39.15 / Avg: 39.18 / Max: 39.22Min: 38.34 / Avg: 38.87 / Max: 39.181. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rateznver3x86-64znver1znver2haswellskylake3691215SE +/- 0.135072, N = 3SE +/- 0.072954, N = 3SE +/- 0.028561, N = 3SE +/- 0.094255, N = 3SE +/- 0.062576, N = 15SE +/- 0.117999, N = 48.9723478.0541808.7783168.9221337.8293198.111486-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -march=native -fopenmp
OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rateznver3x86-64znver1znver2haswellskylake3691215Min: 8.7 / Avg: 8.97 / Max: 9.14Min: 7.92 / Avg: 8.05 / Max: 8.17Min: 8.72 / Avg: 8.78 / Max: 8.82Min: 8.74 / Avg: 8.92 / Max: 9.04Min: 7.36 / Avg: 7.83 / Max: 8.27Min: 7.92 / Avg: 8.11 / Max: 8.411. (CC) gcc options: -O3 -march=native -fopenmp

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondznver3x86-64znver1znver2haswellskylake200K400K600K800K1000KSE +/- 5296.50, N = 3SE +/- 651.07, N = 3SE +/- 2226.29, N = 3SE +/- 7614.04, N = 3SE +/- 563.60, N = 3SE +/- 1894.03, N = 3837788.08783951.87835605.87834342.44776752.22823512.89-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O2 -O3 -lrt" -lrt
OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondznver3x86-64znver1znver2haswellskylake150K300K450K600K750KMin: 827229.64 / Avg: 837788.08 / Max: 843807.68Min: 782715.04 / Avg: 783951.87 / Max: 784922.94Min: 831204.81 / Avg: 835605.87 / Max: 838391.34Min: 819252.43 / Avg: 834342.44 / Max: 843659.37Min: 775632.22 / Avg: 776752.22 / Max: 777422.36Min: 820512.82 / Avg: 823512.89 / Max: 827015.851. (CC) gcc options: -O2 -O3 -lrt" -lrt

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelznver3x86-64znver1znver2haswellskylake714212835SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 325.2031.2126.0725.3225.4625.09-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -lm -lpthread -O3
OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelznver3x86-64znver1znver2haswellskylake714212835Min: 25.07 / Avg: 25.2 / Max: 25.31Min: 31.05 / Avg: 31.21 / Max: 31.31Min: 25.94 / Avg: 26.07 / Max: 26.26Min: 25.25 / Avg: 25.32 / Max: 25.38Min: 25.4 / Avg: 25.46 / Max: 25.49Min: 25.06 / Avg: 25.09 / Max: 25.111. (CC) gcc options: -lm -lpthread -O3

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samplesznver3x86-64znver1znver2haswellskylake1.13832.27663.41494.55325.6915SE +/- 0.005, N = 3SE +/- 0.010, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 3SE +/- 0.014, N = 3SE +/- 0.005, N = 34.6275.0594.7904.6644.9064.872-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CXX) g++ options: -fopenmp -O3
OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samplesznver3x86-64znver1znver2haswellskylake246810Min: 4.62 / Avg: 4.63 / Max: 4.64Min: 5.04 / Avg: 5.06 / Max: 5.07Min: 4.78 / Avg: 4.79 / Max: 4.8Min: 4.66 / Avg: 4.66 / Max: 4.68Min: 4.88 / Avg: 4.91 / Max: 4.92Min: 4.86 / Avg: 4.87 / Max: 4.881. (CXX) g++ options: -fopenmp -O3

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Timeznver3x86-64znver1znver2haswellskylake714212835SE +/- 0.06, N = 3SE +/- 0.68, N = 15SE +/- 0.24, N = 3SE +/- 0.01, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 325.2629.5225.0325.2525.3325.37-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -lm -O3
OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Timeznver3x86-64znver1znver2haswellskylake714212835Min: 25.15 / Avg: 25.26 / Max: 25.36Min: 28.4 / Avg: 29.52 / Max: 39.02Min: 24.55 / Avg: 25.03 / Max: 25.31Min: 25.23 / Avg: 25.25 / Max: 25.27Min: 25.15 / Avg: 25.33 / Max: 25.49Min: 25.34 / Avg: 25.37 / Max: 25.431. (CC) gcc options: -lm -O3

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3znver3x86-64znver1znver2haswellskylake1.312.623.935.246.55SE +/- 0.006, N = 3SE +/- 0.015, N = 3SE +/- 0.009, N = 3SE +/- 0.010, N = 3SE +/- 0.020, N = 3SE +/- 0.007, N = 35.4685.8225.5465.4785.5205.565-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3znver3x86-64znver1znver2haswellskylake246810Min: 5.46 / Avg: 5.47 / Max: 5.48Min: 5.8 / Avg: 5.82 / Max: 5.85Min: 5.54 / Avg: 5.55 / Max: 5.56Min: 5.46 / Avg: 5.48 / Max: 5.49Min: 5.48 / Avg: 5.52 / Max: 5.55Min: 5.55 / Avg: 5.57 / Max: 5.571. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lncurses -lm

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28znver3x86-64znver1znver2haswellskylake48121620SE +/- 0.13, N = 3SE +/- 0.22, N = 3SE +/- 0.10, N = 3SE +/- 0.19, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 313.9914.6314.1013.8515.5115.58-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -pedantic -fvisibility=hidden
OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28znver3x86-64znver1znver2haswellskylake48121620Min: 13.73 / Avg: 13.99 / Max: 14.17Min: 14.2 / Avg: 14.63 / Max: 14.86Min: 13.92 / Avg: 14.1 / Max: 14.28Min: 13.64 / Avg: 13.85 / Max: 14.24Min: 15.22 / Avg: 15.51 / Max: 15.76Min: 15.52 / Avg: 15.58 / Max: 15.671. (CC) gcc options: -O3 -pedantic -fvisibility=hidden

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT Mappingznver3x86-64znver1znver2haswellskylake2004006008001000SE +/- 3.90, N = 3SE +/- 2.69, N = 3SE +/- 0.66, N = 3SE +/- 2.52, N = 3SE +/- 1.90, N = 3SE +/- 1.88, N = 3969.38932.29950.30963.53969.60962.731. (CXX) g++ options: -O3 -std=c++11 -fopenmp
OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT Mappingznver3x86-64znver1znver2haswellskylake2004006008001000Min: 961.94 / Avg: 969.38 / Max: 975.12Min: 926.94 / Avg: 932.29 / Max: 935.45Min: 949.01 / Avg: 950.3 / Max: 951.21Min: 958.53 / Avg: 963.53 / Max: 966.53Min: 965.79 / Avg: 969.6 / Max: 971.59Min: 959.14 / Avg: 962.73 / Max: 965.471. (CXX) g++ options: -O3 -std=c++11 -fopenmp

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2Imageznver3x86-64znver1znver2haswellskylake6K12K18K24K30KSE +/- 109.65, N = 3SE +/- 108.90, N = 3SE +/- 159.95, N = 3SE +/- 240.50, N = 3SE +/- 289.82, N = 3SE +/- 113.76, N = 329625.9629502.0829918.7029824.3429785.6130279.771. (CXX) g++ options: -O3 -std=c++11 -fopenmp
OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2Imageznver3x86-64znver1znver2haswellskylake5K10K15K20K25KMin: 29407.01 / Avg: 29625.96 / Max: 29746.12Min: 29316.63 / Avg: 29502.08 / Max: 29693.71Min: 29599.52 / Avg: 29918.7 / Max: 30096.91Min: 29345.16 / Avg: 29824.34 / Max: 30100.08Min: 29288.15 / Avg: 29785.61 / Max: 30292.02Min: 30144.09 / Avg: 30279.77 / Max: 30505.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000znver3x86-64znver1znver2haswellskylake1020304050SE +/- 0.25, N = 3SE +/- 0.18, N = 3SE +/- 0.19, N = 3SE +/- 0.61, N = 3SE +/- 0.23, N = 3SE +/- 0.15, N = 341.9142.1942.2542.0942.2342.78-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -ldl -lz -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000znver3x86-64znver1znver2haswellskylake918273645Min: 41.54 / Avg: 41.91 / Max: 42.37Min: 41.84 / Avg: 42.19 / Max: 42.43Min: 41.89 / Avg: 42.25 / Max: 42.55Min: 40.99 / Avg: 42.09 / Max: 43.09Min: 41.79 / Avg: 42.23 / Max: 42.54Min: 42.55 / Avg: 42.78 / Max: 43.061. (CC) gcc options: -O3 -ldl -lz -lpthread

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATznver3x86-64znver1znver2haswellskylake120M240M360M480M600MSE +/- 963540.42, N = 3SE +/- 1444160.80, N = 3SE +/- 114637.70, N = 3SE +/- 5908890.92, N = 3SE +/- 2024395.89, N = 3SE +/- 4152069.13, N = 3539663428.11521237587.67523840766.07529343739.10519167343.78520771613.79-march=znver3-march=x86-64-march=znver1-march=znver2-march=haswell-march=skylake1. (CC) gcc options: -O3 -march=native -lm
OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATznver3x86-64znver1znver2haswellskylake90M180M270M360M450MMin: 538671763.49 / Avg: 539663428.11 / Max: 541590232.6Min: 518586634.81 / Avg: 521237587.67 / Max: 523556108.88Min: 523639684.45 / Avg: 523840766.07 / Max: 524036701.03Min: 522602910.45 / Avg: 529343739.1 / Max: 541120454.49Min: 516749905.1 / Avg: 519167343.78 / Max: 523188807.12Min: 516550703.49 / Avg: 520771613.79 / Max: 529075369.471. (CC) gcc options: -O3 -march=native -lm