EPYC 7502 AOCC 2.3 Compiler Comparison

AMD EPYC 7502 testing of various benchmarks under AMD AOCC 2.3, GCC 10.2, LLVM Clang 11. CFLAGS/CXXFLAGS of "-O3 -march=znver2" throughout. Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012080-HA-EPYC7502A97
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

AV1 2 Tests
Chess Test Suite 2 Tests
C/C++ Compiler Tests 21 Tests
Compression Tests 2 Tests
CPU Massive 19 Tests
Creator Workloads 17 Tests
Cryptography 2 Tests
Database Test Suite 3 Tests
Encoding 7 Tests
Game Development 2 Tests
HPC - High Performance Computing 6 Tests
Imaging 4 Tests
Common Kernel Benchmarks 3 Tests
Machine Learning 4 Tests
Multi-Core 13 Tests
Programmer / Developer System Benchmarks 2 Tests
Renderers 2 Tests
Server 5 Tests
Server CPU Tests 12 Tests
Single-Threaded 6 Tests
Texture Compression 2 Tests
Video Encoding 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GCC 10.2
December 07 2020
  3 Hours, 49 Minutes
LLVM Clang 11
December 08 2020
  6 Hours, 56 Minutes
AMD AOCC 2.3
December 07 2020
  3 Hours, 58 Minutes
Invert Hiding All Results Option
  4 Hours, 54 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


EPYC 7502 AOCC 2.3 Compiler Comparison Suite 1.0.0 System Test suite extracted from EPYC 7502 AOCC 2.3 Compiler Comparison. pts/ncnn-1.0.3 -1 Target: CPU - Model: blazeface pts/ncnn-1.0.3 -1 Target: CPU - Model: mnasnet pts/c-ray-1.2.0 Total Time - 4K, 16 Rays Per Pixel pts/onednn-1.5.0 --rnn --batch=inputs/rnn/rnn_training --cfg=f32 --engine=cpu Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU pts/ncnn-1.0.3 -1 Target: CPU-v3-v3 - Model: mobilenet-v3 pts/ncnn-1.0.3 -1 Target: CPU-v2-v2 - Model: mobilenet-v2 pts/ncnn-1.0.3 -1 Target: CPU - Model: efficientnet-b0 pts/dav1d-1.6.0 -i chimera_10b_1080p.ivf Video Input: Chimera 1080p 10-bit pts/daphne-1.0.0 OpenMP points2image Backend: OpenMP - Kernel: Points2Image pts/libraw-1.0.0 Post-Processing Benchmark pts/svt-av1-2.2.1 -enc-mode 0 -n 20 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Encoder Mode: Enc Mode 0 - Input: 1080p pts/graphics-magick-2.0.1 -sharpen 0x2.0 Operation: Sharpen pts/openssl-1.11.0 RSA 4096-bit Performance pts/daphne-1.0.0 OpenMP euclidean_cluster Backend: OpenMP - Kernel: Euclidean Cluster pts/svt-av1-2.2.1 -enc-mode 4 -n 80 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Encoder Mode: Enc Mode 4 - Input: 1080p pts/svt-av1-2.2.1 -enc-mode 8 -n 320 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Encoder Mode: Enc Mode 8 - Input: 1080p pts/graphics-magick-2.0.1 -resize 50% Operation: Resizing pts/ncnn-1.0.3 -1 Target: CPU - Model: googlenet pts/encode-mp3-1.7.4 WAV To MP3 pts/onednn-1.5.0 --matmul --batch=inputs/matmul/shapes_transformer --cfg=f32 --engine=cpu Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU pts/onednn-1.5.0 --ip --batch=inputs/ip/ip_1d --cfg=f32 --engine=cpu Harness: IP Batch 1D - Data Type: f32 - Engine: CPU pts/tnn-1.0.0 -dt NAIVE -mp ../benchmark/benchmark-model/mobilenet_v2.tnnproto Target: CPU - Model: MobileNet v2 pts/astcenc-1.0.2 -medium Preset: Medium pts/ncnn-1.0.3 -1 Target: CPU - Model: resnet50 pts/ncnn-1.0.3 -1 Target: CPU - Model: squeezenet pts/graphics-magick-2.0.1 -enhance Operation: Enhanced pts/ncnn-1.0.3 -1 Target: CPU - Model: vgg16 pts/astcenc-1.0.2 -thorough Preset: Thorough pts/webp-1.0.0 -q 100 -m 6 Encode Settings: Quality 100, Highest Compression pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_1d --cfg=f32 --engine=cpu Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU pts/onednn-1.5.0 --ip --batch=inputs/ip/ip_1d --cfg=u8s8f32 --engine=cpu Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU pts/aobench-1.0.1 Size: 2048 x 2048 - Total Time pts/onednn-1.5.0 --ip --batch=inputs/ip/ip_all --cfg=u8s8f32 --engine=cpu Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU pts/ncnn-1.0.3 -1 Target: CPU - Model: mobilenet pts/tscp-1.2.2 AI Chess Performance pts/vpxenc-3.0.0 --cpu-used=0 Speed: Speed 0 pts/astcenc-1.0.2 -exhaustive Preset: Exhaustive pts/basis-1.0.2 -uastc -uastc_level 2 -uastc_rdo_q .75 Settings: UASTC Level 2 + RDO Post-Processing pts/vpxenc-3.0.0 --cpu-used=5 Speed: Speed 5 pts/ncnn-1.0.3 -1 Target: CPU - Model: resnet18 pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_1d --cfg=u8s8f32 --engine=cpu Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU pts/daphne-1.0.0 OpenMP ndt_mapping Backend: OpenMP - Kernel: NDT Mapping pts/stockfish-1.2.0 Total Time pts/compress-lz4-1.0.0 -b3 -e3 Compression Level: 3 - Compression Speed pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_3d --cfg=u8s8f32 --engine=cpu Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU pts/svt-vp9-1.2.2 -tune 1 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p pts/graphics-magick-2.0.1 -swirl 90 Operation: Swirl pts/x265-1.3.0 Bosphorus_3840x2160.y4m Video Input: Bosphorus 4K pts/webp-1.0.0 -q 100 -lossless -m 6 Encode Settings: Quality 100, Lossless, Highest Compression pts/rnnoise-1.0.2 pts/ncnn-1.0.3 -1 Target: CPU - Model: yolov4-tiny pts/pgbench-1.10.1 -s 1 -c 50 -S Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency pts/compress-zstd-1.2.1 -b19 Compression Level: 19 pts/svt-vp9-1.2.2 -tune 0 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p pts/compress-lz4-1.0.0 -b1 -e1 Compression Level: 1 - Compression Speed pts/scimark2-1.3.2 TEST_COMPOSITE Computational Test: Composite pts/sqlite-speedtest-1.0.0 Timed Time - Size 1,000 pts/mrbayes-1.4.0 Primate Phylogeny Analysis pts/pgbench-1.10.1 -s 1 -c 50 -S Scaling Factor: 1 - Clients: 50 - Mode: Read Only pts/x264-2.6.1 H.264 Video Encoding pts/dav1d-1.6.0 -i summer_nature_1080p.ivf Video Input: Summer Nature 1080p pts/pgbench-1.10.1 -s 1 -c 1 Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency pts/pgbench-1.10.1 -s 1 -c 1 Scaling Factor: 1 - Clients: 1 - Mode: Read Write pts/compress-lz4-1.0.0 -b9 -e9 Compression Level: 9 - Compression Speed pts/svt-vp9-1.2.2 -tune 2 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Tuning: VMAF Optimized - Input: Bosphorus 1080p pts/x265-1.3.0 Bosphorus_1920x1080_120fps_420_8bit_YUV.y4m Video Input: Bosphorus 1080p pts/pgbench-1.10.1 -s 1 -c 1 -S Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency pts/tjbench-1.1.1 decompression-throughput Test: Decompression Throughput pts/cryptopp-1.0.1 b1 Test: Unkeyed Algorithms pts/pgbench-1.10.1 -s 1 -c 1 -S Scaling Factor: 1 - Clients: 1 - Mode: Read Only pts/tnn-1.0.0 -dt NAIVE -mp ../benchmark/benchmark-model/squeezenet_v1.1.tnnproto Target: CPU - Model: SqueezeNet v1.1 pts/nginx-1.2.2 Static Web Page Serving pts/dav1d-1.6.0 -i summer_nature_4k.ivf Video Input: Summer Nature 4K pts/graphics-magick-2.0.1 -rotate 90 Operation: Rotate pts/dav1d-1.6.0 -i chimera_8b_1080p.ivf Video Input: Chimera 1080p pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_3d --cfg=f32 --engine=cpu Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU pts/basis-1.0.2 -uastc -uastc_level 3 Settings: UASTC Level 3 pts/basis-1.0.2 -uastc -uastc_level 2 Settings: UASTC Level 2 pts/pgbench-1.10.1 -s 1 -c 50 Scaling Factor: 1 - Clients: 50 - Mode: Read Write pts/pgbench-1.10.1 -s 1 -c 50 Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency pts/compress-zstd-1.2.1 -b3 Compression Level: 3 pts/webp-1.0.0 -q 100 -lossless Encode Settings: Quality 100, Lossless pts/hint-1.0.3 FLOAT Test: FLOAT pts/ncnn-1.0.3 -1 Target: CPU - Model: alexnet pts/ncnn-1.0.3 -1 Target: CPU - Model: shufflenet-v2 pts/redis-1.3.0 -t set Test: SET pts/redis-1.3.0 -t get Test: GET pts/redis-1.3.0 -t lpush Test: LPUSH pts/onednn-1.5.0 --rnn --batch=inputs/rnn/rnn_inference --cfg=f32 --engine=cpu Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU