EPYC 7502 AOCC 2.3 Compiler Comparison

AMD EPYC 7502 testing of various benchmarks under AMD AOCC 2.3, GCC 10.2, LLVM Clang 11. CFLAGS/CXXFLAGS of "-O3 -march=znver2" throughout. Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012080-HA-EPYC7502A97
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

AV1 2 Tests
Chess Test Suite 2 Tests
C/C++ Compiler Tests 21 Tests
Compression Tests 2 Tests
CPU Massive 19 Tests
Creator Workloads 17 Tests
Cryptography 2 Tests
Database Test Suite 3 Tests
Encoding 7 Tests
Game Development 2 Tests
HPC - High Performance Computing 6 Tests
Imaging 4 Tests
Common Kernel Benchmarks 3 Tests
Machine Learning 4 Tests
Multi-Core 13 Tests
Programmer / Developer System Benchmarks 2 Tests
Renderers 2 Tests
Server 5 Tests
Server CPU Tests 12 Tests
Single-Threaded 6 Tests
Texture Compression 2 Tests
Video Encoding 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GCC 10.2
December 07 2020
  3 Hours, 49 Minutes
LLVM Clang 11
December 08 2020
  6 Hours, 56 Minutes
AMD AOCC 2.3
December 07 2020
  3 Hours, 58 Minutes
Invert Hiding All Results Option
  4 Hours, 54 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


EPYC 7502 AOCC 2.3 Compiler Comparison Suite 1.0.0 System Test suite extracted from EPYC 7502 AOCC 2.3 Compiler Comparison. pts/aobench-1.0.1 Size: 2048 x 2048 - Total Time pts/astcenc-1.0.2 -medium Preset: Medium pts/astcenc-1.0.2 -thorough Preset: Thorough pts/astcenc-1.0.2 -exhaustive Preset: Exhaustive pts/basis-1.0.2 -uastc -uastc_level 2 Settings: UASTC Level 2 pts/basis-1.0.2 -uastc -uastc_level 3 Settings: UASTC Level 3 pts/basis-1.0.2 -uastc -uastc_level 2 -uastc_rdo_q .75 Settings: UASTC Level 2 + RDO Post-Processing pts/c-ray-1.2.0 Total Time - 4K, 16 Rays Per Pixel pts/cryptopp-1.0.1 b1 Test: Unkeyed Algorithms pts/daphne-1.0.0 OpenMP ndt_mapping Backend: OpenMP - Kernel: NDT Mapping pts/daphne-1.0.0 OpenMP points2image Backend: OpenMP - Kernel: Points2Image pts/daphne-1.0.0 OpenMP euclidean_cluster Backend: OpenMP - Kernel: Euclidean Cluster pts/dav1d-1.6.0 -i chimera_8b_1080p.ivf Video Input: Chimera 1080p pts/dav1d-1.6.0 -i summer_nature_4k.ivf Video Input: Summer Nature 4K pts/dav1d-1.6.0 -i summer_nature_1080p.ivf Video Input: Summer Nature 1080p pts/dav1d-1.6.0 -i chimera_10b_1080p.ivf Video Input: Chimera 1080p 10-bit pts/graphics-magick-2.0.1 -swirl 90 Operation: Swirl pts/graphics-magick-2.0.1 -rotate 90 Operation: Rotate pts/graphics-magick-2.0.1 -sharpen 0x2.0 Operation: Sharpen pts/graphics-magick-2.0.1 -enhance Operation: Enhanced pts/graphics-magick-2.0.1 -resize 50% Operation: Resizing pts/hint-1.0.3 FLOAT Test: FLOAT pts/encode-mp3-1.7.4 WAV To MP3 pts/tjbench-1.1.1 decompression-throughput Test: Decompression Throughput pts/libraw-1.0.0 Post-Processing Benchmark pts/compress-lz4-1.0.0 -b1 -e1 Compression Level: 1 - Compression Speed pts/compress-lz4-1.0.0 -b3 -e3 Compression Level: 3 - Compression Speed pts/compress-lz4-1.0.0 -b9 -e9 Compression Level: 9 - Compression Speed pts/ncnn-1.0.3 -1 Target: CPU - Model: squeezenet pts/ncnn-1.0.3 -1 Target: CPU - Model: mobilenet pts/ncnn-1.0.3 -1 Target: CPU-v2-v2 - Model: mobilenet-v2 pts/ncnn-1.0.3 -1 Target: CPU-v3-v3 - Model: mobilenet-v3 pts/ncnn-1.0.3 -1 Target: CPU - Model: shufflenet-v2 pts/ncnn-1.0.3 -1 Target: CPU - Model: mnasnet pts/ncnn-1.0.3 -1 Target: CPU - Model: efficientnet-b0 pts/ncnn-1.0.3 -1 Target: CPU - Model: blazeface pts/ncnn-1.0.3 -1 Target: CPU - Model: googlenet pts/ncnn-1.0.3 -1 Target: CPU - Model: vgg16 pts/ncnn-1.0.3 -1 Target: CPU - Model: resnet18 pts/ncnn-1.0.3 -1 Target: CPU - Model: alexnet pts/ncnn-1.0.3 -1 Target: CPU - Model: resnet50 pts/ncnn-1.0.3 -1 Target: CPU - Model: yolov4-tiny pts/nginx-1.2.2 Static Web Page Serving pts/onednn-1.5.0 --ip --batch=inputs/ip/ip_1d --cfg=f32 --engine=cpu Harness: IP Batch 1D - Data Type: f32 - Engine: CPU pts/onednn-1.5.0 --ip --batch=inputs/ip/ip_1d --cfg=u8s8f32 --engine=cpu Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU pts/onednn-1.5.0 --ip --batch=inputs/ip/ip_all --cfg=u8s8f32 --engine=cpu Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_1d --cfg=f32 --engine=cpu Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_3d --cfg=f32 --engine=cpu Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_1d --cfg=u8s8f32 --engine=cpu Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU pts/onednn-1.5.0 --deconv --batch=inputs/deconv/deconv_3d --cfg=u8s8f32 --engine=cpu Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU pts/onednn-1.5.0 --rnn --batch=inputs/rnn/rnn_training --cfg=f32 --engine=cpu Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU pts/onednn-1.5.0 --rnn --batch=inputs/rnn/rnn_inference --cfg=f32 --engine=cpu Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU pts/onednn-1.5.0 --matmul --batch=inputs/matmul/shapes_transformer --cfg=f32 --engine=cpu Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU pts/openssl-1.11.0 RSA 4096-bit Performance pts/pgbench-1.10.1 -s 1 -c 1 -S Scaling Factor: 1 - Clients: 1 - Mode: Read Only pts/pgbench-1.10.1 -s 1 -c 1 -S Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency pts/pgbench-1.10.1 -s 1 -c 1 Scaling Factor: 1 - Clients: 1 - Mode: Read Write pts/pgbench-1.10.1 -s 1 -c 1 Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency pts/pgbench-1.10.1 -s 1 -c 50 -S Scaling Factor: 1 - Clients: 50 - Mode: Read Only pts/pgbench-1.10.1 -s 1 -c 50 -S Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency pts/pgbench-1.10.1 -s 1 -c 50 Scaling Factor: 1 - Clients: 50 - Mode: Read Write pts/pgbench-1.10.1 -s 1 -c 50 Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency pts/redis-1.3.0 -t lpush Test: LPUSH pts/redis-1.3.0 -t get Test: GET pts/redis-1.3.0 -t set Test: SET pts/rnnoise-1.0.2 pts/scimark2-1.3.2 TEST_COMPOSITE Computational Test: Composite pts/sqlite-speedtest-1.0.0 Timed Time - Size 1,000 pts/stockfish-1.2.0 Total Time pts/svt-av1-2.2.1 -enc-mode 0 -n 20 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Encoder Mode: Enc Mode 0 - Input: 1080p pts/svt-av1-2.2.1 -enc-mode 4 -n 80 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Encoder Mode: Enc Mode 4 - Input: 1080p pts/svt-av1-2.2.1 -enc-mode 8 -n 320 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Encoder Mode: Enc Mode 8 - Input: 1080p pts/svt-vp9-1.2.2 -tune 2 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Tuning: VMAF Optimized - Input: Bosphorus 1080p pts/svt-vp9-1.2.2 -tune 1 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p pts/svt-vp9-1.2.2 -tune 0 -i Bosphorus_1920x1080_120fps_420_8bit_YUV.yuv -w 1920 -h 1080 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p pts/mrbayes-1.4.0 Primate Phylogeny Analysis pts/tnn-1.0.0 -dt NAIVE -mp ../benchmark/benchmark-model/mobilenet_v2.tnnproto Target: CPU - Model: MobileNet v2 pts/tnn-1.0.0 -dt NAIVE -mp ../benchmark/benchmark-model/squeezenet_v1.1.tnnproto Target: CPU - Model: SqueezeNet v1.1 pts/tscp-1.2.2 AI Chess Performance pts/vpxenc-3.0.0 --cpu-used=0 Speed: Speed 0 pts/vpxenc-3.0.0 --cpu-used=5 Speed: Speed 5 pts/webp-1.0.0 -q 100 -lossless Encode Settings: Quality 100, Lossless pts/webp-1.0.0 -q 100 -m 6 Encode Settings: Quality 100, Highest Compression pts/webp-1.0.0 -q 100 -lossless -m 6 Encode Settings: Quality 100, Lossless, Highest Compression pts/x264-2.6.1 H.264 Video Encoding pts/x265-1.3.0 Bosphorus_3840x2160.y4m Video Input: Bosphorus 4K pts/x265-1.3.0 Bosphorus_1920x1080_120fps_420_8bit_YUV.y4m Video Input: Bosphorus 1080p pts/compress-zstd-1.2.1 -b3 Compression Level: 3 pts/compress-zstd-1.2.1 -b19 Compression Level: 19