AMD EPYC 9754 Bergamo SMT On/Off Comparison

Benchmarks by Michael Larabel for a future article (post 19th) comparing SMT on versus off, toggled via the BIOS. Testing was done with AMD EPYC 9754 128-Core CPUs on Titanite with Ubuntu 22.04 LTS.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2307190-NE-BERGAMOSM27
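
For scripted use, the command above can be wrapped in a few lines of Python. This is only a minimal sketch: the helper names `build_benchmark_command` and `run_comparison` are hypothetical, and the only Phoronix Test Suite invocation used is the `benchmark` command quoted above with this result file's ID.

```python
import shutil
import subprocess

# Result file ID quoted in the text above.
RESULT_ID = "2307190-NE-BERGAMOSM27"


def build_benchmark_command(result_id: str) -> list[str]:
    """Return the argv list for comparing a local run against a result file."""
    return ["phoronix-test-suite", "benchmark", result_id]


def run_comparison(result_id: str) -> int:
    """Run the comparison if the Phoronix Test Suite is installed.

    Returns the process exit code, or -1 when the executable is not on PATH.
    """
    if shutil.which("phoronix-test-suite") is None:
        print("phoronix-test-suite not found on PATH")
        return -1
    # stdin/stdout are inherited so any interactive prompts remain usable.
    return subprocess.run(build_benchmark_command(result_id)).returncode
```

Calling `run_comparison(RESULT_ID)` would start the full comparison, which took between roughly 13 and 17 hours per configuration in this result file, so it is deliberately not invoked in the sketch itself.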

Test categories in this result file:

C++ Boost Tests 2 Tests
Timed Code Compilation 5 Tests
C/C++ Compiler Tests 8 Tests
CPU Massive 14 Tests
Creator Workloads 11 Tests
Cryptography 4 Tests
Database Test Suite 2 Tests
Fortran Tests 5 Tests
Game Development 5 Tests
HPC - High Performance Computing 11 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 3 Tests
Molecular Dynamics 4 Tests
MPI Benchmarks 2 Tests
Multi-Core 23 Tests
NVIDIA GPU Compute 2 Tests
Intel oneAPI 6 Tests
OpenMPI Tests 8 Tests
Programmer / Developer System Benchmarks 5 Tests
Python Tests 8 Tests
Raytracing 2 Tests
Renderers 5 Tests
Scientific Computing 4 Tests
Software Defined Radio 2 Tests
Server 3 Tests
Server CPU Tests 12 Tests


Run Management

Result Identifier        Date Run        Test Duration
EPYC 9754 1P: SMT On     July 14 2023    16 Hours, 51 Minutes
EPYC 9754 1P: SMT Off    July 13 2023    15 Hours, 43 Minutes
EPYC 9754 2P: SMT On     July 11 2023    14 Hours, 12 Minutes
EPYC 9754 2P: SMT Off    July 12 2023    13 Hours, 40 Minutes
Average                                  15 Hours, 6 Minutes
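
The 15 Hours, 6 Minutes figure that follows the four runs matches the arithmetic mean of their test durations. The sketch below reproduces that arithmetic from the durations listed above. The helper names are hypothetical, and dropping the fractional minute is an assumption about how the page formats the value. Total run time is not a performance metric in itself, so the per-socket differences are shown only as bookkeeping.

```python
import re

# Durations copied from the run list above.
DURATIONS = {
    "EPYC 9754 1P: SMT On": "16 Hours, 51 Minutes",
    "EPYC 9754 1P: SMT Off": "15 Hours, 43 Minutes",
    "EPYC 9754 2P: SMT On": "14 Hours, 12 Minutes",
    "EPYC 9754 2P: SMT Off": "13 Hours, 40 Minutes",
}


def to_minutes(text: str) -> int:
    """Convert a string such as '16 Hours, 51 Minutes' to whole minutes."""
    match = re.fullmatch(r"(\d+) Hours?, (\d+) Minutes?", text.strip())
    if match is None:
        raise ValueError(f"unrecognized duration: {text!r}")
    hours, minutes = (int(part) for part in match.groups())
    return hours * 60 + minutes


def format_minutes(total: float) -> str:
    """Format minutes as 'H Hours, M Minutes', dropping any fractional minute."""
    hours, minutes = divmod(int(total), 60)
    return f"{hours} Hours, {minutes} Minutes"


minutes = {name: to_minutes(text) for name, text in DURATIONS.items()}
mean_minutes = sum(minutes.values()) / len(minutes)

# Extra wall-clock time of the SMT On run over the SMT Off run, per socket count.
extra_1p = minutes["EPYC 9754 1P: SMT On"] - minutes["EPYC 9754 1P: SMT Off"]
extra_2p = minutes["EPYC 9754 2P: SMT On"] - minutes["EPYC 9754 2P: SMT Off"]

print(format_minutes(mean_minutes))
print(extra_1p, extra_2p)
```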



AMD EPYC 9754 Bergamo SMT On/Off Comparison Suite 1.0.0
System
Test suite extracted from AMD EPYC 9754 Bergamo SMT On/Off Comparison.

pts/specfem3d-1.0.0 layered_halfspace Model: Layered Halfspace
pts/heffte-1.0.0 c2c fftw double 512 512 512 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512
pts/libxsmm-1.0.1 128 128 128 M N K: 128
pts/specfem3d-1.0.0 waterlayered_halfspace Model: Water-layered Halfspace
pts/specfem3d-1.0.0 tomographic_model Model: Tomographic Model
pts/specfem3d-1.0.0 homogeneous_halfspace Model: Homogeneous Halfspace
pts/heffte-1.0.0 r2c fftw float 512 512 512 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512
pts/toybrot-1.2.0 raymarched/TBB/rmTBB Implementation: TBB
pts/toybrot-1.2.0 raymarched/OMP/rmOpenMP Implementation: OpenMP
pts/astcenc-1.4.0 -fast -repeats 120 Preset: Fast
pts/astcenc-1.4.0 -thorough -repeats 10 Preset: Thorough
pts/astcenc-1.4.0 -exhaustive -repeats 2 Preset: Exhaustive
pts/xmrig-1.1.0 --bench=1M Variant: Monero - Hash Count: 1M
pts/xmrig-1.1.0 -a rx/wow --bench=1M Variant: Wownero - Hash Count: 1M
pts/graph500-1.0.1 26 Scale: 26
pts/heffte-1.0.0 c2c fftw float 512 512 512 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512
pts/libxsmm-1.0.1 256 256 256 M N K: 256
pts/minibude-1.0.0 --deck ../data/bm1 --iterations 500 Implementation: OpenMP - Input Deck: BM1
pts/specfem3d-1.0.0 Mount_StHelens Model: Mount St. Helens
pts/minibude-1.0.0 --deck ../data/bm2 --iterations 10 Implementation: OpenMP - Input Deck: BM2
pts/nekrs-1.1.0 kershaw kershaw.par Input: Kershaw
pts/nekrs-1.1.0 turbPipePeriodic turbPipe.par Input: TurboPipe Periodic
pts/tensorflow-2.1.0 --device cpu --batch_size=256 --model=alexnet Device: CPU - Batch Size: 256 - Model: AlexNet
pts/tensorflow-2.1.0 --device cpu --batch_size=512 --model=alexnet Device: CPU - Batch Size: 512 - Model: AlexNet
pts/tensorflow-2.1.0 --device cpu --batch_size=256 --model=googlenet Device: CPU - Batch Size: 256 - Model: GoogLeNet
pts/tensorflow-2.1.0 --device cpu --batch_size=256 --model=resnet50 Device: CPU - Batch Size: 256 - Model: ResNet-50
pts/tensorflow-2.1.0 --device cpu --batch_size=512 --model=googlenet Device: CPU - Batch Size: 512 - Model: GoogLeNet
pts/tensorflow-2.1.0 --device cpu --batch_size=512 --model=resnet50 Device: CPU - Batch Size: 512 - Model: ResNet-50
pts/cloverleaf-1.1.0 Lagrangian-Eulerian Hydrodynamics
pts/deepsparse-1.5.0 zoo:nlp/document_classification/obert-base/pytorch/huggingface/imdb/base-none --scenario async Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream
pts/helsing-1.0.2 10000000000000 99999999999999 Digit Range: 14 digit
pts/deepsparse-1.5.0 zoo:nlp/sentiment_analysis/bert-base/pytorch/huggingface/sst2/12layer_pruned90-none --scenario async Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream
pts/deepsparse-1.5.0 zoo:nlp/question_answering/bert-base/pytorch/huggingface/squad/12layer_pruned90-none --scenario async Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream
pts/deepsparse-1.5.0 zoo:cv/detection/yolov5-s/pytorch/ultralytics/coco/base-none --scenario async Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream
pts/deepsparse-1.5.0 zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/base-none --scenario async Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream
pts/deepsparse-1.5.0 zoo:nlp/text_classification/distilbert-none/pytorch/huggingface/mnli/base-none --scenario async Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream
pts/deepsparse-1.5.0 zoo:cv/segmentation/yolact-darknet53/pytorch/dbolya/coco/pruned90-none --scenario async Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream
pts/deepsparse-1.5.0 zoo:nlp/text_classification/bert-base/pytorch/huggingface/sst2/base-none --scenario async Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream
pts/deepsparse-1.5.0 zoo:nlp/token_classification/bert-base/pytorch/huggingface/conll2003/base-none --scenario async Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream
pts/heffte-1.0.0 r2c fftw double 512 512 512 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512
pts/npb-1.4.5 bt.C Test / Class: BT.C
pts/npb-1.4.5 cg.C Test / Class: CG.C
pts/npb-1.4.5 ep.D Test / Class: EP.D
pts/npb-1.4.5 ft.C Test / Class: FT.C
pts/npb-1.4.5 is.D Test / Class: IS.D
pts/npb-1.4.5 lu.C Test / Class: LU.C
pts/npb-1.4.5 mg.C Test / Class: MG.C
pts/npb-1.4.5 sp.B Test / Class: SP.B
pts/npb-1.4.5 sp.C Test / Class: SP.C
pts/namd-1.2.1 ATPase Simulation - 327,506 Atoms
pts/openvino-1.2.0 -m models/intel/face-detection-0206/FP16/face-detection-0206.xml -d CPU Model: Face Detection FP16 - Device: CPU
pts/openvino-1.2.0 -m models/intel/person-detection-0106/FP16/person-detection-0106.xml -d CPU Model: Person Detection FP16 - Device: CPU
pts/openvino-1.2.0 -m models/intel/person-detection-0106/FP32/person-detection-0106.xml -d CPU Model: Person Detection FP32 - Device: CPU
pts/openvino-1.2.0 -m models/intel/vehicle-detection-0202/FP16/vehicle-detection-0202.xml -d CPU Model: Vehicle Detection FP16 - Device: CPU
pts/openvino-1.2.0 -m models/intel/face-detection-0206/FP16-INT8/face-detection-0206.xml -d CPU Model: Face Detection FP16-INT8 - Device: CPU
pts/openvino-1.2.0 -m models/intel/vehicle-detection-0202/FP16-INT8/vehicle-detection-0202.xml -d CPU Model: Vehicle Detection FP16-INT8 - Device: CPU
pts/openvino-1.2.0 -m models/intel/weld-porosity-detection-0001/FP16/weld-porosity-detection-0001.xml -d CPU Model: Weld Porosity Detection FP16 - Device: CPU
pts/openvino-1.2.0 -m models/intel/machine-translation-nar-en-de-0002/FP16/machine-translation-nar-en-de-0002.xml -d CPU Model: Machine Translation EN To DE FP16 - Device: CPU
pts/openvino-1.2.0 -m models/intel/weld-porosity-detection-0001/FP16-INT8/weld-porosity-detection-0001.xml -d CPU Model: Weld Porosity Detection FP16-INT8 - Device: CPU
pts/openvino-1.2.0 -m models/intel/person-vehicle-bike-detection-2004/FP16/person-vehicle-bike-detection-2004.xml -d CPU Model: Person Vehicle Bike Detection FP16 - Device: CPU
pts/openvino-1.2.0 -m models/intel/age-gender-recognition-retail-0013/FP16/age-gender-recognition-retail-0013.xml -d CPU Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
pts/openvino-1.2.0 -m models/intel/age-gender-recognition-retail-0013/FP16-INT8/age-gender-recognition-retail-0013.xml -d CPU Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
pts/minife-1.0.0 --nx 264 --ny 256 --nz 256 Problem Size: Small
pts/cp2k-1.4.1 -i benchmarks/QS_DM_LS/H2O-dft-ls.inp Input: H2O-DFT-LS
pts/aircrack-ng-1.3.0
pts/primesieve-1.9.0 1e12 Length: 1e12
pts/primesieve-1.9.0 1e13 Length: 1e13
pts/stockfish-1.4.0 Total Time
pts/compress-7zip-1.10.0 Test: Compression Rating
pts/compress-7zip-1.10.0 Test: Decompression Rating
pts/john-the-ripper-1.8.0 --format=bcrypt Test: bcrypt
pts/john-the-ripper-1.8.0 --format=wpapsk Test: WPA PSK
pts/john-the-ripper-1.8.0 --format=bcrypt Test: Blowfish
pts/john-the-ripper-1.8.0 --format=md5crypt Test: MD5
pts/build-llvm-1.5.0 Ninja Build System: Ninja
pts/build-llvm-1.5.0 Build System: Unix Makefiles
pts/build-linux-kernel-1.15.0 defconfig Build: defconfig
pts/build-linux-kernel-1.15.0 allmodconfig Build: allmodconfig
pts/blender-3.6.0 -b ../bmw27_gpu.blend -o output.test -x 1 -F JPEG -f 1 -- --cycles-device CPU Blend File: BMW27 - Compute: CPU-Only
pts/blender-3.6.0 -b ../classroom_gpu.blend -o output.test -x 1 -F JPEG -f 1 -- --cycles-device CPU Blend File: Classroom - Compute: CPU-Only
pts/blender-3.6.0 -b ../fishy_cat_gpu.blend -o output.test -x 1 -F JPEG -f 1 -- --cycles-device CPU Blend File: Fishy Cat - Compute: CPU-Only
pts/blender-3.6.0 -b ../barbershop_interior_gpu.blend -o output.test -x 1 -F JPEG -f 1 -- --cycles-device CPU Blend File: Barbershop - Compute: CPU-Only
pts/blender-3.6.0 -b ../pavillon_barcelone_gpu.blend -o output.test -x 1 -F JPEG -f 1 -- --cycles-device CPU Blend File: Pabellon Barcelona - Compute: CPU-Only
pts/build-godot-4.0.0 Time To Compile
pts/embree-1.5.0 pathtracer_ispc -c crown/crown.ecs Binary: Pathtracer ISPC - Model: Crown
pts/embree-1.5.0 pathtracer_ispc -c asian_dragon/asian_dragon.ecs Binary: Pathtracer ISPC - Model: Asian Dragon
pts/oidn-2.0.0 -r RT.hdr_alb_nrm.3840x2160 -d cpu Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only
pts/oidn-2.0.0 -r RT.ldr_alb_nrm.3840x2160 -d cpu Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only
pts/oidn-2.0.0 -r RTLightmap.hdr.4096x4096 -d cpu Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only
pts/openvkl-1.3.0 vklBenchmark --benchmark_filter=ispc Benchmark: vklBenchmark ISPC
pts/luxcorerender-1.4.0 DLSC/LuxCoreScene/render.cfg -D renderengine.type PATHCPU Scene: DLSC - Acceleration: CPU
pts/luxcorerender-1.4.0 OrangeJuice/LuxCoreScene/render.cfg -D renderengine.type PATHCPU Scene: Orange Juice - Acceleration: CPU
pts/luxcorerender-1.4.0 LuxCore2.1Benchmark/LuxCoreScene/render.cfg -D renderengine.type PATHCPU Scene: LuxCore Benchmark - Acceleration: CPU
pts/luxcorerender-1.4.0 RainbowColorsAndPrism/LuxCoreScene/render.cfg -D renderengine.type PATHCPU Scene: Rainbow Colors and Prism - Acceleration: CPU
pts/ospray-2.12.0 --benchmark_filter=particle_volume/ao/real_time Benchmark: particle_volume/ao/real_time
pts/ospray-2.12.0 --benchmark_filter=particle_volume/scivis/real_time Benchmark: particle_volume/scivis/real_time
pts/ospray-2.12.0 --benchmark_filter=gravity_spheres_volume/dim_512/ao/real_time Benchmark: gravity_spheres_volume/dim_512/ao/real_time
pts/ospray-2.12.0 --benchmark_filter=gravity_spheres_volume/dim_512/scivis/real_time Benchmark: gravity_spheres_volume/dim_512/scivis/real_time
pts/ospray-studio-1.1.0 --cameras 1 1 --resolution 3840 2160 --spp 1 --renderer pathtracer Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 2 2 --resolution 3840 2160 --spp 1 --renderer pathtracer Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 3 3 --resolution 3840 2160 --spp 1 --renderer pathtracer Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 1 1 --resolution 3840 2160 --spp 16 --renderer pathtracer Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 1 1 --resolution 3840 2160 --spp 32 --renderer pathtracer Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 2 2 --resolution 3840 2160 --spp 16 --renderer pathtracer Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 2 2 --resolution 3840 2160 --spp 32 --renderer pathtracer Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 3 3 --resolution 3840 2160 --spp 16 --renderer pathtracer Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer
pts/ospray-studio-1.1.0 --cameras 3 3 --resolution 3840 2160 --spp 32 --renderer pathtracer Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer
pts/appleseed-1.0.1 emily.appleseed Scene: Emily
pts/appleseed-1.0.1 disney_material_1.appleseed Scene: Disney Material
pts/appleseed-1.0.1 material_tester_ambient_occlusion.appleseed Scene: Material Tester
pts/build-gem5-1.0.0 Time To Compile
pts/build-nodejs-1.3.0 Time To Compile
pts/liquid-dsp-1.6.0 -n 256 -b 256 -f 512 Threads: 256 - Buffer Length: 256 - Filter Length: 512
pts/liquid-dsp-1.6.0 -n 512 -b 256 -f 512 Threads: 512 - Buffer Length: 256 - Filter Length: 512
pts/srsran-2.1.0 tests/benchmarks/phy/upper/channel_processors/pusch_processor_benchmark -m throughput_total -R 100 -P pusch_scs15_50MHz_256qam_max Test: PUSCH Processor Benchmark, Throughput Total
pts/openssl-3.1.0 sha256 Algorithm: SHA256
pts/openssl-3.1.0 sha512 Algorithm: SHA512
pts/openssl-3.1.0 rsa4096 Algorithm: RSA4096
pts/openssl-3.1.0 -evp chacha20 Algorithm: ChaCha20
pts/openssl-3.1.0 -evp aes-128-gcm Algorithm: AES-128-GCM
pts/openssl-3.1.0 -evp aes-256-gcm Algorithm: AES-256-GCM
pts/openssl-3.1.0 -evp chacha20-poly1305 Algorithm: ChaCha20-Poly1305
pts/pgbench-1.13.0 -s 1000 -c 800 -S Scaling Factor: 1000 - Clients: 800 - Mode: Read Only
pts/pgbench-1.13.0 -s 1000 -c 800 -S Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average Latency
pts/mysqlslap-1.4.0 --concurrency=2048 Clients: 2048
pts/mysqlslap-1.4.0 --concurrency=4096 Clients: 4096