AMD SME Benchmark Genoa 4th Gen AMD EPYC "Genoa" Secure Memory Encryption (SME) benchmarks by Michael Larabel for a future article. AMD SME Enabled: Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1002E BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 800GB INTEL SSDPF21Q800GB, Graphics: ASPEED, Monitor: VGA HDMI, Network: Broadcom NetXtreme BCM5720 PCIe OS: Ubuntu 22.10, Kernel: 6.1.0-phx (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server 1.21.1.4, Vulkan: 1.3.224, Compiler: GCC 12.2.0 + Clang 15.0.2-1, File-System: ext4, Screen Resolution: 1920x1080 No SME: Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1002E BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 800GB INTEL SSDPF21Q800GB, Graphics: ASPEED, Monitor: VGA HDMI, Network: Broadcom NetXtreme BCM5720 PCIe OS: Ubuntu 22.10, Kernel: 6.1.0-phx (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server 1.21.1.4, Vulkan: 1.3.224, Compiler: GCC 12.2.0 + Clang 15.0.2-1, File-System: ext4, Screen Resolution: 1920x1080 QuantLib 1.21 MFLOPS > Higher Is Better AMD SME Enabled . 3061.3 |===================================================== No SME .......... 3052.8 |===================================================== High Performance Conjugate Gradient 3.1 GFLOP/s > Higher Is Better AMD SME Enabled . 87.15 |===================================================== No SME .......... 88.39 |====================================================== NAS Parallel Benchmarks 3.4 Test / Class: BT.C Total Mop/s > Higher Is Better AMD SME Enabled . 494917.44 |================================================== No SME .......... 496467.98 |================================================== NAS Parallel Benchmarks 3.4 Test / Class: EP.C Total Mop/s > Higher Is Better AMD SME Enabled . 16462.35 |=================================================== No SME .......... 16457.94 |=================================================== NAS Parallel Benchmarks 3.4 Test / Class: FT.C Total Mop/s > Higher Is Better AMD SME Enabled . 220214.75 |================================================= No SME .......... 223096.07 |================================================== NAS Parallel Benchmarks 3.4 Test / Class: SP.C Total Mop/s > Higher Is Better AMD SME Enabled . 253299.33 |================================================== No SME .......... 255564.19 |================================================== miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 GFInst/s > Higher Is Better AMD SME Enabled . 7290.64 |==================================================== No SME .......... 7281.28 |==================================================== miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 Billion Interactions/s > Higher Is Better AMD SME Enabled . 291.63 |===================================================== No SME .......... 291.25 |===================================================== miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 GFInst/s > Higher Is Better AMD SME Enabled . 8588.75 |==================================================== No SME .......... 8633.54 |==================================================== miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 Billion Interactions/s > Higher Is Better AMD SME Enabled . 343.55 |===================================================== No SME .......... 345.34 |===================================================== Rodinia 3.1 Test: OpenMP LavaMD Seconds < Lower Is Better AMD SME Enabled . 16.67 |====================================================== No SME .......... 16.51 |===================================================== Rodinia 3.1 Test: OpenMP CFD Solver Seconds < Lower Is Better AMD SME Enabled . 6.043 |====================================================== No SME .......... 5.938 |===================================================== NAMD 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better AMD SME Enabled . 0.12991 |==================================================== No SME .......... 0.12831 |=================================================== NWChem 7.0.2 Input: C240 Buckyball Seconds < Lower Is Better AMD SME Enabled . 1543.1 |===================================================== No SME .......... 1524.4 |==================================================== Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Seconds < Lower Is Better AMD SME Enabled . 4.42424568 |================================================= No SME .......... 4.37420527 |================================================ OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time Seconds < Lower Is Better AMD SME Enabled . 27.10 |====================================================== No SME .......... 25.07 |================================================== OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time Seconds < Lower Is Better AMD SME Enabled . 22.13 |====================================================== No SME .......... 22.08 |====================================================== OpenRadioss 2022.10.13 Model: Bumper Beam Seconds < Lower Is Better AMD SME Enabled . 79.97 |====================================================== No SME .......... 79.85 |====================================================== OpenRadioss 2022.10.13 Model: Cell Phone Drop Test Seconds < Lower Is Better AMD SME Enabled . 18.32 |====================================================== No SME .......... 18.45 |====================================================== OpenRadioss 2022.10.13 Model: INIVOL and Fluid Structure Interaction Drop Container Seconds < Lower Is Better AMD SME Enabled . 80.90 |====================================================== No SME .......... 80.88 |====================================================== RELION 3.1.1 Test: Basic - Device: CPU Seconds < Lower Is Better AMD SME Enabled . 130.43 |===================================================== No SME .......... 128.66 |==================================================== LULESH 2.0.3 z/s > Higher Is Better AMD SME Enabled . 57686.09 |================================================== No SME .......... 59069.41 |=================================================== Xmrig 6.18.1 Variant: Monero - Hash Count: 1M H/s > Higher Is Better AMD SME Enabled . 101932.1 |================================================= No SME .......... 105141.7 |=================================================== Xmrig 6.18.1 Variant: Wownero - Hash Count: 1M H/s > Higher Is Better AMD SME Enabled . 123484.1 |================================================== No SME .......... 126508.3 |=================================================== DaCapo Benchmark 9.12-MR1 Java Test: H2 msec < Lower Is Better AMD SME Enabled . 5050 |======================================================= No SME .......... 4807 |==================================================== Renaissance 0.14 Test: Finagle HTTP Requests ms < Lower Is Better AMD SME Enabled . 12347.5 |==================================================== No SME .......... 12286.3 |==================================================== Renaissance 0.14 Test: In-Memory Database Shootout ms < Lower Is Better AMD SME Enabled . 4838.5 |===================================================== No SME .......... 4764.6 |==================================================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better AMD SME Enabled . 49.9 |==================================================== No SME .......... 52.9 |======================================================= Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better AMD SME Enabled . 3837.3 |===================================================== No SME .......... 3825.0 |===================================================== srsRAN 22.04.1 Test: OFDM_Test Samples / Second > Higher Is Better AMD SME Enabled . 162633333 |================================================== No SME .......... 161733333 |================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM eNb Mb/s > Higher Is Better AMD SME Enabled . 408.5 |===================================================== No SME .......... 415.1 |====================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM UE Mb/s > Higher Is Better AMD SME Enabled . 157.7 |====================================================== No SME .......... 157.8 |====================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM eNb Mb/s > Higher Is Better AMD SME Enabled . 415.7 |====================================================== No SME .......... 413.9 |====================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM UE Mb/s > Higher Is Better AMD SME Enabled . 165.9 |====================================================== No SME .......... 165.8 |====================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM eNb Mb/s > Higher Is Better AMD SME Enabled . 444.8 |====================================================== No SME .......... 445.2 |====================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM UE Mb/s > Higher Is Better AMD SME Enabled . 165.7 |====================================================== No SME .......... 166.0 |====================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM eNb Mb/s > Higher Is Better AMD SME Enabled . 445.7 |====================================================== No SME .......... 444.0 |====================================================== srsRAN 22.04.1 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM UE Mb/s > Higher Is Better AMD SME Enabled . 172.2 |====================================================== No SME .......... 172.7 |====================================================== srsRAN 22.04.1 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM eNb Mb/s > Higher Is Better AMD SME Enabled . 139.1 |====================================================== No SME .......... 139.7 |====================================================== srsRAN 22.04.1 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM UE Mb/s > Higher Is Better AMD SME Enabled . 94.9 |======================================================= No SME .......... 94.4 |======================================================= AOM AV1 3.5 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K Frames Per Second > Higher Is Better AMD SME Enabled . 33.12 |==================================================== No SME .......... 34.47 |====================================================== Embree 3.13 Binary: Pathtracer ISPC - Model: Crown Frames Per Second > Higher Is Better AMD SME Enabled . 180.57 |==================================================== No SME .......... 183.25 |===================================================== Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Very Fast Frames Per Second > Higher Is Better AMD SME Enabled . 73.32 |===================================================== No SME .......... 74.31 |====================================================== Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Frames Per Second > Higher Is Better AMD SME Enabled . 76.22 |===================================================== No SME .......... 77.68 |====================================================== SVT-AV1 1.4 Encoder Mode: Preset 13 - Input: Bosphorus 4K Frames Per Second > Higher Is Better AMD SME Enabled . 251.44 |===================================================== No SME .......... 248.33 |==================================================== x264 2022-02-22 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better AMD SME Enabled . 103.07 |=================================================== No SME .......... 106.86 |===================================================== x265 3.4 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better AMD SME Enabled . 23.29 |====================================================== No SME .......... 23.48 |====================================================== ACES DGEMM 1.0 Sustained Floating-Point Rate GFLOP/s > Higher Is Better AMD SME Enabled . 70.28 |====================================================== No SME .......... 70.37 |====================================================== Intel Open Image Denoise 1.4.0 Run: RTLightmap.hdr.4096x4096 Images / Sec > Higher Is Better AMD SME Enabled . 1.66 |======================================================= No SME .......... 1.66 |======================================================= OpenVKL 1.3.1 Benchmark: vklBenchmark ISPC Items / Sec > Higher Is Better AMD SME Enabled . 1286 |====================================================== No SME .......... 1322 |======================================================= OSPRay 2.10 Benchmark: particle_volume/pathtracer/real_time Items Per Second > Higher Is Better AMD SME Enabled . 229.88 |===================================================== No SME .......... 230.62 |===================================================== OSPRay 2.10 Benchmark: gravity_spheres_volume/dim_512/ao/real_time Items Per Second > Higher Is Better AMD SME Enabled . 43.38 |===================================================== No SME .......... 43.85 |====================================================== 7-Zip Compression 22.01 Test: Compression Rating MIPS > Higher Is Better AMD SME Enabled . 885135 |=================================================== No SME .......... 917782 |===================================================== 7-Zip Compression 22.01 Test: Decompression Rating MIPS > Higher Is Better AMD SME Enabled . 1169038 |==================================================== No SME .......... 1160632 |==================================================== libavif avifenc 0.11 Encoder Speed: 2 Seconds < Lower Is Better AMD SME Enabled . 35.26 |====================================================== No SME .......... 34.69 |===================================================== libavif avifenc 0.11 Encoder Speed: 6 Seconds < Lower Is Better AMD SME Enabled . 2.469 |====================================================== No SME .......... 2.393 |==================================================== Timed Gem5 Compilation 21.2 Time To Compile Seconds < Lower Is Better AMD SME Enabled . 142.18 |===================================================== No SME .......... 138.64 |==================================================== Timed Godot Game Engine Compilation 3.2.3 Time To Compile Seconds < Lower Is Better AMD SME Enabled . 35.04 |====================================================== No SME .......... 34.14 |===================================================== Timed Linux Kernel Compilation 6.1 Build: defconfig Seconds < Lower Is Better AMD SME Enabled . 25.30 |===================================================== No SME .......... 25.71 |====================================================== Timed Linux Kernel Compilation 6.1 Build: allmodconfig Seconds < Lower Is Better AMD SME Enabled . 148.44 |===================================================== No SME .......... 146.33 |==================================================== Timed LLVM Compilation 13.0 Build System: Ninja Seconds < Lower Is Better AMD SME Enabled . 76.63 |====================================================== No SME .......... 75.33 |===================================================== Timed LLVM Compilation 13.0 Build System: Unix Makefiles Seconds < Lower Is Better AMD SME Enabled . 162.63 |===================================================== No SME .......... 160.13 |==================================================== OSPRay Studio 0.11 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer ms < Lower Is Better AMD SME Enabled . 22614 |====================================================== No SME .......... 22043 |===================================================== Liquid-DSP 2021.01.31 Threads: 256 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better AMD SME Enabled . 10344666667 |================================================ No SME .......... 10332000000 |================================================ Liquid-DSP 2021.01.31 Threads: 384 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better AMD SME Enabled . 10350000000 |================================================ No SME .......... 10346000000 |================================================ ASKAP 1.0 Test: tConvolve MPI - Degridding Mpix/sec > Higher Is Better AMD SME Enabled . 78718.7 |================================================= No SME .......... 83598.3 |==================================================== ASKAP 1.0 Test: tConvolve MPI - Gridding Mpix/sec > Higher Is Better AMD SME Enabled . 89541.8 |================================================== No SME .......... 93071.0 |==================================================== ASTC Encoder 4.0 Preset: Thorough MT/s > Higher Is Better AMD SME Enabled . 106.56 |===================================================== No SME .......... 106.42 |===================================================== ASTC Encoder 4.0 Preset: Exhaustive MT/s > Higher Is Better AMD SME Enabled . 11.84 |====================================================== No SME .......... 11.82 |====================================================== Graph500 3.0 Scale: 26 bfs median_TEPS > Higher Is Better AMD SME Enabled . 1358510000 |=============================================== No SME .......... 1426480000 |================================================= Graph500 3.0 Scale: 26 bfs max_TEPS > Higher Is Better AMD SME Enabled . 1526380000 |================================================= No SME .......... 1533180000 |================================================= Graph500 3.0 Scale: 26 sssp median_TEPS > Higher Is Better AMD SME Enabled . 572510000 |================================================ No SME .......... 593153000 |================================================== Graph500 3.0 Scale: 26 sssp max_TEPS > Higher Is Better AMD SME Enabled . 835467000 |================================================== No SME .......... 838505000 |================================================== GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better AMD SME Enabled . 18.62 |====================================================== No SME .......... 18.71 |====================================================== PostgreSQL 15 Scaling Factor: 100 - Clients: 250 - Mode: Read Only TPS > Higher Is Better AMD SME Enabled . 2970869 |==================================================== No SME .......... 2951147 |==================================================== TensorFlow 2.10 Device: CPU - Batch Size: 64 - Model: AlexNet images/sec > Higher Is Better AMD SME Enabled . 505.26 |===================================================== No SME .......... 508.40 |===================================================== KTX-Software toktx 4.0 Settings: Zstd Compression 9 Seconds < Lower Is Better AMD SME Enabled . 2.776 |====================================================== No SME .......... 2.734 |===================================================== KTX-Software toktx 4.0 Settings: Zstd Compression 19 Seconds < Lower Is Better AMD SME Enabled . 19.88 |====================================================== No SME .......... 18.86 |=================================================== Neural Magic DeepSparse 1.1 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream items/sec > Higher Is Better AMD SME Enabled . 83.64 |====================================================== No SME .......... 84.25 |====================================================== Neural Magic DeepSparse 1.1 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream ms/batch < Lower Is Better AMD SME Enabled . 1143.04 |==================================================== No SME .......... 1134.42 |==================================================== Neural Magic DeepSparse 1.1 Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream items/sec > Higher Is Better AMD SME Enabled . 745.64 |==================================================== No SME .......... 762.70 |===================================================== Neural Magic DeepSparse 1.1 Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream ms/batch < Lower Is Better AMD SME Enabled . 128.40 |===================================================== No SME .......... 125.53 |==================================================== Neural Magic DeepSparse 1.1 Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream items/sec > Higher Is Better AMD SME Enabled . 840.94 |==================================================== No SME .......... 858.47 |===================================================== Neural Magic DeepSparse 1.1 Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream ms/batch < Lower Is Better AMD SME Enabled . 113.83 |===================================================== No SME .......... 111.46 |==================================================== Neural Magic DeepSparse 1.1 Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream items/sec > Higher Is Better AMD SME Enabled . 1911.40 |=================================================== No SME .......... 1962.10 |==================================================== Neural Magic DeepSparse 1.1 Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream ms/batch < Lower Is Better AMD SME Enabled . 50.11 |====================================================== No SME .......... 48.83 |===================================================== Neural Magic DeepSparse 1.1 Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream items/sec > Higher Is Better AMD SME Enabled . 1179.00 |=================================================== No SME .......... 1204.85 |==================================================== Neural Magic DeepSparse 1.1 Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream ms/batch < Lower Is Better AMD SME Enabled . 81.23 |====================================================== No SME .......... 79.49 |===================================================== Neural Magic DeepSparse 1.1 Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream items/sec > Higher Is Better AMD SME Enabled . 601.93 |==================================================== No SME .......... 617.03 |===================================================== Neural Magic DeepSparse 1.1 Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream ms/batch < Lower Is Better AMD SME Enabled . 158.94 |===================================================== No SME .......... 155.10 |==================================================== Neural Magic DeepSparse 1.1 Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream items/sec > Higher Is Better AMD SME Enabled . 83.82 |====================================================== No SME .......... 84.28 |====================================================== Neural Magic DeepSparse 1.1 Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream ms/batch < Lower Is Better AMD SME Enabled . 1143.01 |==================================================== No SME .......... 1133.31 |==================================================== WRF 4.2.2 Input: conus 2.5km Seconds < Lower Is Better AMD SME Enabled . 4116.62 |==================================================== No SME .......... 4077.19 |==================================================== GPAW 22.1 Input: Carbon Nanotube Seconds < Lower Is Better AMD SME Enabled . 22.98 |====================================================== No SME .......... 23.17 |====================================================== Blender 3.4 Blend File: Classroom - Compute: CPU-Only Seconds < Lower Is Better AMD SME Enabled . 20.95 |====================================================== No SME .......... 20.99 |====================================================== Blender 3.4 Blend File: Barbershop - Compute: CPU-Only Seconds < Lower Is Better AMD SME Enabled . 81.57 |====================================================== No SME .......... 80.77 |===================================================== OpenVINO 2022.2.dev Model: Face Detection FP16 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 101.55 |===================================================== No SME .......... 101.90 |===================================================== OpenVINO 2022.2.dev Model: Face Detection FP16 - Device: CPU ms < Lower Is Better AMD SME Enabled . 471.53 |===================================================== No SME .......... 469.81 |===================================================== OpenVINO 2022.2.dev Model: Person Detection FP16 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 42.33 |===================================================== No SME .......... 43.29 |====================================================== OpenVINO 2022.2.dev Model: Person Detection FP16 - Device: CPU ms < Lower Is Better AMD SME Enabled . 1127.51 |==================================================== No SME .......... 1102.15 |=================================================== OpenVINO 2022.2.dev Model: Person Detection FP32 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 42.03 |===================================================== No SME .......... 42.76 |====================================================== OpenVINO 2022.2.dev Model: Person Detection FP32 - Device: CPU ms < Lower Is Better AMD SME Enabled . 1134.68 |==================================================== No SME .......... 1115.56 |=================================================== OpenVINO 2022.2.dev Model: Vehicle Detection FP16 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 7274.98 |=================================================== No SME .......... 7437.73 |==================================================== OpenVINO 2022.2.dev Model: Vehicle Detection FP16 - Device: CPU ms < Lower Is Better AMD SME Enabled . 6.59 |======================================================= No SME .......... 6.44 |====================================================== OpenVINO 2022.2.dev Model: Face Detection FP16-INT8 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 193.83 |===================================================== No SME .......... 193.95 |===================================================== OpenVINO 2022.2.dev Model: Face Detection FP16-INT8 - Device: CPU ms < Lower Is Better AMD SME Enabled . 247.23 |===================================================== No SME .......... 246.95 |===================================================== OpenVINO 2022.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 11184.76 |=================================================== No SME .......... 11180.63 |=================================================== OpenVINO 2022.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU ms < Lower Is Better AMD SME Enabled . 4.28 |======================================================= No SME .......... 4.28 |======================================================= OpenVINO 2022.2.dev Model: Weld Porosity Detection FP16 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 9990.19 |==================================================== No SME .......... 9997.68 |==================================================== OpenVINO 2022.2.dev Model: Weld Porosity Detection FP16 - Device: CPU ms < Lower Is Better AMD SME Enabled . 4.79 |======================================================= No SME .......... 4.79 |======================================================= OpenVINO 2022.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 963.14 |===================================================== No SME .......... 967.90 |===================================================== OpenVINO 2022.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU ms < Lower Is Better AMD SME Enabled . 49.78 |====================================================== No SME .......... 49.54 |====================================================== OpenVINO 2022.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 19704.84 |=================================================== No SME .......... 19801.40 |=================================================== OpenVINO 2022.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU ms < Lower Is Better AMD SME Enabled . 9.67 |======================================================= No SME .......... 9.62 |======================================================= OpenVINO 2022.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 8993.90 |==================================================== No SME .......... 9027.72 |==================================================== OpenVINO 2022.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU ms < Lower Is Better AMD SME Enabled . 5.33 |======================================================= No SME .......... 5.31 |======================================================= OpenVINO 2022.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 148736.04 |================================================= No SME .......... 150792.42 |================================================== OpenVINO 2022.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU ms < Lower Is Better AMD SME Enabled . 0.55 |======================================================= No SME .......... 0.55 |======================================================= OpenVINO 2022.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU FPS > Higher Is Better AMD SME Enabled . 167545.54 |================================================== No SME .......... 165194.42 |================================================= OpenVINO 2022.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU ms < Lower Is Better AMD SME Enabled . 0.36 |======================================================= No SME .......... 0.36 |======================================================= Xsbench 2017-07-06 Lookups/s > Higher Is Better AMD SME Enabled . 29021428 |================================================== No SME .......... 29806415 |=================================================== nginx 1.23.2 Connections: 500 Requests Per Second > Higher Is Better AMD SME Enabled . 196386.41 |================================================= No SME .......... 201056.69 |================================================== ONNX Runtime 1.11 Model: super-resolution-10 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better AMD SME Enabled . 5583 |======================================================= No SME .......... 5600 |======================================================= Appleseed 2.0 Beta Scene: Emily Seconds < Lower Is Better AMD SME Enabled . 150.92 |===================================================== No SME .......... 142.95 |================================================== PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State Seconds < Lower Is Better AMD SME Enabled . 0.934 |====================================================== No SME .......... 0.884 |=================================================== PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing Seconds < Lower Is Better AMD SME Enabled . 1.744 |====================================================== No SME .......... 1.691 |==================================================== oneDNN 3.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better AMD SME Enabled . 0.850418 |================================================== No SME .......... 0.863726 |=================================================== oneDNN 3.0 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better AMD SME Enabled . 3.92313 |==================================================== No SME .......... 3.89299 |==================================================== oneDNN 3.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU ms < Lower Is Better AMD SME Enabled . 0.526628 |=================================================== No SME .......... 0.522052 |=================================================== oneDNN 3.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU ms < Lower Is Better AMD SME Enabled . 23.14 |====================================================== No SME .......... 22.68 |===================================================== oneDNN 3.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better AMD SME Enabled . 0.918482 |=================================================== No SME .......... 0.916133 |=================================================== oneDNN 3.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU ms < Lower Is Better AMD SME Enabled . 2002.43 |==================================================== No SME .......... 2011.15 |====================================================