Paulls 75F22EPYC

2 x AMD EPYC 7F52 16-Core testing with a Supermicro H11DSU-iN (2.1b BIOS) and llvmpipe 504GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2006303-NI-PAULLS75F77&grs.

Paulls 75F22EPYCProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution2 x AMD EPYC 7F52 16-Core2 x AMD EPYC 7F52 16-Core @ 3.50GHz (32 Cores / 64 Threads)Supermicro H11DSU-iN (2.1b BIOS)AMD Starship/Matisse504GB2 x 3841GB Micron_9200_MTFDHAL3T8TCTllvmpipe 504GB4 x Intel I350Ubuntu 20.045.4.0-39-generic (x86_64)GNOME Shell 3.36.2X Server 1.20.8modesetting 1.20.83.3 Mesa 20.0.4 (LLVM 9.0.1 128 bits)GCC 9.3.0ext41024x768OpenBenchmarking.org- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301038- Python 3.8.2- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Paulls 75F22EPYCmlpack: scikit_linearridgeregressionpybench: Total For Average Test Timesblender: Pabellon Barcelona - CPU-Onlyblender: Pabellon Barcelona - OpenCLblender: Barbershop - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Classroom - CPU-Onlyblender: Barbershop - OpenCLblender: Fishy Cat - OpenCLblender: Classroom - OpenCLblender: BMW27 - CPU-Onlyblender: BMW27 - OpenCLsysbench: CPUredis: SETredis: LPUSHredis: SADDredis: LPOPgromacs: Water Benchmarkopenssl: RSA 4096-bit Performancey-cruncher: Calculating 500M Pi Digitsc-ray: Total Time - 4K, 16 Rays Per Pixelbuild-llvm: Time To Compilestockfish: Total Timesvt-av1: Enc Mode 8 - 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 0 - 1080ponednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Deconvolution Batch deconv_3d - u8s8f32 - CPUonednn: Deconvolution Batch deconv_1d - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: IP Batch All - u8s8f32 - CPUonednn: IP Batch 1D - u8s8f32 - CPUbyte: Floating-Point Arithmeticbyte: Register Arithmeticbyte: Integer Arithmeticbyte: Dhrystone 2fftw: Float + SSE - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 2D FFT Size 512fftw: Float + SSE - 2D FFT Size 256fftw: Float + SSE - 2D FFT Size 128fftw: Float + SSE - 1D FFT Size 512fftw: Float + SSE - 1D FFT Size 256fftw: Float + SSE - 1D FFT Size 128fftw: Float + SSE - 2D FFT Size 64fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 64fftw: Float + SSE - 1D FFT Size 32fftw: Stock - 2D FFT Size 4096fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 1024fftw: Stock - 1D FFT Size 4096fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 1024fftw: Stock - 2D FFT Size 512fftw: Stock - 2D FFT Size 256fftw: Stock - 2D FFT Size 128fftw: Stock - 1D FFT Size 512fftw: Stock - 1D FFT Size 256fftw: Stock - 1D FFT Size 128fftw: Stock - 2D FFT Size 64fftw: Stock - 2D FFT Size 32fftw: Stock - 1D FFT Size 32namd: ATPase Simulation - 327,506 Atomsnpb: SP.Bnpb: MG.Cnpb: LU.Cnpb: IS.Dnpb: FT.Cnpb: EP.Dnpb: EP.Cnpb: CG.Cnpb: BT.Cstream: Scalestream: Copymlpack: scikit_svmsysbench: Memoryredis: GETonednn: Recurrent Neural Network Training - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch deconv_3d - f32 - CPUonednn: Deconvolution Batch deconv_1d - f32 - CPUonednn: IP Batch All - f32 - CPUonednn: IP Batch 1D - f32 - CPUfftw: Stock - 1D FFT Size 64stream: Addstream: Triad2 x AMD EPYC 7F52 16-Core198.811002152.99960.84209.4571.48115.78483.18933.63200.1145.84356.7265236.39141519964.761394811.131718711.202092018.774.4379124.710.02116.485220.7018293637059.5827.5310.1170.9160510.549263143.3161.687781.989691.0882311.53841.3804911140624001.0173552956939188509075381351617384553955741321466313566024198418054112519007138315999.56892.97138.87891.58107.78394.57787.37761.38026.48184.48110.97681.48611.510058.7101230.55244104682.91101663.85149930.032314.9576020.422821.162798.0546663.14121161.43184946.9188315.623.852185353.72452015112.33406.1175.599193.553892.3591915.02591.568998591.4193801.4195537.9OpenBenchmarking.org

Mlpack Benchmark

Benchmark: scikit_linearridgeregression

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregression2 x AMD EPYC 7F52 16-Core4080120160200SE +/- 0.10, N = 3198.81

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test Times2 x AMD EPYC 7F52 16-Core2004006008001000SE +/- 0.67, N = 31002

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core306090120150SE +/- 0.39, N = 3152.99

Blender

Blend File: Pabellon Barcelona - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: OpenCL2 x AMD EPYC 7F52 16-Core2004006008001000SE +/- 3.04, N = 3960.84

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core50100150200250SE +/- 0.17, N = 3209.45

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core1632486480SE +/- 0.26, N = 371.48

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core306090120150SE +/- 0.14, N = 3115.78

Blender

Blend File: Barbershop - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: OpenCL2 x AMD EPYC 7F52 16-Core100200300400500SE +/- 2.98, N = 3483.18

Blender

Blend File: Fishy Cat - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: OpenCL2 x AMD EPYC 7F52 16-Core2004006008001000SE +/- 8.38, N = 3933.63

Blender

Blend File: Classroom - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: OpenCL2 x AMD EPYC 7F52 16-Core4080120160200SE +/- 1.39, N = 3200.11

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CPU-Only2 x AMD EPYC 7F52 16-Core1020304050SE +/- 0.08, N = 345.84

Blender

Blend File: BMW27 - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: OpenCL2 x AMD EPYC 7F52 16-Core80160240320400SE +/- 1.39, N = 3356.72

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: CPU2 x AMD EPYC 7F52 16-Core14K28K42K56K70KSE +/- 26.79, N = 365236.391. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SET2 x AMD EPYC 7F52 16-Core300K600K900K1200K1500KSE +/- 15547.88, N = 151519964.761. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPUSH2 x AMD EPYC 7F52 16-Core300K600K900K1200K1500KSE +/- 8826.56, N = 31394811.131. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: SADD2 x AMD EPYC 7F52 16-Core400K800K1200K1600K2000KSE +/- 24252.72, N = 151718711.201. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: LPOP2 x AMD EPYC 7F52 16-Core400K800K1200K1600K2000KSE +/- 30406.66, N = 152092018.771. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.1Water Benchmark2 x AMD EPYC 7F52 16-Core0.99831.99662.99493.99324.9915SE +/- 0.007, N = 34.4371. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance2 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 6.76, N = 39124.71. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Y-Cruncher

Calculating 500M Pi Digits

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.7.8.9503Calculating 500M Pi Digits2 x AMD EPYC 7F52 16-Core3691215SE +/- 0.03, N = 310.02

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel2 x AMD EPYC 7F52 16-Core48121620SE +/- 0.03, N = 316.491. (CC) gcc options: -lm -lpthread -O3

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To Compile2 x AMD EPYC 7F52 16-Core50100150200250SE +/- 1.28, N = 3220.70

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time2 x AMD EPYC 7F52 16-Core20M40M60M80M100MSE +/- 567137.28, N = 3829363701. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080p2 x AMD EPYC 7F52 16-Core1326395265SE +/- 0.42, N = 359.581. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080p2 x AMD EPYC 7F52 16-Core246810SE +/- 0.020, N = 37.5311. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080p2 x AMD EPYC 7F52 16-Core0.02630.05260.07890.10520.1315SE +/- 0.000, N = 30.1171. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.20610.41220.61830.82441.0305SE +/- 0.001826, N = 30.916051MIN: 0.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.12360.24720.37080.49440.618SE +/- 0.006347, N = 150.549263MIN: 0.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core306090120150SE +/- 1.99, N = 15143.32MIN: 119.041. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.37980.75961.13941.51921.899SE +/- 0.02266, N = 41.68778MIN: 1.561. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.44770.89541.34311.79082.2385SE +/- 0.01691, N = 31.98969MIN: 1.831. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.24490.48980.73470.97961.2245SE +/- 0.01141, N = 151.08823MIN: 0.981. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core3691215SE +/- 0.12, N = 311.54MIN: 10.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.31060.62120.93181.24241.553SE +/- 0.02321, N = 31.38049MIN: 1.171. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

BYTE Unix Benchmark

Computational Test: Floating-Point Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Floating-Point Arithmetic2 x AMD EPYC 7F52 16-Core0.2250.450.6750.91.1251

BYTE Unix Benchmark

Computational Test: Register Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Register Arithmetic2 x AMD EPYC 7F52 16-Core0.2250.450.6750.91.1251

BYTE Unix Benchmark

Computational Test: Integer Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Integer Arithmetic2 x AMD EPYC 7F52 16-Core0.2250.450.6750.91.1251

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 22 x AMD EPYC 7F52 16-Core9M18M27M36M45MSE +/- 500579.02, N = 540624001.0

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 40962 x AMD EPYC 7F52 16-Core4K8K12K16K20KSE +/- 92.09, N = 3173551. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 20482 x AMD EPYC 7F52 16-Core6K12K18K24K30KSE +/- 474.34, N = 3295691. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 10242 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 335.60, N = 3391881. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 40962 x AMD EPYC 7F52 16-Core11K22K33K44K55KSE +/- 498.15, N = 3509071. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 20482 x AMD EPYC 7F52 16-Core12K24K36K48K60KSE +/- 233.25, N = 3538131. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 10242 x AMD EPYC 7F52 16-Core11K22K33K44K55KSE +/- 489.91, N = 3516171. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 5122 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 224.91, N = 3384551. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2562 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 171.12, N = 3395571. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1282 x AMD EPYC 7F52 16-Core9K18K27K36K45KSE +/- 212.66, N = 3413211. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 5122 x AMD EPYC 7F52 16-Core10K20K30K40K50KSE +/- 90.47, N = 3466311. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2562 x AMD EPYC 7F52 16-Core8K16K24K32K40KSE +/- 501.91, N = 4356601. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1282 x AMD EPYC 7F52 16-Core5K10K15K20K25KSE +/- 73.66, N = 3241981. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 642 x AMD EPYC 7F52 16-Core9K18K27K36K45KSE +/- 155.72, N = 3418051. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 322 x AMD EPYC 7F52 16-Core9K18K27K36K45KSE +/- 38.83, N = 3411251. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 642 x AMD EPYC 7F52 16-Core4K8K12K16K20KSE +/- 17.15, N = 3190071. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 322 x AMD EPYC 7F52 16-Core3K6K9K12K15KSE +/- 135.13, N = 3138311. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 40962 x AMD EPYC 7F52 16-Core13002600390052006500SE +/- 16.61, N = 35999.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 20482 x AMD EPYC 7F52 16-Core15003000450060007500SE +/- 14.17, N = 36892.91. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 10242 x AMD EPYC 7F52 16-Core15003000450060007500SE +/- 17.39, N = 37138.81. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 40962 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 12.07, N = 37891.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 20482 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 25.90, N = 38107.71. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 10242 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 7.11, N = 38394.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 5122 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 13.54, N = 37787.31. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2562 x AMD EPYC 7F52 16-Core17003400510068008500SE +/- 6.04, N = 37761.31. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1282 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 3.29, N = 38026.41. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 512

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 5122 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 25.56, N = 38184.41. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 256

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2562 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 4.62, N = 38110.91. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 128

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1282 x AMD EPYC 7F52 16-Core16003200480064008000SE +/- 9.56, N = 37681.41. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 642 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 23.70, N = 38611.51. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 322 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 73.69, N = 310058.71. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 322 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 9.74, N = 3101231. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13ATPase Simulation - 327,506 Atoms2 x AMD EPYC 7F52 16-Core0.12430.24860.37290.49720.6215SE +/- 0.00098, N = 30.55244

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B2 x AMD EPYC 7F52 16-Core20K40K60K80K100KSE +/- 1531.31, N = 3104682.911. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C2 x AMD EPYC 7F52 16-Core20K40K60K80K100KSE +/- 993.32, N = 3101663.851. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C2 x AMD EPYC 7F52 16-Core30K60K90K120K150KSE +/- 99.80, N = 3149930.031. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D2 x AMD EPYC 7F52 16-Core5001000150020002500SE +/- 5.93, N = 32314.951. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C2 x AMD EPYC 7F52 16-Core16K32K48K64K80KSE +/- 974.27, N = 1576020.421. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D2 x AMD EPYC 7F52 16-Core6001200180024003000SE +/- 0.18, N = 32821.161. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C2 x AMD EPYC 7F52 16-Core6001200180024003000SE +/- 6.58, N = 32798.051. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C2 x AMD EPYC 7F52 16-Core10K20K30K40K50KSE +/- 76.79, N = 346663.141. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C2 x AMD EPYC 7F52 16-Core30K60K90K120K150KSE +/- 343.45, N = 3121161.431. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scale2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 1653.21, N = 5184946.91. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copy2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 1993.40, N = 20188315.61. (CC) gcc options: -O3 -march=native -fopenmp

Mlpack Benchmark

Benchmark: scikit_svm

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svm2 x AMD EPYC 7F52 16-Core612182430SE +/- 0.52, N = 1523.85

Sysbench

Test: Memory

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 2018-07-28Test: Memory2 x AMD EPYC 7F52 16-Core500K1000K1500K2000K2500KSE +/- 60717.00, N = 152185353.721. (CC) gcc options: -pthread -O3 -funroll-loops -ggdb3 -march=amdfam10 -rdynamic -ldl -laio -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 5.0.5Test: GET2 x AMD EPYC 7F52 16-Core400K800K1200K1600K2000KSE +/- 37597.22, N = 152015112.331. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core90180270360450SE +/- 6.78, N = 15406.12MIN: 344.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core1.25982.51963.77945.03926.299SE +/- 0.60492, N = 155.59919MIN: 1.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.79961.59922.39883.19843.998SE +/- 0.10727, N = 123.55389MIN: 2.721. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.53081.06161.59242.12322.654SE +/- 0.04494, N = 152.35919MIN: 1.721. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core48121620SE +/- 0.24, N = 1515.03MIN: 13.061. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch 1D - Data Type: f32 - Engine: CPU2 x AMD EPYC 7F52 16-Core0.3530.7061.0591.4121.765SE +/- 0.02607, N = 151.56899MIN: 1.231. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

Build: Stock - Size: 1D FFT Size 64

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 642 x AMD EPYC 7F52 16-Core2K4K6K8K10KSE +/- 233.31, N = 158591.41. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Add2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 6417.88, N = 5193801.41. (CC) gcc options: -O3 -march=native -fopenmp

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triad2 x AMD EPYC 7F52 16-Core40K80K120K160K200KSE +/- 9403.33, N = 5195537.91. (CC) gcc options: -O3 -march=native -fopenmp


Phoronix Test Suite v10.8.4