Threadripper 3960X GCC 10 LTO Testing

AMD Ryzen Threadripper 3960X 24-Core testing with a MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 19.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1912215-PTS-THREADRI04&grs.

Threadripper 3960X GCC 10 LTO TestingProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O3 -march=native -fltoAMD Ryzen Threadripper 3960X 24-Core @ 3.80GHz (24 Cores / 48 Threads)MSI Creator TRX40 (MS-7C59) v1.0 (1.12N1 BIOS)AMD Starship/Matisse32768MB1000GB Sabrent Rocket 4.0 1TBGigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)AMD Baffin HDMI/DPASUS VP28UAquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Device 2723Ubuntu 19.105.4.0-nvme-hwmon (x86_64)GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.54.5 Mesa 19.2.1 (LLVM 9.0.0)GCC 10.0.0 20191208ext43840x2160OpenBenchmarking.org- CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- --disable-multilib --enable-checking=release- NONE / errors=remount-ro,relatime,rw- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8301025- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + tsx_async_abort: Not affected

Threadripper 3960X GCC 10 LTO Testingnginx: Static Web Page Servingrocksdb: Read While Writingrocksdb: Rand Fill Syncrocksdb: Seq Fillrocksdb: Rand Readrocksdb: Rand Fillsqlite-speedtest: Timed Time - Size 1,000pgbench: Buffer Test - Heavy Contention - Read Onlypgbench: Buffer Test - Normal Load - Read Onlygromacs: Water Benchmarkaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingopenssl: RSA 4096-bit Performanceradiance: SMP Parallelradiance: Serialencode-mp3: WAV To MP3encode-flac: WAV To FLACcompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19compress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9build-imagemagick: Time To Compilestockfish: Total Timehimeno: Poisson Pressure Solvermt-dgemm: Sustained Floating-Point Ratettsiod-renderer: Phong Rendering With Soft-Shadow Mappingmkl-dnn: Convolution Batch conv_googlenet_v3 - f32mkl-dnn: Recurrent Neural Network Training - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_1d - f32tscp: AI Chess Performancecrafty: Elapsed Timebyte: Floating-Point Arithmeticbyte: Register Arithmeticbyte: Integer Arithmeticbyte: Dhrystone 2qmcpack: mrbayes: Primate Phylogeny Analysisfftw: Float + SSE - 2D FFT Size 4096fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 32fftw: Stock - 2D FFT Size 4096fftw: Stock - 2D FFT Size 32fftw: Stock - 1D FFT Size 32minife: Smallhpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-Fftehpcc: G-Fftehpcc: G-HPLsqlite: 1-O3 -march=native -flto43673.89490176724409101084014731977791611456.442703431.182912701920.0626192.5174117.585435.313366.191949.817182.9174.534556.1356.6228.04410.16819.86775.246796139884766.3969348.761475950.33052.8269194.665126.9712.316271422472920928711167070357.61895.169.1322450546350154888969.712677112017720.9822951.4283.331170.452340.167221.688745.7879132.7570715.2173715.2173763.6848714.232OpenBenchmarking.org

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page Serving-O3 -march=native -flto9K18K27K36K45KSE +/- 294.79, N = 343673.891. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native -flto

Facebook RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Read While Writing-O3 -march=native -flto1000K2000K3000K4000K5000KSE +/- 9839.07, N = 349017671. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Facebook RocksDB

Test: Random Fill Sync

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random Fill Sync-O3 -march=native -flto5K10K15K20K25KSE +/- 31.00, N = 3244091. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Facebook RocksDB

Test: Sequential Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Sequential Fill-O3 -march=native -flto200K400K600K800K1000KSE +/- 923.75, N = 310108401. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Facebook RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random Read-O3 -march=native -flto30M60M90M120M150MSE +/- 1236531.46, N = 31473197771. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Facebook RocksDB

Test: Random Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random Fill-O3 -march=native -flto200K400K600K800K1000KSE +/- 8977.10, N = 39161141. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O3 -march=native -flto1326395265SE +/- 0.10, N = 356.441. (CC) gcc options: -O3 -march=native -flto -ldl -lz -lpthread

PostgreSQL pgbench

Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Only-O3 -march=native -flto150K300K450K600K750KSE +/- 5480.07, N = 3703431.181. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O3 -march=native -flto150K300K450K600K750KSE +/- 375.35, N = 3701920.061. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -flto -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2019.4Water Benchmark-O3 -march=native -flto0.56631.13261.69892.26522.8315SE +/- 0.001, N = 32.5171. (CXX) g++ options: -mavx2 -mfma -O3 -march=native -flto -std=c++11 -funroll-all-loops -pthread -lrt -lpthread -lm

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - Degridding-O3 -march=native -flto9001800270036004500SE +/- 21.33, N = 34117.581. (CXX) g++ options: -lpthread

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve OpenMP - Gridding-O3 -march=native -flto12002400360048006000SE +/- 64.06, N = 35435.311. (CXX) g++ options: -lpthread

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - Degridding-O3 -march=native -flto7001400210028003500SE +/- 2.36, N = 33366.191. (CXX) g++ options: -lpthread

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 2018-11-10Test: tConvolve MT - Gridding-O3 -march=native -flto400800120016002000SE +/- 2.80, N = 31949.811. (CXX) g++ options: -lpthread

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance-O3 -march=native -flto15003000450060007500SE +/- 21.57, N = 37182.91. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

Radiance Benchmark

Test: SMP Parallel

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: SMP Parallel-O3 -march=native -flto4080120160200174.53

Radiance Benchmark

Test: Serial

OpenBenchmarking.orgSeconds, Fewer Is BetterRadiance Benchmark 5.0Test: Serial-O3 -march=native -flto120240360480600556.14

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=native -flto246810SE +/- 0.067, N = 36.6221. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -flto -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=native -flto246810SE +/- 0.006, N = 58.0441. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19-O3 -march=native -flto3691215SE +/- 0.02, N = 310.171. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9-O3 -march=native -flto510152025SE +/- 0.06, N = 319.871. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=native -flto

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile-O3 -march=native -flto20406080100SE +/- 0.02, N = 375.25

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O3 -march=native -flto20M40M60M80M100MSE +/- 774628.71, N = 3796139881. (CXX) g++ options: -m64 -lpthread -O3 -march=native -flto -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=native -flto10002000300040005000SE +/- 22.94, N = 34766.401. (CC) gcc options: -O3 -march=native -flto -mavx2

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rate-O3 -march=native -flto246810SE +/- 0.132221, N = 38.7614751. (CC) gcc options: -O3 -march=native -fopenmp -flto

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow Mapping-O3 -march=native -flto2004006008001000SE +/- 0.32, N = 3950.331. (CXX) g++ options: -O3 -march=native -flto -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

MKL-DNN DNNL

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32-O3 -march=native -flto1224364860SE +/- 0.34, N = 352.83MIN: 51.441. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Recurrent Neural Network Training - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Recurrent Neural Network Training - Data Type: f32-O3 -march=native -flto4080120160200SE +/- 0.33, N = 3194.67MIN: 192.641. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Convolution Batch conv_alexnet - Data Type: f32-O3 -march=native -flto306090120150SE +/- 0.18, N = 3126.97MIN: 125.841. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

MKL-DNN DNNL

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN DNNL 1.1Harness: Deconvolution Batch deconv_1d - Data Type: f32-O3 -march=native -flto0.52121.04241.56362.08482.606SE +/- 0.00389, N = 42.31627MIN: 2.251. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -fopenmp -pie -lpthread -ldl

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O3 -march=native -flto300K600K900K1200K1500KSE +/- 1795.91, N = 514224721. (CC) gcc options: -O3 -march=native -flto

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time-O3 -march=native -flto2M4M6M8M10MSE +/- 13239.35, N = 392092871. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

BYTE Unix Benchmark

Computational Test: Floating-Point Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Floating-Point Arithmetic-O3 -march=native -flto0.2250.450.6750.91.12511. (CC) gcc options: -O3 -march=native -flto

BYTE Unix Benchmark

Computational Test: Register Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Register Arithmetic-O3 -march=native -flto0.2250.450.6750.91.12511. (CC) gcc options: -O3 -march=native -flto

BYTE Unix Benchmark

Computational Test: Integer Arithmetic

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Integer Arithmetic-O3 -march=native -flto0.2250.450.6750.91.12511. (CC) gcc options: -O3 -march=native -flto

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2-O3 -march=native -flto14M28M42M56M70MSE +/- 508456.63, N = 367070357.61. (CC) gcc options: -O3 -march=native -flto

QMCPACK

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.8-O3 -march=native -flto4008001200160020001895.11. (CXX) g++ options: -O3 -march=native -flto -fopenmp -fomit-frame-pointer -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-O3 -march=native -flto1530456075SE +/- 0.87, N = 469.131. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -march=native -flto -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=native -flto5K10K15K20K25KSE +/- 153.78, N = 3245051. (CC) gcc options: -pthread -O3 -march=native -flto -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32-O3 -march=native -flto10K20K30K40K50KSE +/- 82.72, N = 3463501. (CC) gcc options: -pthread -O3 -march=native -flto -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32-O3 -march=native -flto3K6K9K12K15KSE +/- 25.21, N = 3154881. (CC) gcc options: -pthread -O3 -march=native -flto -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O3 -march=native -flto2K4K6K8K10KSE +/- 155.20, N = 38969.71. (CC) gcc options: -pthread -O3 -march=native -flto -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32-O3 -march=native -flto3K6K9K12K15KSE +/- 14.19, N = 3126771. (CC) gcc options: -pthread -O3 -march=native -flto -lm

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32-O3 -march=native -flto2K4K6K8K10KSE +/- 55.87, N = 3112011. (CC) gcc options: -pthread -O3 -march=native -flto -lm

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: Small-O3 -march=native -flto17003400510068008500SE +/- 12.76, N = 37720.981. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth-O3 -march=native -flto5K10K15K20K25KSE +/- 301.57, N = 322951.431. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth-O3 -march=native -flto0.74951.4992.24852.9983.7475SE +/- 0.02086, N = 33.331171. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency-O3 -march=native -flto0.10180.20360.30540.40720.509SE +/- 0.00082, N = 30.452341. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access-O3 -march=native -flto0.03760.07520.11280.15040.188SE +/- 0.00027, N = 30.167221. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad-O3 -march=native -flto0.380.761.141.521.9SE +/- 0.00593, N = 31.688741. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans-O3 -march=native -flto1.30232.60463.90695.20926.5115SE +/- 0.01376, N = 35.787911. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM-O3 -march=native -flto816243240SE +/- 0.64, N = 332.761. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte-O3 -march=native -flto48121620SE +/- 0.45, N = 315.221. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte-O3 -march=native -flto48121620SE +/- 0.45, N = 315.221. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL-O3 -march=native -flto1428425670SE +/- 0.14, N = 363.681. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -flto -funroll-loops2. ATLAS + Open MPI 3.1.3

SQLite

Threads / Copies: 1

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.30.1Threads / Copies: 1-O3 -march=native -flto48121620SE +/- 0.05, N = 314.231. (CC) gcc options: -O3 -march=native -flto -lz -lm -ldl -lpthread


Phoronix Test Suite v10.8.4