GCC 12 pgo Optimizations On AMD Threadripper

AMD Ryzen Threadripper 3990X GCC 12 profile guided optimizations (PGO) benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2207207-NE-PGORUN80001.

GCC 12 pgo Optimizations On AMD ThreadripperProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionTR 3990XTR 3990X - PGOAMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads)Gigabyte TRX40 AORUS PRO WIFI (F4p BIOS)AMD Starship/Matisse128GBSamsung SSD 970 EVO Plus 500GB + 32GB Flash DriveAMD Radeon RX 5700 8GB (1750/875MHz)AMD Navi 10 HDMI AudioDELL P2415QIntel I211 + Intel Wi-Fi 6 AX200Fedora Linux 365.18.11-200.fc36.x86_64 (x86_64)GNOME Shell 42.3.1X Server 1.20.14 + Wayland4.6 Mesa 22.1.3 (LLVM 14.0.0 DRM 3.46)GCC 12.1.1 20220507btrfs3840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"Compiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301039Graphics Details- GLAMOR - BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D1820201-101Python Details- Python 3.10.5Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

GCC 12 pgo Optimizations On AMD Threadripperyquake2: Software CPU - On - On - 1920 x 1080yquake2: Software CPU - On - On - 3840 x 2160yquake2: Software CPU - Off - On - 1920 x 1080yquake2: Software CPU - Off - On - 3840 x 2160yquake2: Software CPU - On - Off - 1920 x 1080yquake2: Software CPU - On - Off - 3840 x 2160yquake2: Software CPU - Off - Off - 1920 x 1080yquake2: Software CPU - Off - Off - 3840 x 2160yquake2: Software CPU Color Light - On - On - 1920 x 1080yquake2: Software CPU Color Light - On - On - 3840 x 2160yquake2: Software CPU Color Light - Off - On - 1920 x 1080yquake2: Software CPU Color Light - Off - On - 3840 x 2160yquake2: Software CPU Color Light - On - Off - 1920 x 1080yquake2: Software CPU Color Light - On - Off - 3840 x 2160yquake2: Software CPU Color Light - Off - Off - 1920 x 1080yquake2: Software CPU Color Light - Off - Off - 3840 x 2160scimark2: Compositescimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptsvt-hevc: 10 - Bosphorus 4Khimeno: Poisson Pressure Solverc-ray: Total Time - 4K, 16 Rays Per Pixelaobench: 2048 x 2048 - Total Timeopenssl: SHA256openssl: RSA4096openssl: RSA4096daphne: OpenMP - NDT Mappingastcenc: MediumTR 3990XTR 3990X - PGO117.135.0120.535.6117.434.9120.235.6109.633.0110.433.6108.033.4111.433.83539.053518.089694.012218.1092.95992.4145729.5295713.900358.861366.251432.209444.960138.890140.494603.401596.611148.704636.6084389.46527.6326020528674913433.9854909.0899.063.6566118.535.9121.936.8118.735.9121.836.7111.734.4112.635.0111.034.4114.135.23579.403570.1610555.342253.8993.46193.0335666.1105682.727358.690365.190436.282448.216144.500142.663668.683661.511150.114423.3235159.43828.3497115075072613338.9848927.3886.573.4315OpenBenchmarking.org

yquake2

Renderer: Software CPU - AF: On - MSAA: On - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: On - MSAA: On - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO306090120150SE +/- 0.18, N = 3SE +/- 0.88, N = 3117.1118.5-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - AF: On - MSAA: On - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: On - MSAA: On - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.26, N = 3SE +/- 0.03, N = 335.035.9-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - AF: Off - MSAA: On - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: Off - MSAA: On - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO306090120150SE +/- 0.81, N = 3SE +/- 0.74, N = 3120.5121.9-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - AF: Off - MSAA: On - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: Off - MSAA: On - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.17, N = 3SE +/- 0.06, N = 335.636.8-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - AF: On - MSAA: Off - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: On - MSAA: Off - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO306090120150SE +/- 0.17, N = 3SE +/- 0.40, N = 3117.4118.7-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - AF: On - MSAA: Off - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: On - MSAA: Off - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.17, N = 3SE +/- 0.06, N = 334.935.9-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - AF: Off - MSAA: Off - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: Off - MSAA: Off - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO306090120150SE +/- 0.55, N = 3SE +/- 0.44, N = 3120.2121.8-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - AF: Off - MSAA: Off - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU - AF: Off - MSAA: Off - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.07, N = 3SE +/- 0.03, N = 335.636.7-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: On - MSAA: On - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: On - MSAA: On - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO306090120150SE +/- 0.41, N = 3SE +/- 0.22, N = 3109.6111.7-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: On - MSAA: On - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: On - MSAA: On - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.07, N = 3SE +/- 0.06, N = 333.034.4-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: Off - MSAA: On - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: Off - MSAA: On - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO306090120150SE +/- 0.67, N = 3SE +/- 0.29, N = 3110.4112.6-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: Off - MSAA: On - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: Off - MSAA: On - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.03, N = 3SE +/- 0.09, N = 333.635.0-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: On - MSAA: Off - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: On - MSAA: Off - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO20406080100SE +/- 0.35, N = 3SE +/- 0.36, N = 3108.0111.0-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: On - MSAA: Off - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: On - MSAA: Off - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.24, N = 3SE +/- 0.07, N = 333.434.4-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: Off - MSAA: Off - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: Off - MSAA: Off - Resolution: 1920 x 1080TR 3990XTR 3990X - PGO306090120150SE +/- 0.63, N = 3SE +/- 0.20, N = 3111.4114.1-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU Color Light - AF: Off - MSAA: Off - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 8.10Renderer: Software CPU Color Light - AF: Off - MSAA: Off - Resolution: 3840 x 2160TR 3990XTR 3990X - PGO816243240SE +/- 0.12, N = 3SE +/- 0.09, N = 333.835.2-fprofile-correction1. (CC) gcc options: -shared -lm -ldl -rdynamic -pthread -lSDL2 -O3 -march=native -flto -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeTR 3990XTR 3990X - PGO8001600240032004000SE +/- 5.40, N = 3SE +/- 2.71, N = 33539.053579.40-fprofile-correction1. (CC) gcc options: -O3 -march=native -flto -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyTR 3990XTR 3990X - PGO8001600240032004000SE +/- 7.02, N = 3SE +/- 5.27, N = 33518.083570.16-fprofile-correction1. (CC) gcc options: -O3 -march=native -flto -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationTR 3990XTR 3990X - PGO2K4K6K8K10KSE +/- 21.63, N = 3SE +/- 7.18, N = 39694.0110555.34-fprofile-correction1. (CC) gcc options: -O3 -march=native -flto -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationTR 3990XTR 3990X - PGO5001000150020002500SE +/- 4.69, N = 3SE +/- 3.40, N = 32218.102253.89-fprofile-correction1. (CC) gcc options: -O3 -march=native -flto -lm

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMITR 3990XTR 3990X - PGO20406080100SE +/- 0.03, N = 3SE +/- 0.01, N = 392.9693.461. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptTR 3990XTR 3990X - PGO20406080100SE +/- 0.03, N = 3SE +/- 0.03, N = 392.4193.031. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256TR 3990XTR 3990X - PGO12002400360048006000SE +/- 10.83, N = 3SE +/- 2.04, N = 35729.535666.111. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptTR 3990XTR 3990X - PGO12002400360048006000SE +/- 11.44, N = 3SE +/- 3.35, N = 35713.905682.731. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishTR 3990XTR 3990X - PGO80160240320400SE +/- 0.09, N = 3SE +/- 3.71, N = 5358.86358.691. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptTR 3990XTR 3990X - PGO80160240320400SE +/- 0.48, N = 3SE +/- 4.30, N = 5366.25365.191. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishTR 3990XTR 3990X - PGO90180270360450SE +/- 0.19, N = 3SE +/- 0.32, N = 3432.21436.281. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptTR 3990XTR 3990X - PGO100200300400500SE +/- 0.10, N = 3SE +/- 0.08, N = 3444.96448.221. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256TR 3990XTR 3990X - PGO306090120150SE +/- 0.16, N = 3SE +/- 0.57, N = 3138.89144.501. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptTR 3990XTR 3990X - PGO306090120150SE +/- 0.04, N = 3SE +/- 0.69, N = 3140.49142.661. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305TR 3990XTR 3990X - PGO140280420560700SE +/- 0.15, N = 3SE +/- 0.42, N = 3603.40668.681. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptTR 3990XTR 3990X - PGO140280420560700SE +/- 0.30, N = 3SE +/- 0.65, N = 3596.61661.511. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 4KTR 3990XTR 3990X - PGO306090120150SE +/- 1.22, N = 8SE +/- 1.66, N = 5148.70150.11-fprofile-correction1. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverTR 3990XTR 3990X - PGO10002000300040005000SE +/- 65.91, N = 3SE +/- 59.07, N = 34636.614423.32-fprofile-correction1. (CC) gcc options: -O3 -march=native -flto -mavx2

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelTR 3990XTR 3990X - PGO3691215SE +/- 0.030, N = 3SE +/- 0.065, N = 39.4659.438-fprofile-correction1. (CC) gcc options: -lm -lpthread -O3 -march=native -flto

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeTR 3990XTR 3990X - PGO714212835SE +/- 0.08, N = 3SE +/- 0.04, N = 327.6328.35-fprofile-correction1. (CC) gcc options: -lm -O3 -march=native -flto

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256TR 3990XTR 3990X - PGO15000M30000M45000M60000M75000MSE +/- 572595073.99, N = 12SE +/- 637527883.34, N = 126020528674971150750726-lssl-fprofile-correction1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096TR 3990XTR 3990X - PGO3K6K9K12K15KSE +/- 83.53, N = 3SE +/- 96.06, N = 313433.913338.9-lssl-fprofile-correction1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096TR 3990XTR 3990X - PGO200K400K600K800K1000KSE +/- 1268.34, N = 3SE +/- 1886.05, N = 3854909.0848927.3-lssl-fprofile-correction1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lcrypto -ldl

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT MappingTR 3990XTR 3990X - PGO2004006008001000SE +/- 10.11, N = 4SE +/- 8.71, N = 15899.06886.571. (CXX) g++ options: -O3 -std=c++11 -fopenmp

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: MediumTR 3990XTR 3990X - PGO0.82271.64542.46813.29084.1135SE +/- 0.0056, N = 3SE +/- 0.0105, N = 33.65663.4315-fprofile-correction1. (CXX) g++ options: -O3 -march=native -flto -pthread


Phoronix Test Suite v10.8.4