xanmod-kernel-v3

AMD Ryzen 9 3900X 12-Core testing with a MSI MPG X570 GAMING PLUS (MS-7C37) v2.0 (A.61 BIOS) and MSI NVIDIA GeForce GT 1030 on Debian stable-updates via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2008021-NE-XANMODKER88&sor&grt.

xanmod-kernel-v3ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDisplay ServerDisplay DriverCompilerFile-Systemstock-linux-kernelxanmod-kernel-stockxanmod-kernel-stock-v2xanmod-kernel-optimizedxanmod-kernel-optimized-blenderAMD Ryzen 9 3900X 12-Core (12 Cores / 24 Threads)MSI MPG X570 GAMING PLUS (MS-7C37) v2.0 (A.61 BIOS)AMD Starship/Matisse4 x 16384 MB DDR4-3200MT/s CMK32GX4M2D3000C161000GB Samsung SSD 970 EVO 1TB + 8002GB Western Digital WD80EMAZ-00WMSI NVIDIA GeForce GT 1030NVIDIA GP108 HD AudioRealtek RTL8111/8168/8411Debian testing5.7.6-050706-lowlatency (x86_64)X Server 1.20.8modesetting 1.20.8GCC 9.3.0 + Clang 9.0.1-13 + LLVM 9.0.1ext45.7.10-xanmod1 (x86_64)4 x 16384 MB DDR4-3000MT/s CMK32GX4M2D3000C16Debian stable-updatesOpenBenchmarking.orgEnvironment Details- stock-linux-kernel: RADV_PERFTEST=aco NVM_CD_FLAGS=- xanmod-kernel-stock: RADV_PERFTEST=aco NVM_CD_FLAGS=- xanmod-kernel-stock-v2: RADV_PERFTEST=aco NVM_CD_FLAGS=- xanmod-kernel-optimized: RADV_PERFTEST=aco- xanmod-kernel-optimized-blender: RADV_PERFTEST=acoCompiler Details- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-0xEOmg/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: NONE / errors=remount-ro,relatime,rwProcessor Details- CPU Microcode: 0x8701013Java Details- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: OpenJDK Runtime Environment (build 11.0.7+10-post-Debian-3deb10u1)Python Details- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

xanmod-kernel-v3compress-7zip: Compress Speed Testblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyc-ray: Total Time - 4K, 16 Rays Per Pixelcachebench: Readcachebench: Writecachebench: Read / Modify / Writecore-latency: Average Latency Between CPU Coresdarktable: Boat - CPU-onlydarktable: Masskrug - CPU-onlydarktable: Server Rack - CPU-onlydarktable: Server Room - CPU-onlydeepspeech: CPUffmpeg: H.264 HD To NTSC DVfftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 4096gnupg: 2GB File Encryptionhpcg: java-scimark2: Compositejava-scimark2: Monte Carlojava-scimark2: Fast Fourier Transformjava-scimark2: Sparse Matrix Multiplyjava-scimark2: Dense LU Matrix Factorizationjava-scimark2: Jacobi Successive Over-Relaxationkeydb: mysqlslap: 1npb: BT.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: LU.Cnpb: MG.Cnpb: SP.Bonednn: IP Batch All - f32 - CPUonednn: IP Batch All - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch deconv_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch deconv_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUopenssl: RSA 4096-bit Performanceramspeed: Add - Integerramspeed: Copy - Integerramspeed: Scale - Integerramspeed: Triad - Integerramspeed: Average - Integerramspeed: Add - Floating Pointramspeed: Copy - Floating Pointramspeed: Scale - Floating Pointramspeed: Triad - Floating Pointramspeed: Average - Floating Pointrays1bench: Large Scenesqlite: 1sqlite: 128sqlite-speedtest: Timed Time - Size 1,000build-ffmpeg: Time To Compilebuild-linux-kernel: Time To Compilebuild-php: Time To Compilewireguard: x265: H.265 1080p Video Encodingstock-linux-kernelxanmod-kernel-stockxanmod-kernel-stock-v2xanmod-kernel-optimizedxanmod-kernel-optimized-blender8123142.9353025.4632183.80546415961924.050792619142.268.4014.4530.1403.03556.365845.2891467345476571812129111.1905.683633039.191704.382030.912720.276768.171972.21119368.9225330353.28785.50782.0312485.7233564.2816862.7013080.8442.523124.615212.07264.6884613.14533.61101207.24942.50890.8053161.939223494.031078.6026675.0427448.3431334.7628792.5531350.5727445.7826941.8131043.7129306.8488.0539.2362449.76970.11532.48747.36639.101230.35668.7340.6088167942.7473014.5232004.95629738163966.482233318144.048.4024.4800.1383.01556.367275.4021496845621555672026911.2435.697403075.131700.612067.652701.656940.611965.14123834.2225930132.56788.37787.4012492.0733654.8616879.8013058.5842.460924.514412.06724.6647713.14573.60538205.18741.52520.8032841.940853506.530294.8427440.9626628.9231765.5828894.3031894.4827024.5727390.7030172.8428902.0888.3140.3132456.35667.55032.11846.48438.528191.27968.908317042.0443050.1032511.70329576262834.856835413139.738.5454.4410.1432.99856.254005.1711492345825562321987611.3115.541783069.361701.682076.802700.256892.471975.61130220.5827529948.81823.86812.4412085.2633335.3716422.8112922.6341.586623.809212.51814.5125013.52713.47507200.18640.52610.7849781.882923595.231234.9726885.9426780.8331510.4028801.1730949.8026768.4327120.6230019.6729355.4190.0738.41767.82931.82746.00338.145184.80470.97111.00296.71163.16444.76373.42OpenBenchmarking.org

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Testxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel20K40K60K80K100KSE +/- 190.50, N = 2SE +/- 584.03, N = 3SE +/- 703.40, N = 38317081679812311. (CXX) g++ options: -pipe -lpthread

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CPU-Onlyxanmod-kernel-optimized-blender20406080100SE +/- 0.33, N = 3111.00

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CPU-Onlyxanmod-kernel-optimized-blender60120180240300SE +/- 0.13, N = 3296.71

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CPU-Onlyxanmod-kernel-optimized-blender4080120160200SE +/- 0.32, N = 3163.16

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CPU-Onlyxanmod-kernel-optimized-blender100200300400500SE +/- 0.49, N = 3444.76

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CPU-Onlyxanmod-kernel-optimized-blender80160240320400SE +/- 0.81, N = 3373.42

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel1020304050SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 342.0442.7542.941. (CC) gcc options: -lm -lpthread -O3

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Readxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v27001400210028003500SE +/- 8.27, N = 3SE +/- 8.65, N = 3SE +/- 7.16, N = 33050.103025.463014.52MIN: 3027.65 / MAX: 3065.82MIN: 3006.36 / MAX: 3036.42MIN: 2973.27 / MAX: 3043.121. (CC) gcc options: -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Writexanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v27K14K21K28K35KSE +/- 384.10, N = 3SE +/- 426.99, N = 3SE +/- 348.21, N = 332511.7032183.8132004.96MIN: 27980.81 / MAX: 34688.08MIN: 27584.78 / MAX: 34495.84MIN: 27420.12 / MAX: 34639.581. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / Writexanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel14K28K42K56K70KSE +/- 755.52, N = 3SE +/- 14.57, N = 3SE +/- 273.27, N = 363966.4862834.8661924.05MIN: 55607.63 / MAX: 68405.98MIN: 55279.71 / MAX: 66668.02MIN: 54980.84 / MAX: 66137.051. (CC) gcc options: -lrt

Core-Latency

Average Latency Between CPU Cores

OpenBenchmarking.orgns, Fewer Is BetterCore-LatencyAverage Latency Between CPU Coresxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v2306090120150139.73142.26144.04MIN: 42.74 / MAX: 166.14MIN: 42.6 / MAX: 169.15MIN: 42.79 / MAX: 172.061. (CXX) g++ options: -std=c++11 -pthread -O3

Darktable

Test: Boat - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.2Test: Boat - Acceleration: CPU-onlystock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized246810SE +/- 0.006, N = 3SE +/- 0.006, N = 3SE +/- 0.012, N = 38.4018.4028.545

Darktable

Test: Masskrug - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.2Test: Masskrug - Acceleration: CPU-onlyxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v21.0082.0163.0244.0325.04SE +/- 0.016, N = 3SE +/- 0.020, N = 3SE +/- 0.017, N = 34.4414.4534.480

Darktable

Test: Server Rack - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.2Test: Server Rack - Acceleration: CPU-onlyxanmod-kernel-stock-v2stock-linux-kernelxanmod-kernel-optimized0.03220.06440.09660.12880.161SE +/- 0.000, N = 3SE +/- 0.002, N = 4SE +/- 0.002, N = 30.1380.1400.143

Darktable

Test: Server Room - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 3.0.2Test: Server Room - Acceleration: CPU-onlyxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel0.68291.36582.04872.73163.4145SE +/- 0.004, N = 3SE +/- 0.006, N = 3SE +/- 0.008, N = 32.9983.0153.035

DeepSpeech

Acceleration: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterDeepSpeech 0.6Acceleration: CPUxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v21326395265SE +/- 0.16, N = 3SE +/- 0.04, N = 3SE +/- 0.25, N = 356.2556.3756.37

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v21.21552.4313.64654.8626.0775SE +/- 0.001, N = 3SE +/- 0.042, N = 3SE +/- 0.034, N = 35.1715.2895.4021. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -lxcb -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32xanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel3K6K9K12K15KSE +/- 58.71, N = 3SE +/- 216.56, N = 4SE +/- 150.35, N = 31496814923146731. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32xanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel10K20K30K40K50KSE +/- 91.12, N = 3SE +/- 78.67, N = 3SE +/- 77.66, N = 34582545621454761. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096stock-linux-kernelxanmod-kernel-optimizedxanmod-kernel-stock-v212K24K36K48K60KSE +/- 677.30, N = 5SE +/- 728.33, N = 3SE +/- 917.95, N = 35718156232555671. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096stock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized5K10K15K20K25KSE +/- 226.07, N = 3SE +/- 158.09, N = 3SE +/- 112.65, N = 32129120269198761. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

GnuPG

2GB File Encryption

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 1.4.222GB File Encryptionstock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized3691215SE +/- 0.16, N = 3SE +/- 0.12, N = 15SE +/- 0.13, N = 1511.1911.2411.311. (CC) gcc options: -O2 -MT -MD -MP -MF

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1xanmod-kernel-stock-v2stock-linux-kernelxanmod-kernel-optimized1.28192.56383.84575.12766.4095SE +/- 0.00163, N = 3SE +/- 0.01502, N = 3SE +/- 0.00226, N = 35.697405.683635.541781. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Java SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterJava SciMark 2.0Computational Test: Compositexanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel7001400210028003500SE +/- 34.71, N = 4SE +/- 7.44, N = 4SE +/- 29.80, N = 43075.133069.363039.19

Java SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterJava SciMark 2.0Computational Test: Monte Carlostock-linux-kernelxanmod-kernel-optimizedxanmod-kernel-stock-v2400800120016002000SE +/- 20.11, N = 4SE +/- 3.71, N = 4SE +/- 19.26, N = 41704.381701.681700.61

Java SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterJava SciMark 2.0Computational Test: Fast Fourier Transformxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel400800120016002000SE +/- 3.49, N = 4SE +/- 22.09, N = 4SE +/- 6.68, N = 42076.802067.652030.91

Java SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterJava SciMark 2.0Computational Test: Sparse Matrix Multiplystock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized6001200180024003000SE +/- 26.42, N = 4SE +/- 37.81, N = 4SE +/- 12.34, N = 42720.272701.652700.25

Java SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterJava SciMark 2.0Computational Test: Dense LU Matrix Factorizationxanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel15003000450060007500SE +/- 80.90, N = 4SE +/- 15.70, N = 4SE +/- 72.44, N = 46940.616892.476768.17

Java SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterJava SciMark 2.0Computational Test: Jacobi Successive Over-Relaxationxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v2400800120016002000SE +/- 2.63, N = 4SE +/- 26.34, N = 4SE +/- 31.69, N = 41975.611972.211965.14

KeyDB

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 5.3.1xanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel30K60K90K120K150KSE +/- 1668.86, N = 3SE +/- 1088.54, N = 15SE +/- 1002.40, N = 15130220.58123834.22119368.921. (CXX) g++ options: -O2 -levent -lpthread -lz -lpcre

MariaDB

Clients: 1

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 1xanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel60120180240300SE +/- 4.27, N = 9SE +/- 4.19, N = 9SE +/- 5.99, N = 92752592531. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llz4 -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cstock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized7K14K21K28K35KSE +/- 172.89, N = 3SE +/- 178.89, N = 3SE +/- 55.35, N = 330353.2830132.5629948.811. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Cxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel2004006008001000SE +/- 2.42, N = 3SE +/- 0.09, N = 3SE +/- 0.25, N = 3823.86788.37785.501. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel2004006008001000SE +/- 0.21, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3812.44787.40782.031. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cxanmod-kernel-stock-v2stock-linux-kernelxanmod-kernel-optimized3K6K9K12K15KSE +/- 24.79, N = 3SE +/- 41.50, N = 3SE +/- 7.96, N = 312492.0712485.7212085.261. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Cxanmod-kernel-stock-v2stock-linux-kernelxanmod-kernel-optimized7K14K21K28K35KSE +/- 19.64, N = 3SE +/- 5.41, N = 3SE +/- 31.44, N = 333654.8633564.2833335.371. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cxanmod-kernel-stock-v2stock-linux-kernelxanmod-kernel-optimized4K8K12K16K20KSE +/- 3.96, N = 3SE +/- 3.42, N = 3SE +/- 3.88, N = 316879.8016862.7016422.811. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Bstock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized3K6K9K12K15KSE +/- 151.16, N = 3SE +/- 102.39, N = 15SE +/- 89.96, N = 313080.8413058.5812922.631. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.4

oneDNN

Harness: IP Batch All - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: f32 - Engine: CPUxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel1020304050SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 341.5942.4642.52MIN: 40.37MIN: 41.45MIN: 41.571. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPUxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel612182430SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 323.8124.5124.62MIN: 23.16MIN: 23.86MIN: 24.121. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUxanmod-kernel-stock-v2stock-linux-kernelxanmod-kernel-optimized3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.0712.0712.52MIN: 11.48MIN: 11.66MIN: 11.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPUxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel1.05492.10983.16474.21965.2745SE +/- 0.00986, N = 3SE +/- 0.00522, N = 3SE +/- 0.00217, N = 34.512504.664774.68846MIN: 4.29MIN: 4.52MIN: 4.551. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUstock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized3691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 313.1513.1513.53MIN: 12.78MIN: 12.72MIN: 13.121. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPUxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel0.81251.6252.43753.254.0625SE +/- 0.00233, N = 3SE +/- 0.00029, N = 3SE +/- 0.00224, N = 33.475073.605383.61101MIN: 3.35MIN: 3.52MIN: 3.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel50100150200250SE +/- 0.29, N = 3SE +/- 0.44, N = 3SE +/- 0.29, N = 3200.19205.19207.25MIN: 193.59MIN: 198.4MIN: 201.061. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel1020304050SE +/- 0.30, N = 3SE +/- 0.42, N = 3SE +/- 0.23, N = 340.5341.5342.51MIN: 38.63MIN: 39.45MIN: 40.781. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel0.18120.36240.54360.72480.906SE +/- 0.002024, N = 3SE +/- 0.001743, N = 3SE +/- 0.001986, N = 30.7849780.8032840.805316MIN: 0.71MIN: 0.75MIN: 0.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 1.5Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v20.43670.87341.31011.74682.1835SE +/- 0.00095, N = 3SE +/- 0.00229, N = 3SE +/- 0.00175, N = 31.882921.939221.94085MIN: 1.79MIN: 1.87MIN: 1.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performancexanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel8001600240032004000SE +/- 14.89, N = 3SE +/- 3.23, N = 3SE +/- 6.31, N = 33595.23506.53494.01. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

RAMspeed SMP

Type: Add - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Integerxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v27K14K21K28K35KSE +/- 468.24, N = 3SE +/- 414.95, N = 5SE +/- 21.34, N = 331234.9731078.6030294.841. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Integerxanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel6K12K18K24K30KSE +/- 408.95, N = 3SE +/- 337.97, N = 3SE +/- 8.98, N = 327440.9626885.9426675.041. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Integerstock-linux-kernelxanmod-kernel-optimizedxanmod-kernel-stock-v26K12K18K24K30KSE +/- 342.02, N = 3SE +/- 182.76, N = 3SE +/- 3.83, N = 327448.3426780.8326628.921. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Integerxanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel7K14K21K28K35KSE +/- 382.19, N = 5SE +/- 439.89, N = 4SE +/- 460.40, N = 431765.5831510.4031334.761. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Integerxanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel6K12K18K24K30KSE +/- 465.21, N = 3SE +/- 460.72, N = 3SE +/- 426.05, N = 328894.3028801.1728792.551. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Add - Benchmark: Floating Pointxanmod-kernel-stock-v2stock-linux-kernelxanmod-kernel-optimized7K14K21K28K35KSE +/- 64.13, N = 3SE +/- 447.67, N = 4SE +/- 453.79, N = 431894.4831350.5730949.801. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Copy - Benchmark: Floating Pointstock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized6K12K18K24K30KSE +/- 392.11, N = 3SE +/- 405.20, N = 3SE +/- 368.45, N = 327445.7827024.5726768.431. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Scale - Benchmark: Floating Pointxanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel6K12K18K24K30KSE +/- 405.62, N = 3SE +/- 351.33, N = 3SE +/- 27.11, N = 327390.7027120.6226941.811. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Triad - Benchmark: Floating Pointstock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-optimized7K14K21K28K35KSE +/- 461.90, N = 4SE +/- 7.15, N = 3SE +/- 2.05, N = 331043.7130172.8430019.671. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

OpenBenchmarking.orgMB/s, More Is BetterRAMspeed SMP 3.5.0Type: Average - Benchmark: Floating Pointxanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v26K12K18K24K30KSE +/- 78.82, N = 3SE +/- 477.47, N = 3SE +/- 493.83, N = 329355.4129306.8428902.081. (CC) gcc options: -O3 -march=native

rays1bench

Large Scene

OpenBenchmarking.orgmrays/s, More Is Betterrays1bench 2020-01-09Large Scenexanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel20406080100SE +/- 0.20, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 390.0788.3188.05

SQLite

Threads / Copies: 1

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.30.1Threads / Copies: 1xanmod-kernel-optimizedstock-linux-kernelxanmod-kernel-stock-v2xanmod-kernel-stock918273645SE +/- 0.27, N = 3SE +/- 0.23, N = 3SE +/- 0.56, N = 3SE +/- 0.57, N = 338.4239.2440.3140.611. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

SQLite

Threads / Copies: 128

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.30.1Threads / Copies: 128stock-linux-kernelxanmod-kernel-stock-v25001000150020002500SE +/- 2.38, N = 3SE +/- 10.49, N = 32449.772456.361. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000xanmod-kernel-stock-v2xanmod-kernel-optimizedstock-linux-kernel1632486480SE +/- 0.95, N = 15SE +/- 1.20, N = 15SE +/- 1.15, N = 1567.5567.8370.121. (CC) gcc options: -O2 -ldl -lz -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compilexanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel816243240SE +/- 0.17, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 331.8332.1232.49

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To Compilexanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel1122334455SE +/- 0.36, N = 3SE +/- 0.10, N = 346.0046.4847.37

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To Compilexanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel918273645SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 338.1538.5339.10

WireGuard + Linux Networking Stack Stress Test

OpenBenchmarking.orgSeconds, Fewer Is BetterWireGuard + Linux Networking Stack Stress Testxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel50100150200250SE +/- 0.86, N = 3SE +/- 1.10, N = 3SE +/- 0.93, N = 3184.80191.28230.36

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.1.2H.265 1080p Video Encodingxanmod-kernel-optimizedxanmod-kernel-stock-v2stock-linux-kernel1632486480SE +/- 0.23, N = 3SE +/- 0.08, N = 3SE +/- 0.14, N = 370.9768.9068.731. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma


Phoronix Test Suite v10.8.4