xanmod-kernel-v3

AMD Ryzen 9 3900X 12-Core testing with a MSI MPG X570 GAMING PLUS (MS-7C37) v2.0 (A.61 BIOS) and MSI NVIDIA GeForce GT 1030 on Debian stable-updates via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2008021-NE-XANMODKER88&sro&grr.

xanmod-kernel-v3 system details

Configurations: stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized, xanmod-kernel-optimized-blender

Processor: AMD Ryzen 9 3900X 12-Core (12 Cores / 24 Threads)
Motherboard: MSI MPG X570 GAMING PLUS (MS-7C37) v2.0 (A.61 BIOS)
Chipset: AMD Starship/Matisse
Memory: 4 x 16384 MB DDR4-3200MT/s CMK32GX4M2D3000C16
Disk: 1000GB Samsung SSD 970 EVO 1TB + 8002GB Western Digital WD80EMAZ-00W
Graphics: MSI NVIDIA GeForce GT 1030
Audio: NVIDIA GP108 HD Audio
Network: Realtek RTL8111/8168/8411
OS: Debian testing
Kernel: 5.7.6-050706-lowlatency (x86_64)
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
Compiler: GCC 9.3.0 + Clang 9.0.1-13 + LLVM 9.0.1
File-System: ext4

Differences reported in later configurations:
Kernel: 5.7.10-xanmod1 (x86_64)
Memory: 4 x 16384 MB DDR4-3000MT/s CMK32GX4M2D3000C16
OS: Debian stable-updates

Environment Details
- stock-linux-kernel: RADV_PERFTEST=aco NVM_CD_FLAGS=
- xanmod-kernel-stock: RADV_PERFTEST=aco NVM_CD_FLAGS=
- xanmod-kernel-stock-v2: RADV_PERFTEST=aco NVM_CD_FLAGS=
- xanmod-kernel-optimized: RADV_PERFTEST=aco
- xanmod-kernel-optimized-blender: RADV_PERFTEST=aco

Compiler Details
- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-0xEOmg/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Disk Details
- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: NONE / errors=remount-ro,relatime,rw

Processor Details
- CPU Microcode: 0x8701013

Java Details
- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: OpenJDK Runtime Environment (build 11.0.7+10-post-Debian-3deb10u1)

Python Details
- stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized: Python 2.7.18 + Python 3.8.5

Security Details
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

xanmod-kernel-v3 result overview

Columns: stock-linux-kernel, xanmod-kernel-stock, xanmod-kernel-stock-v2, xanmod-kernel-optimized, xanmod-kernel-optimized-blender. One row per test; the individual results are listed in the per-test sections that follow. xanmod-kernel-stock reports only SQLite (Threads / Copies: 1), and xanmod-kernel-optimized-blender reports only the five Blender scenes.

MariaDB

Clients: 1

MariaDB 10.5.2 (Queries Per Second, More Is Better)
stock-linux-kernel: 253 (SE +/- 5.99, N = 9)
xanmod-kernel-optimized: 275 (SE +/- 4.27, N = 9)
xanmod-kernel-stock-v2: 259 (SE +/- 4.19, N = 9)
1. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llz4 -llzma -lbz2 -laio -lnuma -lpcre2-8 -lcrypt -lz -lm -lssl -lcrypto -ldl

SQLite

Threads / Copies: 128

SQLite 3.30.1 (Seconds, Fewer Is Better)
stock-linux-kernel: 2449.77 (SE +/- 2.38, N = 3)
xanmod-kernel-stock-v2: 2456.36 (SE +/- 10.49, N = 3)
1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
xanmod-kernel-optimized-blender: 444.76 (SE +/- 0.49, N = 3)

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
xanmod-kernel-optimized-blender: 373.42 (SE +/- 0.81, N = 3)

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 (Seconds, Fewer Is Better)
stock-linux-kernel: 70.12 (SE +/- 1.15, N = 15)
xanmod-kernel-optimized: 67.83 (SE +/- 1.20, N = 15)
xanmod-kernel-stock-v2: 67.55 (SE +/- 0.95, N = 15)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6 (Mflops, More Is Better)
stock-linux-kernel: 21291 (SE +/- 226.07, N = 3)
xanmod-kernel-optimized: 19876 (SE +/- 112.65, N = 3)
xanmod-kernel-stock-v2: 20269 (SE +/- 158.09, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
xanmod-kernel-optimized-blender: 296.71 (SE +/- 0.13, N = 3)

KeyDB

KeyDB 5.3.1 (Ops/sec, More Is Better)
stock-linux-kernel: 119368.92 (SE +/- 1002.40, N = 15)
xanmod-kernel-optimized: 130220.58 (SE +/- 1668.86, N = 3)
xanmod-kernel-stock-v2: 123834.22 (SE +/- 1088.54, N = 15)
1. (CXX) g++ options: -O2 -levent -lpthread -lz -lpcre

Core-Latency

Average Latency Between CPU Cores

Core-Latency (ns, Fewer Is Better)
stock-linux-kernel: 142.26 (MIN: 42.6 / MAX: 169.15)
xanmod-kernel-optimized: 139.73 (MIN: 42.74 / MAX: 166.14)
xanmod-kernel-stock-v2: 144.04 (MIN: 42.79 / MAX: 172.06)
1. (CXX) g++ options: -std=c++11 -pthread -O3

WireGuard + Linux Networking Stack Stress Test

WireGuard + Linux Networking Stack Stress Test (Seconds, Fewer Is Better)
stock-linux-kernel: 230.36 (SE +/- 0.93, N = 3)
xanmod-kernel-optimized: 184.80 (SE +/- 0.86, N = 3)
xanmod-kernel-stock-v2: 191.28 (SE +/- 1.10, N = 3)
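
To put the WireGuard numbers in context, the following is a minimal sketch (not part of the Phoronix Test Suite) of how a relative change and a rough noise check can be derived from the reported means and standard errors. The helper names `percent_change` and `gap_in_standard_errors` are hypothetical, and combining the two standard errors in quadrature assumes the runs are independent.

```python
import math


def percent_change(baseline, candidate):
    """Relative change of candidate versus baseline, in percent."""
    return (candidate - baseline) / baseline * 100.0


def gap_in_standard_errors(a, se_a, b, se_b):
    """Difference between two means, expressed in combined standard errors.

    Assumes independent runs, so the two SE values add in quadrature.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(a - b) / combined_se


# Values copied from the WireGuard result above (seconds, fewer is better).
stock_mean, stock_se = 230.36, 0.93
optimized_mean, optimized_se = 184.80, 0.86

change = percent_change(stock_mean, optimized_mean)
gap = gap_in_standard_errors(stock_mean, stock_se, optimized_mean, optimized_se)

print(f"xanmod-kernel-optimized vs stock-linux-kernel: {change:.2f}% run time")
print(f"difference is about {gap:.1f} combined standard errors")
```

A negative percentage means a shorter run time. A gap of only one or two combined standard errors, as in several of the RAMspeed results, suggests the difference may be within run-to-run noise.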

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
stock-linux-kernel: 5.68363 (SE +/- 0.01502, N = 3)
xanmod-kernel-optimized: 5.54178 (SE +/- 0.00226, N = 3)
xanmod-kernel-stock-v2: 5.69740 (SE +/- 0.00163, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
stock-linux-kernel: 782.03 (SE +/- 0.12, N = 3)
xanmod-kernel-optimized: 812.44 (SE +/- 0.21, N = 3)
xanmod-kernel-stock-v2: 787.40 (SE +/- 0.07, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.4

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
xanmod-kernel-optimized-blender: 163.16 (SE +/- 0.32, N = 3)

CacheBench

Test: Read / Modify / Write

CacheBench (MB/s, More Is Better)
stock-linux-kernel: 61924.05 (SE +/- 273.27, N = 3; MIN: 54980.84 / MAX: 66137.05)
xanmod-kernel-optimized: 62834.86 (SE +/- 14.57, N = 3; MIN: 55279.71 / MAX: 66668.02)
xanmod-kernel-stock-v2: 63966.48 (SE +/- 755.52, N = 3; MIN: 55607.63 / MAX: 68405.98)
1. (CC) gcc options: -lrt

CacheBench

Test: Write

CacheBench (MB/s, More Is Better)
stock-linux-kernel: 32183.81 (SE +/- 426.99, N = 3; MIN: 27584.78 / MAX: 34495.84)
xanmod-kernel-optimized: 32511.70 (SE +/- 384.10, N = 3; MIN: 27980.81 / MAX: 34688.08)
xanmod-kernel-stock-v2: 32004.96 (SE +/- 348.21, N = 3; MIN: 27420.12 / MAX: 34639.58)
1. (CC) gcc options: -lrt

CacheBench

Test: Read

CacheBench (MB/s, More Is Better)
stock-linux-kernel: 3025.46 (SE +/- 8.65, N = 3; MIN: 3006.36 / MAX: 3036.42)
xanmod-kernel-optimized: 3050.10 (SE +/- 8.27, N = 3; MIN: 3027.65 / MAX: 3065.82)
xanmod-kernel-stock-v2: 3014.52 (SE +/- 7.16, N = 3; MIN: 2973.27 / MAX: 3043.12)
1. (CC) gcc options: -lrt

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 2.82 (Seconds, Fewer Is Better)
xanmod-kernel-optimized-blender: 111.00 (SE +/- 0.33, N = 3)

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
stock-linux-kernel: 30353.28 (SE +/- 172.89, N = 3)
xanmod-kernel-optimized: 29948.81 (SE +/- 55.35, N = 3)
xanmod-kernel-stock-v2: 30132.56 (SE +/- 178.89, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.4

RAMspeed SMP

Type: Triad - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 31334.76 (SE +/- 460.40, N = 4)
xanmod-kernel-optimized: 31510.40 (SE +/- 439.89, N = 4)
xanmod-kernel-stock-v2: 31765.58 (SE +/- 382.19, N = 5)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 31078.60 (SE +/- 414.95, N = 5)
xanmod-kernel-optimized: 31234.97 (SE +/- 468.24, N = 3)
xanmod-kernel-stock-v2: 30294.84 (SE +/- 21.34, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Add - Benchmark: Floating Point

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 31350.57 (SE +/- 447.67, N = 4)
xanmod-kernel-optimized: 30949.80 (SE +/- 453.79, N = 4)
xanmod-kernel-stock-v2: 31894.48 (SE +/- 64.13, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Triad - Benchmark: Floating Point

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 31043.71 (SE +/- 461.90, N = 4)
xanmod-kernel-optimized: 30019.67 (SE +/- 2.05, N = 3)
xanmod-kernel-stock-v2: 30172.84 (SE +/- 7.15, N = 3)
1. (CC) gcc options: -O3 -march=native

NAS Parallel Benchmarks

Test / Class: SP.B

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
stock-linux-kernel: 13080.84 (SE +/- 151.16, N = 3)
xanmod-kernel-optimized: 12922.63 (SE +/- 89.96, N = 3)
xanmod-kernel-stock-v2: 13058.58 (SE +/- 102.39, N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
stock-linux-kernel: 33564.28 (SE +/- 5.41, N = 3)
xanmod-kernel-optimized: 33335.37 (SE +/- 31.44, N = 3)
xanmod-kernel-stock-v2: 33654.86 (SE +/- 19.64, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.4

RAMspeed SMP

Type: Scale - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 27448.34 (SE +/- 342.02, N = 3)
xanmod-kernel-optimized: 26780.83 (SE +/- 182.76, N = 3)
xanmod-kernel-stock-v2: 26628.92 (SE +/- 3.83, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 28792.55 (SE +/- 426.05, N = 3)
xanmod-kernel-optimized: 28801.17 (SE +/- 460.72, N = 3)
xanmod-kernel-stock-v2: 28894.30 (SE +/- 465.21, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Integer

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 26675.04 (SE +/- 8.98, N = 3)
xanmod-kernel-optimized: 26885.94 (SE +/- 337.97, N = 3)
xanmod-kernel-stock-v2: 27440.96 (SE +/- 408.95, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Copy - Benchmark: Floating Point

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 27445.78 (SE +/- 392.11, N = 3)
xanmod-kernel-optimized: 26768.43 (SE +/- 368.45, N = 3)
xanmod-kernel-stock-v2: 27024.57 (SE +/- 405.20, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Scale - Benchmark: Floating Point

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 26941.81 (SE +/- 27.11, N = 3)
xanmod-kernel-optimized: 27120.62 (SE +/- 351.33, N = 3)
xanmod-kernel-stock-v2: 27390.70 (SE +/- 405.62, N = 3)
1. (CC) gcc options: -O3 -march=native

RAMspeed SMP

Type: Average - Benchmark: Floating Point

RAMspeed SMP 3.5.0 (MB/s, More Is Better)
stock-linux-kernel: 29306.84 (SE +/- 477.47, N = 3)
xanmod-kernel-optimized: 29355.41 (SE +/- 78.82, N = 3)
xanmod-kernel-stock-v2: 28902.08 (SE +/- 493.83, N = 3)
1. (CC) gcc options: -O3 -march=native

oneDNN

Harness: IP Batch All - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 24.62 (SE +/- 0.02, N = 3; MIN: 24.12)
xanmod-kernel-optimized: 23.81 (SE +/- 0.02, N = 3; MIN: 23.16)
xanmod-kernel-stock-v2: 24.51 (SE +/- 0.01, N = 3; MIN: 23.86)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 42.52 (SE +/- 0.02, N = 3; MIN: 41.57)
xanmod-kernel-optimized: 41.59 (SE +/- 0.02, N = 3; MIN: 40.37)
xanmod-kernel-stock-v2: 42.46 (SE +/- 0.02, N = 3; MIN: 41.45)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.4 (Seconds, Fewer Is Better)
stock-linux-kernel: 47.37
xanmod-kernel-optimized: 46.00
xanmod-kernel-stock-v2: 46.48
Standard errors as exported (only two are given for the three results): SE +/- 0.10, N = 3; SE +/- 0.36, N = 3

Java SciMark

Computational Test: Composite

Java SciMark 2.0 (Mflops, More Is Better)
stock-linux-kernel: 3039.19 (SE +/- 29.80, N = 4)
xanmod-kernel-optimized: 3069.36 (SE +/- 7.44, N = 4)
xanmod-kernel-stock-v2: 3075.13 (SE +/- 34.71, N = 4)

DeepSpeech

Acceleration: CPU

DeepSpeech 0.6 (Seconds, Fewer Is Better)
stock-linux-kernel: 56.37 (SE +/- 0.04, N = 3)
xanmod-kernel-optimized: 56.25 (SE +/- 0.16, N = 3)
xanmod-kernel-stock-v2: 56.37 (SE +/- 0.25, N = 3)

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 (Seconds, Fewer Is Better)
stock-linux-kernel: 42.94 (SE +/- 0.06, N = 3)
xanmod-kernel-optimized: 42.04 (SE +/- 0.06, N = 3)
xanmod-kernel-stock-v2: 42.75 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -lm -lpthread -O3

GnuPG

2GB File Encryption

GnuPG 1.4.22 (Seconds, Fewer Is Better)
stock-linux-kernel: 11.19 (SE +/- 0.16, N = 3)
xanmod-kernel-optimized: 11.31 (SE +/- 0.13, N = 15)
xanmod-kernel-stock-v2: 11.24 (SE +/- 0.12, N = 15)
1. (CC) gcc options: -O2 -MT -MD -MP -MF

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 7.4.2 (Seconds, Fewer Is Better)
stock-linux-kernel: 39.10 (SE +/- 0.10, N = 3)
xanmod-kernel-optimized: 38.15 (SE +/- 0.11, N = 3)
xanmod-kernel-stock-v2: 38.53 (SE +/- 0.04, N = 3)

SQLite

Threads / Copies: 1

SQLite 3.30.1 (Seconds, Fewer Is Better)
stock-linux-kernel: 39.24 (SE +/- 0.23, N = 3)
xanmod-kernel-optimized: 38.42 (SE +/- 0.27, N = 3)
xanmod-kernel-stock: 40.61 (SE +/- 0.57, N = 3)
xanmod-kernel-stock-v2: 40.31 (SE +/- 0.56, N = 3)
1. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 207.25 (SE +/- 0.29, N = 3; MIN: 201.06)
xanmod-kernel-optimized: 200.19 (SE +/- 0.29, N = 3; MIN: 193.59)
xanmod-kernel-stock-v2: 205.19 (SE +/- 0.44, N = 3; MIN: 198.4)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 42.51 (SE +/- 0.23, N = 3; MIN: 40.78)
xanmod-kernel-optimized: 40.53 (SE +/- 0.30, N = 3; MIN: 38.63)
xanmod-kernel-stock-v2: 41.53 (SE +/- 0.42, N = 3; MIN: 39.45)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
stock-linux-kernel: 12485.72 (SE +/- 41.50, N = 3)
xanmod-kernel-optimized: 12085.26 (SE +/- 7.96, N = 3)
xanmod-kernel-stock-v2: 12492.07 (SE +/- 24.79, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.4

7-Zip Compression

Compress Speed Test

7-Zip Compression 16.02 (MIPS, More Is Better)
stock-linux-kernel: 81231 (SE +/- 703.40, N = 3)
xanmod-kernel-optimized: 83170 (SE +/- 190.50, N = 2)
xanmod-kernel-stock-v2: 81679 (SE +/- 584.03, N = 3)
1. (CXX) g++ options: -pipe -lpthread

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 4.2.2 (Seconds, Fewer Is Better)
stock-linux-kernel: 32.49 (SE +/- 0.13, N = 3)
xanmod-kernel-optimized: 31.83 (SE +/- 0.17, N = 3)
xanmod-kernel-stock-v2: 32.12 (SE +/- 0.12, N = 3)

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.1.1 (Signs Per Second, More Is Better)
stock-linux-kernel: 3494.0 (SE +/- 6.31, N = 3)
xanmod-kernel-optimized: 3595.2 (SE +/- 14.89, N = 3)
xanmod-kernel-stock-v2: 3506.5 (SE +/- 3.23, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 0.805316 (SE +/- 0.001986, N = 3; MIN: 0.76)
xanmod-kernel-optimized: 0.784978 (SE +/- 0.002024, N = 3; MIN: 0.71)
xanmod-kernel-stock-v2: 0.803284 (SE +/- 0.001743, N = 3; MIN: 0.75)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 1.93922 (SE +/- 0.00229, N = 3; MIN: 1.87)
xanmod-kernel-optimized: 1.88292 (SE +/- 0.00095, N = 3; MIN: 1.79)
xanmod-kernel-stock-v2: 1.94085 (SE +/- 0.00175, N = 3; MIN: 1.87)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
stock-linux-kernel: 785.50 (SE +/- 0.25, N = 3)
xanmod-kernel-optimized: 823.86 (SE +/- 2.42, N = 3)
xanmod-kernel-stock-v2: 788.37 (SE +/- 0.09, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.4

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
stock-linux-kernel: 16862.70 (SE +/- 3.42, N = 3)
xanmod-kernel-optimized: 16422.81 (SE +/- 3.88, N = 3)
xanmod-kernel-stock-v2: 16879.80 (SE +/- 3.96, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.4

rays1bench

Large Scene

rays1bench 2020-01-09 (mrays/s, More Is Better)
stock-linux-kernel: 88.05 (SE +/- 0.05, N = 3)
xanmod-kernel-optimized: 90.07 (SE +/- 0.20, N = 3)
xanmod-kernel-stock-v2: 88.31 (SE +/- 0.04, N = 3)

Darktable

Test: Boat - Acceleration: CPU-only

Darktable 3.0.2 (Seconds, Fewer Is Better)
stock-linux-kernel: 8.401 (SE +/- 0.006, N = 3)
xanmod-kernel-optimized: 8.545 (SE +/- 0.012, N = 3)
xanmod-kernel-stock-v2: 8.402 (SE +/- 0.006, N = 3)

x265

H.265 1080p Video Encoding

x265 3.1.2 (Frames Per Second, More Is Better)
stock-linux-kernel: 68.73 (SE +/- 0.14, N = 3)
xanmod-kernel-optimized: 70.97 (SE +/- 0.23, N = 3)
xanmod-kernel-stock-v2: 68.90 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

FFTW 3.3.6 (Mflops, More Is Better)
stock-linux-kernel: 57181 (SE +/- 677.30, N = 5)
xanmod-kernel-optimized: 56232 (SE +/- 728.33, N = 3)
xanmod-kernel-stock-v2: 55567 (SE +/- 917.95, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 13.15 (SE +/- 0.02, N = 3; MIN: 12.78)
xanmod-kernel-optimized: 13.53 (SE +/- 0.04, N = 3; MIN: 13.12)
xanmod-kernel-stock-v2: 13.15 (SE +/- 0.01, N = 3; MIN: 12.72)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 12.07 (SE +/- 0.01, N = 3; MIN: 11.66)
xanmod-kernel-optimized: 12.52 (SE +/- 0.01, N = 3; MIN: 11.97)
xanmod-kernel-stock-v2: 12.07 (SE +/- 0.00, N = 3; MIN: 11.48)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Darktable

Test: Masskrug - Acceleration: CPU-only

Darktable 3.0.2 (Seconds, Fewer Is Better)
stock-linux-kernel: 4.453 (SE +/- 0.020, N = 3)
xanmod-kernel-optimized: 4.441 (SE +/- 0.016, N = 3)
xanmod-kernel-stock-v2: 4.480 (SE +/- 0.017, N = 3)

FFmpeg

H.264 HD To NTSC DV

FFmpeg 4.0.2 (Seconds, Fewer Is Better)
stock-linux-kernel: 5.289 (SE +/- 0.042, N = 3)
xanmod-kernel-optimized: 5.171 (SE +/- 0.001, N = 3)
xanmod-kernel-stock-v2: 5.402 (SE +/- 0.034, N = 3)
1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -lxcb -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

Darktable

Test: Server Room - Acceleration: CPU-only

Darktable 3.0.2 (Seconds, Fewer Is Better)
stock-linux-kernel: 3.035 (SE +/- 0.008, N = 3)
xanmod-kernel-optimized: 2.998 (SE +/- 0.004, N = 3)
xanmod-kernel-stock-v2: 3.015 (SE +/- 0.006, N = 3)

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

FFTW 3.3.6 (Mflops, More Is Better)
stock-linux-kernel: 45476 (SE +/- 77.66, N = 3)
xanmod-kernel-optimized: 45825 (SE +/- 91.12, N = 3)
xanmod-kernel-stock-v2: 45621 (SE +/- 78.67, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

FFTW 3.3.6 (Mflops, More Is Better)
stock-linux-kernel: 14673 (SE +/- 150.35, N = 3)
xanmod-kernel-optimized: 14923 (SE +/- 216.56, N = 4)
xanmod-kernel-stock-v2: 14968 (SE +/- 58.71, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 3.61101 (SE +/- 0.00224, N = 3; MIN: 3.52)
xanmod-kernel-optimized: 3.47507 (SE +/- 0.00233, N = 3; MIN: 3.35)
xanmod-kernel-stock-v2: 3.60538 (SE +/- 0.00029, N = 3; MIN: 3.52)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, Fewer Is Better)
stock-linux-kernel: 4.68846 (SE +/- 0.00217, N = 3; MIN: 4.55)
xanmod-kernel-optimized: 4.51250 (SE +/- 0.00986, N = 3; MIN: 4.29)
xanmod-kernel-stock-v2: 4.66477 (SE +/- 0.00522, N = 3; MIN: 4.52)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Darktable

Test: Server Rack - Acceleration: CPU-only

Darktable 3.0.2 (Seconds, Fewer Is Better)
stock-linux-kernel: 0.140 (SE +/- 0.002, N = 4)
xanmod-kernel-optimized: 0.143 (SE +/- 0.002, N = 3)
xanmod-kernel-stock-v2: 0.138 (SE +/- 0.000, N = 3)

Java SciMark

Computational Test: Jacobi Successive Over-Relaxation

Java SciMark 2.0 (Mflops, More Is Better)
stock-linux-kernel: 1972.21 (SE +/- 26.34, N = 4)
xanmod-kernel-optimized: 1975.61 (SE +/- 2.63, N = 4)
xanmod-kernel-stock-v2: 1965.14 (SE +/- 31.69, N = 4)

Java SciMark

Computational Test: Dense LU Matrix Factorization

Java SciMark 2.0 (Mflops, More Is Better)
stock-linux-kernel: 6768.17 (SE +/- 72.44, N = 4)
xanmod-kernel-optimized: 6892.47 (SE +/- 15.70, N = 4)
xanmod-kernel-stock-v2: 6940.61 (SE +/- 80.90, N = 4)

Java SciMark

Computational Test: Sparse Matrix Multiply

Java SciMark 2.0 (Mflops, More Is Better)
stock-linux-kernel: 2720.27 (SE +/- 26.42, N = 4)
xanmod-kernel-optimized: 2700.25 (SE +/- 12.34, N = 4)
xanmod-kernel-stock-v2: 2701.65 (SE +/- 37.81, N = 4)

Java SciMark

Computational Test: Fast Fourier Transform

Java SciMark 2.0 (Mflops, More Is Better)
stock-linux-kernel: 2030.91 (SE +/- 6.68, N = 4)
xanmod-kernel-optimized: 2076.80 (SE +/- 3.49, N = 4)
xanmod-kernel-stock-v2: 2067.65 (SE +/- 22.09, N = 4)

Java SciMark

Computational Test: Monte Carlo

Java SciMark 2.0 (Mflops, More Is Better)
stock-linux-kernel: 1704.38 (SE +/- 20.11, N = 4)
xanmod-kernel-optimized: 1701.68 (SE +/- 3.71, N = 4)
xanmod-kernel-stock-v2: 1700.61 (SE +/- 19.26, N = 4)


Phoronix Test Suite v10.8.4