Intel Kernel Scaling Optimizations On AMD Genoa

AMD EPYC 9654 benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2304019-NE-INTELKERN69

Processor (Clear Linux runs):
3 Threads: AMD EPYC 9654 96-Core @ 2.40GHz (3 Cores)
6 Threads: AMD EPYC 9654 96-Core @ 2.40GHz (6 Cores)
12 Threads: AMD EPYC 9654 96-Core @ 2.40GHz (12 Cores)
24 Threads: AMD EPYC 9654 96-Core @ 2.40GHz (24 Cores)
48 Threads: AMD EPYC 9654 96-Core @ 2.40GHz (48 Cores)
96 Threads: AMD EPYC 9654 96-Core @ 2.40GHz (96 Cores)
192 Threads: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores)
384 Threads: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)

Motherboard: AMD Titanite_4G (RTI1004D BIOS)
Chipset: AMD Device 14a4
Memory: 1520GB
Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007
Graphics: ASPEED
Network: Broadcom NetXtreme BCM5720 PCIe
OS: Clear Linux OS 38660
Kernel: 6.2.8-1293.native (x86_64)
Display Server: X Server
Compiler: GCC 12.2.1 20230323 releases/gcc-12.2.0-616-g1b6b7f214c + Clang 15.0.7 + LLVM 15.0.7
File-System: ext4
Screen Resolution: 800x600

Kernel Details
- Transparent Huge Pages: always

Environment Details
- FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z,now -Wl,-z,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags"
- CXXFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z,now -Wl,-z,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mrelax-cmpxchg-loop -fvisibility-inlines-hidden -Wl,--enable-new-dtags -std=gnu++17"
- FCFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z,now -Wl,-z,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags"
- CFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z,now -Wl,-z,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mrelax-cmpxchg-loop"
- THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx""

Compiler Details
- --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-gnu-indirect-function --enable-host-shared --enable-languages=c,c++,fortran,go,jit --enable-ld=default --enable-libstdcxx-pch --enable-linux-futex --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=x86-64-v3 --with-gcc-major-version-only --with-glibc-version=2.35 --with-gnu-ld --with-isl --with-pic --with-ppl=yes --with-tune=sapphirerapids --with-zstd

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xa101111

Python Details
- Python 3.11.2

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

Clear Linux | 3 Threads | 6 Threads | 12 Threads | 24 Threads | 48 Threads | 96 Threads | 192 Threads | 384 Threads
blosc: blosclz shuffle | 14864.8 | 19957.9 | 24813.7 | 30906.4 | 32904.2 | 30049.0 | 19541.3 | 15348.2
blosc: blosclz bitshuffle | 6726.1 | 9456.5 | 12388.2 | 16925.6 | 18569.1 | 20160.1 | 12893.2 | 11685.3
openvkl: vklBenchmark ISPC | 48 | 92 | 177 | 326 | 573 | 864 | 1154 | 1396
build-llvm: Ninja | 4281.647 | 2301.146 | 1191.612 | 642.757 | 374.526 | 270.049 | 202.865 | 192.727
memcached: 1:5 | 756342.52 | 1525644.94 | 2036420.91 | 3318242.60 | 3988403.90 | 4000099.84 | 2591881.06 | 2621772.86
memcached: 1:10 | 737199.55 | 1444643.69 | 2231958.84 | 3257852.77 | 4907076.23 | 4894915.90 | 4273958.34 | 4517240.60
memcached: 1:100 | 731869.93 | 1460141.39 | 2418070.75 | 3753535.61 | 4227589.88 | 4533164.72 | 4723725.63 | 8688154.17
gromacs: MPI CPU - water_GMX50_bare | 0.577 | 1.030 | 1.987 | 4.079 | 7.193 | 10.828 | 19.355 | 19.376
daphne: OpenMP - NDT Mapping | 1497.72 | 1693.36 | 1845.25 | 1936.46 | 1625.86 | 1101.07 | 872.17 | 903.84
daphne: OpenMP - Points2Image | 55251.846007773 | 56002.322156404 | 49536.079534849 | 41767.700708296 | 37128.039250575 | 32428.172070894 | 21042.204897157 | 14661.942191260
daphne: OpenMP - Euclidean Cluster | 1580.05 | 1784.30 | 1876.90 | 1908.98 | 1901.43 | 1860.38 | 1737.14 | 1667.14
mysqlslap: 512 | 545 | 546 | 690 | 941 | 948 | 901 | 663 | 677
mysqlslap: 1024 | 277 | 500 | 669 | 880 | 934 | 906 | 663 | 676
mysqlslap: 2048 | 268 | 363 | 614 | 778 | 871 | 848 | 670 | 705
mysqlslap: 4096 | 264 | 307 | 497 | 541 | 574 | 525 | 652 | 672
mysqlslap: 8192 | 258 | 304 | 403 | 464 | 450 | 437 | 538 | 613
pgbench: 1000 - 500 - Read Only | 144145 | 291994 | 521778 | 1003237 | 1482735 | 1101748 | 997079 | 1011900
pgbench: 1000 - 500 - Read Only - Average Latency | 3.469 | 1.712 | 0.958 | 0.499 | 0.338 | 0.456 | 0.504 | 0.498
pgbench: 1000 - 800 - Read Only | 131518 | 276749 | 479176 | 901699 | 1387712 | 1032093 | 923833 | 897905
pgbench: 1000 - 800 - Read Only - Average Latency | 6.083 | 2.891 | 1.670 | 0.887 | 0.577 | 0.776 | 0.871 | 0.895
pgbench: 1000 - 1000 - Read Only | 123508 | 266669 | 458379 | 845018 | 1292521 | 1078086 | 858379 | 913461
pgbench: 1000 - 1000 - Read Only - Average Latency | 8.098 | 3.750 | 2.182 | 1.184 | 0.775 | 0.928 | 1.167 | 1.100
pgbench: 1000 - 500 - Read Write | 13926 | 26276 | 39879 | 61137 | 78760 | 87562 | 59595 | 72418
pgbench: 1000 - 500 - Read Write - Average Latency | 35.909 | 19.029 | 12.539 | 8.180 | 6.348 | 5.710 | 8.475 | 6.905
pgbench: 1000 - 800 - Read Write | 12107 | 21908 | 34768 | 51593 | 73560 | 82701 | 60114 | 67011
pgbench: 1000 - 800 - Read Write - Average Latency | 66.080 | 36.526 | 23.010 | 15.507 | 10.876 | 9.675 | 13.429 | 11.939
pgbench: 1000 - 1000 - Read Write | 11332 | 20385 | 31482 | 47232 | 68948 | 80760 | 55407 | 64716
pgbench: 1000 - 1000 - Read Write - Average Latency | 88.250 | 49.059 | 31.766 | 21.172 | 14.505 | 12.386 | 18.148 | 15.453
stress-ng: Poll | 207642.31 | 433822.35 | 855199.57 | 1741737.99 | 3427214.56 | 6738430.40 | 13429013.79 | 27368220.78
stress-ng: Malloc | 746208.28 | 2245686.96 | 8631852.17 | 29798829.20 | 101730074.72 | 259187800.48 | 455696317.99 | 605020044.78
stress-ng: Semaphores | 274742.51 | 568067.97 | 1140550.30 | 2441085.44 | 4869808.41 | 9675139.27 | 18664446.03 | 38002306.16
stress-ng: Context Switching | 1598345.83 | 3157050.10 | 6465487.69 | 12990582.76 | 25991362.22 | 48227338.57 | 97933563.12 | 128197099.80
rocksdb: Rand Fill | 800312 | 1163009 | 875021 | 1092205 | 1271973 | 1357609 | 523630 | 488877
rocksdb: Rand Read | 25468762 | 50566622 | 101456043 | 198930081 | 392040947 | 636548689 | 1291741570 | 1133784072
rocksdb: Update Rand | 350254 | 627962 | 615374 | 746843 | 965084 | 1210840 | 512007 | 476635
rocksdb: Read While Writing | 384607 | 896400 | 1567390 | 2816536 | 4353821 | 5787183 | 7106406 | 12183225
rocksdb: Read Rand Write Rand | 648929 | 1213406 | 1694829 | 2556648 | 3422543 | 3652165 | 2019952 | 1767185
onnx: GPT-2 - CPU - Standard (Inferences Per Second) | 98.3661 | 96.1068 | 121.611 | 132.405 | 130.736 | 128.128 | 122.3694 | 96.9138
onnx: GPT-2 - CPU - Standard (Inference Time Cost, ms) | 10.1636 | 10.402 | 8.22016 | 7.55121 | 7.64840 | 7.80381 | 8.27066 | 10.3167
onnx: fcn-resnet101-11 - CPU - Standard (Inferences Per Second) | 0.638233 | 1.22205 | 2.14861 | 3.67976 | 5.51762 | 6.16716 | 4.86504 | 4.61936
onnx: fcn-resnet101-11 - CPU - Standard (Inference Time Cost, ms) | 1566.83 | 818.292 | 465.414 | 271.755 | 181.236 | 162.151 | 205.550 | 216.481
onnx: ArcFace ResNet-100 - CPU - Standard (Inferences Per Second) | 13.0371 | 22.7591 | 26.1130 | 30.1860 | 29.8837 | 31.9389 | 22.9787 | 15.9714
onnx: ArcFace ResNet-100 - CPU - Standard (Inference Time Cost, ms) | 76.7024 | 43.9369 | 38.2934 | 33.2021 | 33.5251 | 31.3153 | 43.5198 | 62.6187
onnx: super-resolution-10 - CPU - Standard (Inferences Per Second) | 42.0502 | 80.1559 | 118.488 | 122.641 | 120.736 | 121.149 | 122.386 | 93.7191
onnx: super-resolution-10 - CPU - Standard (Inference Time Cost, ms) | 23.7803 | 12.4750 | 8.43905 | 8.15341 | 8.28249 | 8.25477 | 8.17074 | 10.6737
dragonflydb: 50 - 1:1 | 1675279.58 | 2940540.92 | 3466341.22 | 4403726.76 | 5423657.30 | 6281327.92 | 6070240.20 | -
dragonflydb: 50 - 1:5 | 1846162.52 | 3122983.64 | 3577457.30 | 4558138.92 | 5646481.60 | 6525049.87 | 6279681.05 | -
dragonflydb: 200 - 1:1 | 1395674.14 | 2838943.13 | 3344564.18 | 4338063.67 | 5763835.97 | - | - | -
dragonflydb: 200 - 1:5 | 1528577.22 | 2991716.59 | 3544883.15 | 4485799.54 | 5965009.40 | - | - | -

C-Blosc

Test: blosclz shuffle

C-Blosc 2.3, MB/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 14865, Avg 23548, Max 32904.

C-Blosc

Test: blosclz bitshuffle

C-Blosc 2.3, MB/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 6726, Avg 13601, Max 20160.

OpenVKL

Benchmark: vklBenchmark ISPC

OpenVKL 1.3.1, Items / Sec (More Is Better). Clear Linux, 3 to 384 Threads: Min 48, Avg 579, Max 1396.

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0, Seconds (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 193, Avg 1182, Max 4282.

Memcached

Set To Get Ratio: 1:5

Memcached 1.6.19, Ops/sec (More Is Better). Clear Linux, 3 to 384 Threads: Min 756343, Avg 2604851, Max 4000100.

Memcached

Set To Get Ratio: 1:10

Memcached 1.6.19, Ops/sec (More Is Better). Clear Linux, 3 to 384 Threads: Min 737200, Avg 3283106, Max 4907076.

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19, Ops/sec (More Is Better). Clear Linux, 3 to 384 Threads: Min 731870, Avg 3817032, Max 8688154.

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023, Ns Per Day (More Is Better). Clear Linux, 3 to 384 Threads: Min 0.6, Avg 8.1, Max 19.4.

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

Darmstadt Automotive Parallel Heterogeneous Suite 2021.11.02, Test Cases Per Minute (More Is Better). Clear Linux, 3 to 384 Threads: Min 872, Avg 1434, Max 1936.

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

Darmstadt Automotive Parallel Heterogeneous Suite 2021.11.02, Test Cases Per Minute (More Is Better). Clear Linux, 3 to 384 Threads: Min 14662, Avg 38477, Max 56002.

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Euclidean Cluster

Darmstadt Automotive Parallel Heterogeneous Suite 2021.11.02, Test Cases Per Minute (More Is Better). Clear Linux, 3 to 384 Threads: Min 1580, Avg 1790, Max 1909.

MariaDB

Clients: 512

MariaDB 11.0.1, Queries Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 545, Avg 739, Max 948.

MariaDB

Clients: 1024

MariaDB 11.0.1, Queries Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 277, Avg 688, Max 934.

MariaDB

Clients: 2048

MariaDB 11.0.1, Queries Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 268, Avg 640, Max 871.

MariaDB

Clients: 4096

MariaDB 11.0.1, Queries Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 264, Avg 504, Max 672.

MariaDB

Clients: 8192

MariaDB 11.0.1, Queries Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 258.0, Avg 433.4, Max 613.0.

PostgreSQL

Scaling Factor: 1000 - Clients: 500 - Mode: Read Only

PostgreSQL 15, TPS (More Is Better). Clear Linux, 3 to 384 Threads: Min 144145, Avg 819327, Max 1482735.

PostgreSQL

Scaling Factor: 1000 - Clients: 500 - Mode: Read Only - Average Latency

PostgreSQL 15, ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 0.3, Avg 1.1, Max 3.5.

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only

PostgreSQL 15, TPS (More Is Better). Clear Linux, 3 to 384 Threads: Min 131518, Avg 753836, Max 1387712.

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Only - Average Latency

PostgreSQL 15, ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 0.6, Avg 1.8, Max 6.1.

PostgreSQL

Scaling Factor: 1000 - Clients: 1000 - Mode: Read Only

PostgreSQL 15, TPS (More Is Better). Clear Linux, 3 to 384 Threads: Min 123508, Avg 729503, Max 1292521.

PostgreSQL

Scaling Factor: 1000 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 15, ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 0.8, Avg 2.4, Max 8.1.

PostgreSQL

Scaling Factor: 1000 - Clients: 500 - Mode: Read Write

PostgreSQL 15, TPS (More Is Better). Clear Linux, 3 to 384 Threads: Min 13926, Avg 54944, Max 87562.

PostgreSQL

Scaling Factor: 1000 - Clients: 500 - Mode: Read Write - Average Latency

PostgreSQL 15, ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 5.7, Avg 12.9, Max 35.9.

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write

PostgreSQL 15, TPS (More Is Better). Clear Linux, 3 to 384 Threads: Min 12107, Avg 50470, Max 82701.

PostgreSQL

Scaling Factor: 1000 - Clients: 800 - Mode: Read Write - Average Latency

PostgreSQL 15, ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 9.7, Avg 23.4, Max 66.1.

PostgreSQL

Scaling Factor: 1000 - Clients: 1000 - Mode: Read Write

PostgreSQL 15, TPS (More Is Better). Clear Linux, 3 to 384 Threads: Min 11332, Avg 47533, Max 80760.

PostgreSQL

Scaling Factor: 1000 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 15, ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 12.4, Avg 31.3, Max 88.3.

Stress-NG

Test: Poll

Stress-NG 0.15.06, Bogo Ops/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 207642, Avg 6775160, Max 27368221.

Stress-NG

Test: Malloc

Stress-NG 0.15.06, Bogo Ops/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 746208, Avg 182882102, Max 605020045.

Stress-NG

Test: Semaphores

Stress-NG 0.15.06, Bogo Ops/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 274743, Avg 9454518, Max 38002306.

Stress-NG

Test: Context Switching

Stress-NG 0.15.06, Bogo Ops/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 1598346, Avg 40570104, Max 128197100.

RocksDB

Test: Random Fill

RocksDB 8.0, Op/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 488877, Avg 946580, Max 1357609.

RocksDB

Test: Random Read

RocksDB 8.0, Op/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 25468762, Avg 478817098, Max 1291741570.

RocksDB

Test: Update Random

RocksDB 8.0, Op/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 350254, Avg 688125, Max 1210840.

RocksDB

Test: Read While Writing

RocksDB 8.0, Op/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 384607, Avg 4386946, Max 12183225.

RocksDB

Test: Read Random Write Random

RocksDB 8.0, Op/s (More Is Better). Clear Linux, 3 to 384 Threads: Min 648929, Avg 2121957, Max 3652165.

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inferences Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 96.1, Avg 115.8, Max 132.4.

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inference Time Cost in ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 7.6, Avg 8.8, Max 10.4.

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inferences Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 0.6, Avg 3.6, Max 6.2.

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inference Time Cost in ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 162, Avg 486, Max 1567.

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inferences Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 13.0, Avg 24.1, Max 31.9.

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inference Time Cost in ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 31.3, Avg 45.4, Max 76.7.

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inferences Per Second (More Is Better). Clear Linux, 3 to 384 Threads: Min 42.1, Avg 102.7, Max 122.6.

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

ONNX Runtime 1.14, Inference Time Cost in ms (Fewer Is Better). Clear Linux, 3 to 384 Threads: Min 8.2, Avg 11.0, Max 23.8.

Dragonflydb

Clients: 50 - Set To Get Ratio: 1:1

Dragonflydb 0.6, Ops/sec (More Is Better). Clear Linux, 3 to 192 Threads: Min 1675280, Avg 4323016, Max 6281328.

Dragonflydb

Clients: 50 - Set To Get Ratio: 1:5

Dragonflydb 0.6, Ops/sec (More Is Better). Clear Linux, 3 to 192 Threads: Min 1846163, Avg 4507994, Max 6525050.

Dragonflydb

Clients: 200 - Set To Get Ratio: 1:1

Dragonflydb 0.6, Ops/sec (More Is Better). Clear Linux, 3 to 48 Threads: Min 1395674, Avg 3536216, Max 5763836.

Dragonflydb

Clients: 200 - Set To Get Ratio: 1:5

Dragonflydb 0.6, Ops/sec (More Is Better). Clear Linux, 3 to 48 Threads: Min 1528577, Avg 3703197, Max 5965009.


Phoronix Test Suite v10.8.4