2990WX March

AMD Ryzen Threadripper 2990WX 32-Core testing with a ASUS ROG ZENITH EXTREME (1701 BIOS) and Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2103152-HA-2990WXMAR69.

2990WX MarchProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen Resolution123AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1701 BIOS)AMD 17h32GBSamsung SSD 970 EVO 500GB + 250GB Western Digital WDS250G2X0C-00L350Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB (1244/1750MHz)Realtek ALC1220LG Ultra HDIntel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adUbuntu 20.105.8.0-44-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.94.6 Mesa 20.2.1 (LLVM 11.0.0)1.2.131GCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800820dPython Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

2990WX Marchsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDjpegxl: PNG - 5jpegxl: PNG - 7jpegxl: PNG - 8jpegxl: JPEG - 5jpegxl: JPEG - 7jpegxl: JPEG - 8jpegxl-decode: 1jpegxl-decode: Allsrslte: OFDM_Testsrslte: PHY_DL_Testsrslte: PHY_DL_Testluaradio: Five Back to Back FIR Filtersluaradio: FM Deemphasis Filterluaradio: Hilbert Transformluaradio: Complex Phasegnuradio: Five Back to Back FIR Filtersgnuradio: Signal Source (Cosine)gnuradio: FIR Filtergnuradio: IIR Filtergnuradio: FM Deemphasis Filtergnuradio: Hilbert Transformaom-av1: Speed 0 Two-Passaom-av1: Speed 4 Two-Passaom-av1: Speed 6 Realtimeaom-av1: Speed 6 Two-Passaom-av1: Speed 8 Realtimeonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUliquid-dsp: 1 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 8 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 64 - 256 - 57astcenc: Mediumastcenc: Thoroughastcenc: Exhaustivebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3sysbench: RAM / Memorysysbench: CPU1232.280.922.963.3258.848.260.7253.2653.4722.9132.45173.9182300000227.986.2493.1405.098.1570.6369.93139.8601.9575.8809.1411.90.214.5117.2413.3352.036.6819412.45042.511773.5752120.40346.960315.9356725.46452.631223.8001613898.83886.8613965.63918.541.9161714208.13947.671.5598860627667120953333236713333460310000835613333147776666716169666675.20656.394445.642027.4347.49315.67125.0589.06238.0155.8714.31048.0486791.1257045.062.290.922.973.3358.358.260.7153.6054.0823.4932.59174.3582266667228.786.2488.1403.297.4565.6384.13176.9599.5584.0804.0410.30.214.4717.1513.3853.326.6814512.09622.528783.5922420.31166.966715.9299225.35402.629573.7918513707.43917.8013883.63897.241.8911214041.23911.831.5568359917000119076667236780000460586667835380000147463333316166000005.19206.387146.010927.1087.47715.72325.1638.78338.1515.9104.23448.0826809.3157054.822.290.922.973.3358.678.250.7252.8553.1923.3032.55174.9582733333229.186.5491.9404.797.4567.3373.63144.5607.0580.3791.9412.40.214.5017.1913.3353.856.7247612.52692.537473.6045420.40696.987315.9354225.41722.627733.7890413632.53817.6413706.63912.251.7670613829.23928.481.5651160272667120150000237340000460676667836323333147886666716147000005.21636.401545.710127.5167.49515.79225.1608.91237.5615.7184.04147.2106733.7057053.00OpenBenchmarking.org

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya1230.51531.03061.54592.06122.5765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.282.292.291. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom1230.2070.4140.6210.8281.035SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.920.920.921. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets1230.66831.33662.00492.67323.3415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.962.972.971. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID1230.74931.49862.24792.99723.7465SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.323.333.331. (CXX) g++ options: -O3 -pthread

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 51231326395265SE +/- 0.09, N = 3SE +/- 0.38, N = 3SE +/- 0.89, N = 358.8458.3558.671. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 7123246810SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 38.268.268.251. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 81230.1620.3240.4860.6480.81SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.720.710.721. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 51231224364860SE +/- 0.28, N = 3SE +/- 0.61, N = 3SE +/- 0.76, N = 353.2653.6052.851. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 71231224364860SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.40, N = 353.4754.0853.191. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 8123612182430SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.15, N = 322.9123.4923.301. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -fPIE -pie -ldl

JPEG XL Decoding

CPU Threads: 1

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.3CPU Threads: 1123816243240SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 332.4532.5932.55

JPEG XL Decoding

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.3CPU Threads: All1234080120160200SE +/- 0.36, N = 3SE +/- 0.60, N = 3SE +/- 0.29, N = 3173.91174.35174.95

srsLTE

Test: OFDM_Test

OpenBenchmarking.orgSamples / Second, More Is BettersrsLTE 20.10.1Test: OFDM_Test12320M40M60M80M100MSE +/- 57735.03, N = 3SE +/- 166666.67, N = 3SE +/- 133333.33, N = 38230000082266667827333331. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgeNb Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test12350100150200250SE +/- 1.36, N = 3SE +/- 1.76, N = 3SE +/- 0.86, N = 3227.9228.7229.11. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgUE Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Test12320406080100SE +/- 0.48, N = 3SE +/- 0.43, N = 3SE +/- 0.37, N = 386.286.286.51. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

LuaRadio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filters123110220330440550SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 1.14, N = 3493.1488.1491.9

LuaRadio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filter12390180270360450SE +/- 0.51, N = 3SE +/- 2.02, N = 3SE +/- 0.90, N = 3405.0403.2404.7

LuaRadio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transform12320406080100SE +/- 0.17, N = 3SE +/- 0.52, N = 3SE +/- 0.45, N = 398.197.497.4

LuaRadio

Test: Complex Phase

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phase123120240360480600SE +/- 0.23, N = 3SE +/- 2.48, N = 3SE +/- 3.02, N = 3570.6565.6567.3

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filters12380160240320400SE +/- 5.54, N = 9SE +/- 3.80, N = 3SE +/- 4.94, N = 9369.9384.1373.61. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)1237001400210028003500SE +/- 11.69, N = 9SE +/- 18.55, N = 3SE +/- 12.27, N = 93139.83176.93144.51. 3.8.1.0

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filter123130260390520650SE +/- 4.29, N = 9SE +/- 2.03, N = 3SE +/- 1.71, N = 9601.9599.5607.01. 3.8.1.0

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filter123130260390520650SE +/- 3.61, N = 9SE +/- 1.47, N = 3SE +/- 1.18, N = 9575.8584.0580.31. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filter1232004006008001000SE +/- 2.77, N = 9SE +/- 2.87, N = 3SE +/- 11.33, N = 9809.1804.0791.91. 3.8.1.0

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transform12390180270360450SE +/- 2.03, N = 9SE +/- 1.57, N = 3SE +/- 1.42, N = 9411.9410.3412.41. 3.8.1.0

AOM AV1

Encoder Mode: Speed 0 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 0 Two-Pass1230.04730.09460.14190.18920.2365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.210.210.211. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 4 Two-Pass1231.01482.02963.04444.05925.074SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 34.514.474.501. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Realtime12348121620SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 317.2417.1517.191. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 6 Two-Pass1233691215SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 313.3313.3813.331. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.1-rcEncoder Mode: Speed 8 Realtime1231224364860SE +/- 0.83, N = 3SE +/- 0.82, N = 3SE +/- 0.00, N = 352.0353.3253.851. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU123246810SE +/- 0.09926, N = 3SE +/- 0.02263, N = 3SE +/- 0.01807, N = 36.681946.681456.72476MIN: 5.95MIN: 5.95MIN: 5.91. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.13, N = 15SE +/- 0.16, N = 4SE +/- 0.14, N = 1512.4512.1012.53MIN: 11.43MIN: 11.23MIN: 11.331. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.57091.14181.71272.28362.8545SE +/- 0.00876, N = 3SE +/- 0.02431, N = 3SE +/- 0.02758, N = 32.511772.528782.53747MIN: 2.32MIN: 2.33MIN: 2.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.8111.6222.4333.2444.055SE +/- 0.00442, N = 3SE +/- 0.00737, N = 3SE +/- 0.01752, N = 33.575213.592243.60454MIN: 3.17MIN: 3.18MIN: 1.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.01, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 320.4020.3120.41MIN: 19.97MIN: 18.74MIN: 19.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU123246810SE +/- 0.01479, N = 3SE +/- 0.01848, N = 3SE +/- 0.02797, N = 36.960316.966716.98731MIN: 6.16MIN: 6.12MIN: 6.091. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1231.33552.6714.00655.3426.6775SE +/- 0.01143, N = 3SE +/- 0.00690, N = 3SE +/- 0.00102, N = 35.935675.929925.93542MIN: 5.52MIN: 5.5MIN: 5.491. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 325.4625.3525.42MIN: 24.65MIN: 24.3MIN: 24.21. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.5921.1841.7762.3682.96SE +/- 0.00877, N = 3SE +/- 0.00861, N = 3SE +/- 0.00732, N = 32.631222.629572.62773MIN: 2.5MIN: 2.5MIN: 2.511. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.8551.712.5653.424.275SE +/- 0.00430, N = 3SE +/- 0.00167, N = 3SE +/- 0.00762, N = 33.800163.791853.78904MIN: 3.72MIN: 3.71MIN: 3.711. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1233K6K9K12K15KSE +/- 154.68, N = 7SE +/- 145.74, N = 7SE +/- 176.85, N = 513898.813707.413632.5MIN: 13032.7MIN: 12922.4MIN: 12827.21. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1238001600240032004000SE +/- 17.43, N = 3SE +/- 27.48, N = 3SE +/- 39.34, N = 33886.863917.803817.64MIN: 3655.72MIN: 3769.84MIN: 3695.791. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1233K6K9K12K15KSE +/- 178.47, N = 5SE +/- 182.34, N = 5SE +/- 149.77, N = 1213965.613883.613706.6MIN: 13115.7MIN: 12751.7MIN: 12260.31. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1238001600240032004000SE +/- 20.29, N = 3SE +/- 11.92, N = 3SE +/- 15.33, N = 33918.543897.243912.25MIN: 3855MIN: 3835.75MIN: 3811.141. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.43110.86221.29331.72442.1555SE +/- 0.05033, N = 15SE +/- 0.06076, N = 15SE +/- 0.06494, N = 151.916171.891121.76706MIN: 1.23MIN: 1.25MIN: 1.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1233K6K9K12K15KSE +/- 92.65, N = 3SE +/- 125.70, N = 3SE +/- 92.53, N = 314208.114041.213829.2MIN: 13948.2MIN: 13720.3MIN: 136211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1238001600240032004000SE +/- 27.19, N = 3SE +/- 8.79, N = 3SE +/- 15.75, N = 33947.673911.833928.48MIN: 3865.39MIN: 3788.89MIN: 3822.061. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.35210.70421.05631.40841.7605SE +/- 0.00396, N = 3SE +/- 0.00329, N = 3SE +/- 0.00128, N = 31.559881.556831.56511MIN: 1.43MIN: 1.43MIN: 1.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 5712313M26M39M52M65MSE +/- 6385.75, N = 3SE +/- 65592.17, N = 3SE +/- 229485.17, N = 36062766759917000602726671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 5712330M60M90M120M150MSE +/- 8819.17, N = 3SE +/- 524859.77, N = 3SE +/- 165025.25, N = 31209533331190766671201500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 5712350M100M150M200M250MSE +/- 161279.61, N = 3SE +/- 528488.41, N = 3SE +/- 585946.53, N = 32367133332367800002373400001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57123100M200M300M400M500MSE +/- 920887.25, N = 3SE +/- 668539.04, N = 3SE +/- 870868.02, N = 34603100004605866674606766671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57123200M400M600M800M1000MSE +/- 414782.41, N = 3SE +/- 1629918.20, N = 3SE +/- 670232.13, N = 38356133338353800008363233331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57123300M600M900M1200M1500MSE +/- 2370185.18, N = 3SE +/- 5967783.88, N = 3SE +/- 4056818.68, N = 31477766667147463333314788666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57123300M600M900M1200M1500MSE +/- 1354416.64, N = 3SE +/- 1026320.29, N = 3SE +/- 1193035.34, N = 31616966667161660000016147000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Medium1231.17372.34743.52114.69485.8685SE +/- 0.0362, N = 3SE +/- 0.0034, N = 3SE +/- 0.0222, N = 35.20655.19205.21631. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thorough123246810SE +/- 0.0026, N = 3SE +/- 0.0067, N = 3SE +/- 0.0153, N = 36.39446.38716.40151. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustive1231020304050SE +/- 0.05, N = 3SE +/- 0.30, N = 3SE +/- 0.06, N = 345.6446.0145.711. (CXX) g++ options: -O3 -flto -pthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1S123612182430SE +/- 0.27, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 327.4327.1127.521. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0123246810SE +/- 0.013, N = 3SE +/- 0.011, N = 3SE +/- 0.038, N = 37.4937.4777.4951. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 212348121620SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 315.6715.7215.791. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 3123612182430SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 325.0625.1625.161. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.01233691215SE +/- 0.095, N = 15SE +/- 0.105, N = 15SE +/- 0.108, N = 39.0628.7838.912MIN: 8.06 / MAX: 18.98MIN: 8 / MAX: 22.86MIN: 8.28 / MAX: 17.911. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50123918273645SE +/- 0.30, N = 15SE +/- 0.34, N = 15SE +/- 0.53, N = 338.0238.1537.56MIN: 35.17 / MAX: 122.04MIN: 35.3 / MAX: 124.03MIN: 35.2 / MAX: 119.581. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_2241231.32982.65963.98945.31926.649SE +/- 0.069, N = 15SE +/- 0.058, N = 15SE +/- 0.211, N = 35.8715.9105.718MIN: 5.4 / MAX: 14.78MIN: 5.42 / MAX: 7.12MIN: 5.38 / MAX: 6.461. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.01230.96981.93962.90943.87924.849SE +/- 0.063, N = 15SE +/- 0.056, N = 15SE +/- 0.114, N = 34.3104.2344.041MIN: 3.38 / MAX: 40.78MIN: 3.39 / MAX: 44.28MIN: 3.38 / MAX: 29.441. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v31231122334455SE +/- 0.34, N = 15SE +/- 0.26, N = 15SE +/- 0.37, N = 348.0548.0847.21MIN: 44.63 / MAX: 136.06MIN: 45.19 / MAX: 105.01MIN: 43.85 / MAX: 97.151. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory12315003000450060007500SE +/- 64.27, N = 3SE +/- 32.29, N = 3SE +/- 33.94, N = 36791.126809.316733.701. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU12312K24K36K48K60KSE +/- 2.08, N = 3SE +/- 3.63, N = 3SE +/- 1.70, N = 357045.0657054.8257053.001. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm


Phoronix Test Suite v10.8.4