TR 2990WX 2020

AMD Ryzen Threadripper 2990WX 32-Core testing with a ASUS ROG ZENITH EXTREME (1701 BIOS) and Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012260-HA-TR2990WX254&grs.

TR 2990WX 2020ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1701 BIOS)AMD 17h32GBSamsung SSD 970 EVO 500GB + 250GB Western Digital WDS250G2X0C-00L350Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB (1244/1750MHz)Realtek ALC1220LG Ultra HDIntel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adUbuntu 20.105.8.0-33-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.6 Mesa 20.2.1 (LLVM 11.0.0)1.2.131GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800820dGraphics Details- GLAMORJava Details- OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

TR 2990WX 2020build-ffmpeg: Time To Compilecompress-zstd: 19stockfish: Total Timeyquake2: OpenGL 1.x - 1920 x 1080x265: Bosphorus 4Kredis: SADDsunflow: Global Illumination + Image Synthesisredis: SETredis: GETonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUhpcc: Rand Ring Bandwidthhpcc: G-Fftevkmark: 1280 x 1024compress-lz4: 9 - Compression Speedhpcc: G-HPLai-benchmark: Device Training Scorenode-web-tooling: asmfish: 1024 Hash Memory, 26 Depthonednn: Recurrent Neural Network Inference - u8s8f32 - CPUvkmark: 1920 x 1080simdjson: LargeRandembree: Pathtracer - Crownembree: Pathtracer - Asian Dragononednn: Recurrent Neural Network Training - f32 - CPUhpcc: G-Rand Accessyquake2: OpenGL 3.x - 1920 x 1080embree: Pathtracer ISPC - Crownredis: LPUSHai-benchmark: Device AI Scorelammps: 20k Atomssimdjson: PartialTweetsx265: Bosphorus 1080ponednn: IP Shapes 3D - u8s8f32 - CPUembree: Pathtracer ISPC - Asian Dragonrav1e: 1libplacebo: av1_grain_lapcompress-lz4: 1 - Decompression Speedbrl-cad: VGR Performance Metrichmmer: Pfam Database Searchonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUgromacs: Water Benchmarkcompress-lz4: 1 - Compression Speedhpcc: Max Ping Pong Bandwidthindigobench: CPU - Bedroomcompress-lz4: 3 - Compression Speedcompress-lz4: 9 - Decompression Speedbasis: ETC1Sonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUai-benchmark: Device Inference Scorerav1e: 10espeak: Text-To-Speech Synthesisonednn: IP Shapes 1D - f32 - CPUastcenc: Thoroughonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUmafft: Multiple Sequence Alignment - LSU RNAonednn: Recurrent Neural Network Inference - f32 - CPUkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumnumpy: yquake2: Software CPU - 1920 x 1080onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 4K - Ultra Fastonednn: Deconvolution Batch shapes_1d - f32 - CPUcompress-lz4: 3 - Decompression Speedindigobench: CPU - Supercarbasis: UASTC Level 2 + RDO Post-Processingkvazaar: Bosphorus 1080p - Ultra Fastcrafty: Elapsed Timeencode-opus: WAV To Opus Encodekvazaar: Bosphorus 1080p - Mediumastcenc: Mediumvkfft: basis: UASTC Level 0clomp: Static OMP Speedupastcenc: Fastsqlite-speedtest: Timed Time - Size 1,000phpbench: PHP Benchmark Suitewaifu2x-ncnn: 2x - 3 - Yesastcenc: Exhaustivebasis: UASTC Level 3rav1e: 5rav1e: 6libplacebo: polar_nocomputekvazaar: Bosphorus 4K - Very Fastbuild-eigen: Time To Compileonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUencode-ape: WAV To APEbasis: UASTC Level 2coremark: CoreMark Size 666 - Iterations Per Secondlibplacebo: deband_heavykvazaar: Bosphorus 1080p - Very Fasthpcc: Rand Ring Latencybuild2: Time To Compileencode-wavpack: WAV To WavPackvkresample: 2x - Singlelibplacebo: hdr_peakdetectsimdjson: DistinctUserIDsimdjson: Kostyawaifu2x-ncnn: 2x - 3 - Noncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetredis: LPOPbuild-clash: Time To Compileembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer - Asian Dragon Objonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUcompress-zstd: 3lammps: Rhodopsin Proteinhpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMbetsy: ETC2 RGB - Highestbetsy: ETC1 - Highest12346.08433.949821254683.816.621892562.210.7891675529.382201882.0313991.111.46940.5712010.04669585046.6054.598379668.47738040013802.5243840.3924.019321.956413861.20.02819949.722.72591368828.88227015.5010.5141.023.4537021.31560.315448.249506.2286847163.93519.97341.469281.8348518.0410862.0415.03647.469106.947.60213826.213042.85930.8476.277159.543.4372712.1533749.1810.2910.47296.80106.63808.572.1240826.9939.682.995599128.211.035651.213116.8073295317.75327.696.3494127.59551.75.2267.20757722910.03873.3625.2290.9541.270268.7223.5493.2525.9260425.164314.02915.7851146059.294691192.5460.691.5368196.20013.19949.0121762.980.520.442.052101.0141.2850.2572.0633.5655.89103.4141.089.8219.1415.6414.9214.5815.5937.30102.9341.8950.1979.5337.7259.91102.3847.876.8320.0113.6015.2814.4415.6737.242431710.35463.22017.679118.24181.657673.696573606.713.3991.309664.0016412.6172712.69710.75430.91842.055404772619.915.261826296.850.7311654163.602099666.5213295.011.20850.583009.74083564447.5552.777539988.25748951873821.2442670.3924.615522.496613529.00.02798972.222.23651340079.29231715.4180.5040.363.4906421.66740.320441.379647.9291013162.56120.15381.485491.8588619.8410831.9744.99447.129217.247.60213783.313192.83031.1536.250119.493.4498012.2643782.3610.3810.56298.31107.13780.472.1089327.1339.673.017219191.910.963647.924116.0973713747.79527.846.3294517.62651.65.2267.45857936010.05573.1425.2170.9511.273268.3623.5792.9815.9385625.102614.01015.7901148000.044932192.4360.691.5345396.34013.21548.9961762.780.520.44100.2640.0150.5773.8632.3953.19100.7338.907.4019.2816.2614.6514.2216.3033.78101.4740.8049.6975.0334.3454.2992.9842.046.6720.9214.5014.4215.7015.6034.562058658.41477.08417.235719.29922.095673.761643798.912.9131.307714.341068.8030312.54310.51932.69936.254984012623.816.551974193.461577289.792091130.6713669.210.92850.593859.67646564045.9454.305078.21726307373711.9642680.4024.363622.265513753.90.02866972.522.64531366164.6715.1870.5141.063.5108721.56220.318446.389540.0161.63220.25431.489571.8378508.6410972.4945.05746.879130.548.16413668.42.85331.0286.216239.453.4690912.1573783.4010.3210.53295.79106.23777.652.1261126.9239.393.016459188.311.036647.089116.6873667017.77427.786.3594177.60051.85.2067.24710.07473.1025.1440.9531.274267.9023.6193.0585.9230025.104114.04115.7581146667.406114192.2260.791.5350949.0211762.550.520.44101.2739.1448.2370.9235.9356.35100.1738.327.0920.4913.7715.7515.9514.9734.39103.8939.1349.2573.6535.6964.4995.6142.957.6518.9714.1715.3015.2115.7834.911998586.16485.48517.399919.40142.009843.416993781.212.5511.338204.1742813.0341712.55310.529OpenBenchmarking.org

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile1231020304050SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.18, N = 346.0830.9232.70

Zstd Compression

Compression Level: 19

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 191231020304050SE +/- 0.37, N = 15SE +/- 0.34, N = 3SE +/- 0.39, N = 333.942.036.21. (CC) gcc options: -O3 -pthread -lz -llzma

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time12312M24M36M48M60MSE +/- 796222.48, N = 3SE +/- 577141.31, N = 3SE +/- 749500.39, N = 34982125455404772549840121. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

yquake2

Renderer: OpenGL 1.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 1920 x 1080123150300450600750SE +/- 4.44, N = 3SE +/- 6.33, N = 8SE +/- 5.54, N = 15683.8619.9623.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K12348121620SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 316.6215.2616.551. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123400K800K1200K1600K2000KSE +/- 20792.00, N = 15SE +/- 25792.40, N = 15SE +/- 16528.72, N = 31892562.211826296.851974193.461. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis120.17750.3550.53250.710.8875SE +/- 0.004, N = 3SE +/- 0.011, N = 150.7890.731MIN: 0.57 / MAX: 1.63MIN: 0.49 / MAX: 1.57

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 19958.53, N = 3SE +/- 23714.82, N = 4SE +/- 22941.42, N = 31675529.381654163.601577289.791. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 31653.62, N = 15SE +/- 31097.88, N = 15SE +/- 27660.05, N = 152201882.032099666.522091130.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1233K6K9K12K15KSE +/- 48.11, N = 3SE +/- 123.89, N = 10SE +/- 122.96, N = 1113991.113295.013669.2MIN: 13812.5MIN: 12213.4MIN: 12754.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.19, N = 12SE +/- 0.15, N = 15SE +/- 0.01, N = 311.4711.2110.93MIN: 10.94MIN: 10.67MIN: 10.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1230.13360.26720.40080.53440.668SE +/- 0.00203, N = 3SE +/- 0.01337, N = 3SE +/- 0.00691, N = 30.571200.583000.593851. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1233691215SE +/- 0.04480, N = 3SE +/- 0.17781, N = 3SE +/- 0.14488, N = 310.046699.740839.676461. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

VKMark

Resolution: 1280 x 1024

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1280 x 102412313002600390052006500SE +/- 2.19, N = 3SE +/- 5.90, N = 3SE +/- 2.33, N = 35850564456401. (CXX) g++ options: -ldl -pipe -std=c++14 -fPIC -MD -MQ -MF

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231122334455SE +/- 0.43, N = 15SE +/- 0.77, N = 3SE +/- 0.01, N = 346.6047.5545.941. (CC) gcc options: -O3

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1231224364860SE +/- 0.14, N = 3SE +/- 0.10, N = 3SE +/- 0.24, N = 354.6052.7854.311. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

AI Benchmark Alpha

Device Training Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training Score122004006008001000966998

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark123246810SE +/- 0.03, N = 3SE +/- 0.12, N = 4SE +/- 0.04, N = 38.478.258.211. Nodejs v12.18.2

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth12316M32M48M64M80MSE +/- 738089.20, N = 3SE +/- 126998.98, N = 3SE +/- 633798.86, N = 3738040017489518772630737

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1238001600240032004000SE +/- 10.92, N = 3SE +/- 46.19, N = 3SE +/- 38.84, N = 83802.523821.243711.96MIN: 3624.28MIN: 3746.24MIN: 3503.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10801239001800270036004500SE +/- 2.96, N = 3SE +/- 2.65, N = 3SE +/- 2.85, N = 34384426742681. (CXX) g++ options: -ldl -pipe -std=c++14 -fPIC -MD -MQ -MF

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.090.180.270.360.45SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.390.401. (CXX) g++ options: -O3 -pthread

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown123612182430SE +/- 0.33, N = 15SE +/- 0.22, N = 3SE +/- 0.31, N = 324.0224.6224.36MIN: 19.33 / MAX: 25.81MIN: 23.95 / MAX: 25.61MIN: 23 / MAX: 25.59

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon123510152025SE +/- 0.31, N = 15SE +/- 0.25, N = 15SE +/- 0.26, N = 1521.9622.5022.27MIN: 19.03 / MAX: 24.66MIN: 20.59 / MAX: 25.14MIN: 20.37 / MAX: 25.26

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1233K6K9K12K15KSE +/- 110.93, N = 3SE +/- 161.18, N = 3SE +/- 132.58, N = 313861.213529.013753.9MIN: 13490.9MIN: 13190.1MIN: 12649.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1230.00640.01280.01920.02560.032SE +/- 0.00038, N = 3SE +/- 0.00067, N = 3SE +/- 0.00018, N = 30.028190.027980.028661. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

yquake2

Renderer: OpenGL 3.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 10801232004006008001000SE +/- 12.14, N = 3SE +/- 13.22, N = 3SE +/- 10.37, N = 3949.7972.2972.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown123510152025SE +/- 0.21, N = 3SE +/- 0.38, N = 3SE +/- 0.22, N = 322.7322.2422.65MIN: 21.97 / MAX: 23.66MIN: 20.82 / MAX: 23.48MIN: 21.43 / MAX: 23.62

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 18400.26, N = 3SE +/- 19080.84, N = 4SE +/- 13973.47, N = 31368828.881340079.291366164.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

AI Benchmark Alpha

Device AI Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI Score12500100015002000250022702317

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms12348121620SE +/- 0.21, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 415.5015.4215.191. (CXX) g++ options: -O3 -pthread -lm

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.11480.22960.34440.45920.574SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.510.500.511. (CXX) g++ options: -O3 -pthread

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123918273645SE +/- 0.23, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 341.0240.3641.061. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.78991.57982.36973.15963.9495SE +/- 0.02421, N = 3SE +/- 0.04824, N = 3SE +/- 0.03734, N = 33.453703.490643.51087MIN: 1.93MIN: 2.01MIN: 21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon123510152025SE +/- 0.09, N = 3SE +/- 0.26, N = 15SE +/- 0.19, N = 1521.3221.6721.56MIN: 20.36 / MAX: 22.38MIN: 19.41 / MAX: 23.55MIN: 19.33 / MAX: 23.88

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.0720.1440.2160.2880.36SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.3150.3200.318

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lap123100200300400500SE +/- 3.42, N = 3SE +/- 3.25, N = 3SE +/- 1.25, N = 3448.24441.37446.381. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1232K4K6K8K10KSE +/- 34.06, N = 3SE +/- 9.81, N = 3SE +/- 47.16, N = 39506.29647.99540.01. (CC) gcc options: -O3

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1260K120K180K240K300K2868472910131. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search1234080120160200SE +/- 1.51, N = 10SE +/- 0.23, N = 3SE +/- 0.19, N = 3163.94162.56161.631. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 319.9720.1520.25MIN: 14.86MIN: 19.19MIN: 19.161. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.33520.67041.00561.34081.676SE +/- 0.01123, N = 15SE +/- 0.00191, N = 3SE +/- 0.00013, N = 31.469281.485491.48957MIN: 1.34MIN: 1.45MIN: 1.441. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.41810.83621.25431.67242.0905SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 21.8341.8581.8371. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed1232K4K6K8K10KSE +/- 65.90, N = 3SE +/- 79.09, N = 3SE +/- 51.76, N = 38518.048619.848508.641. (CC) gcc options: -O3

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1232K4K6K8K10KSE +/- 250.74, N = 3SE +/- 181.12, N = 3SE +/- 206.43, N = 310862.0410831.9710972.491. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1231.13782.27563.41344.55125.689SE +/- 0.025, N = 3SE +/- 0.054, N = 12SE +/- 0.019, N = 35.0364.9945.057

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231122334455SE +/- 0.54, N = 3SE +/- 0.25, N = 3SE +/- 0.29, N = 347.4647.1246.871. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1232K4K6K8K10KSE +/- 20.12, N = 15SE +/- 4.46, N = 3SE +/- 18.60, N = 39106.99217.29130.51. (CC) gcc options: -O3

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1231122334455SE +/- 0.22, N = 3SE +/- 0.10, N = 3SE +/- 0.24, N = 347.6047.6048.161. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1233K6K9K12K15KSE +/- 141.29, N = 3SE +/- 53.39, N = 3SE +/- 128.38, N = 1213826.213783.313668.4MIN: 13450.2MIN: 13567.9MIN: 12362.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

AI Benchmark Alpha

Device Inference Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference Score123006009001200150013041319

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.64331.28661.92992.57323.2165SE +/- 0.006, N = 3SE +/- 0.018, N = 3SE +/- 0.012, N = 32.8592.8302.853

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123714212835SE +/- 0.06, N = 4SE +/- 0.17, N = 4SE +/- 0.07, N = 430.8531.1531.031. (CC) gcc options: -O2 -std=c99

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU123246810SE +/- 0.05437, N = 3SE +/- 0.08570, N = 3SE +/- 0.09143, N = 36.277156.250116.21623MIN: 5.63MIN: 5.72MIN: 5.671. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1233691215SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 39.549.499.451. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.78051.5612.34153.1223.9025SE +/- 0.00942, N = 3SE +/- 0.00155, N = 3SE +/- 0.02456, N = 33.437273.449803.46909MIN: 3.34MIN: 3.36MIN: 3.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.04, N = 3SE +/- 0.15, N = 6SE +/- 0.12, N = 312.1512.2612.161. (CC) gcc options: -std=c99 -O3 -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1238001600240032004000SE +/- 16.86, N = 3SE +/- 8.72, N = 3SE +/- 8.76, N = 33749.183782.363783.40MIN: 3631.66MIN: 3756.39MIN: 3758.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1233691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.2910.3810.321. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 310.4710.5610.531. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12360120180240300SE +/- 0.54, N = 3SE +/- 0.29, N = 3SE +/- 0.70, N = 3296.80298.31295.79

yquake2

Renderer: Software CPU - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 108012320406080100SE +/- 1.02, N = 3SE +/- 1.01, N = 3SE +/- 0.93, N = 3106.6107.1106.21. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1238001600240032004000SE +/- 3.43, N = 3SE +/- 13.99, N = 3SE +/- 10.39, N = 33808.573780.473777.65MIN: 3705.89MIN: 3731.84MIN: 3644.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.47840.95681.43521.91362.392SE +/- 0.00488, N = 3SE +/- 0.00369, N = 3SE +/- 0.00796, N = 32.124082.108932.12611MIN: 2.04MIN: 2.03MIN: 2.051. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123612182430SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 326.9927.1326.921. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast123918273645SE +/- 0.28, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 339.6839.6739.391. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.67891.35782.03672.71563.3945SE +/- 0.02534, N = 3SE +/- 0.01913, N = 3SE +/- 0.01010, N = 32.995593.017213.01645MIN: 2.82MIN: 2.84MIN: 2.851. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1232K4K6K8K10KSE +/- 64.14, N = 3SE +/- 19.51, N = 3SE +/- 57.74, N = 39128.29191.99188.31. (CC) gcc options: -O3

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1233691215SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 311.0410.9611.04

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing123140280420560700SE +/- 1.61, N = 3SE +/- 0.39, N = 3SE +/- 0.07, N = 3651.21647.92647.091. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast123306090120150SE +/- 0.15, N = 3SE +/- 0.66, N = 3SE +/- 0.26, N = 3116.80116.09116.681. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1231.6M3.2M4.8M6.4M8MSE +/- 7194.80, N = 3SE +/- 6808.24, N = 3SE +/- 5337.04, N = 37329531737137473667011. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode123246810SE +/- 0.012, N = 5SE +/- 0.012, N = 5SE +/- 0.021, N = 57.7537.7957.7741. (CXX) g++ options: -fvisibility=hidden -logg -lm

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123714212835SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 327.6927.8427.781. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium123246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.346.326.351. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.11232K4K6K8K10KSE +/- 13.38, N = 3SE +/- 49.17, N = 3SE +/- 6.74, N = 39412945194171. (CXX) g++ options: -O3 -pthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 0123246810SE +/- 0.057, N = 3SE +/- 0.021, N = 3SE +/- 0.019, N = 37.5957.6267.6001. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1231224364860SE +/- 0.57, N = 3SE +/- 0.20, N = 2SE +/- 0.68, N = 351.751.651.81. (CC) gcc options: -fopenmp -O3 -lm

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1231.17452.3493.52354.6985.8725SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.225.225.201. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231530456075SE +/- 0.47, N = 3SE +/- 0.25, N = 3SE +/- 0.13, N = 367.2167.4667.251. (CC) gcc options: -O2 -ldl -lz -lpthread

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite12120K240K360K480K600KSE +/- 2547.76, N = 3SE +/- 145.08, N = 3577229579360

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.0410.0610.07

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive1231632486480SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.10, N = 373.3673.1473.101. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123612182430SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 325.2325.2225.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.21470.42940.64410.85881.0735SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 30.9540.9510.953

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.28670.57340.86011.14681.4335SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 31.2701.2731.274

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocompute12360120180240300SE +/- 0.83, N = 3SE +/- 0.53, N = 3SE +/- 0.38, N = 3268.72268.36267.901. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast123612182430SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 323.5423.5723.611. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12320406080100SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 393.2592.9893.06

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1231.33622.67244.00865.34486.681SE +/- 0.01022, N = 3SE +/- 0.00211, N = 3SE +/- 0.00240, N = 35.926045.938565.92300MIN: 5.7MIN: 5.71MIN: 5.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 325.1625.1025.10MIN: 24.12MIN: 24.22MIN: 23.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE12348121620SE +/- 0.06, N = 5SE +/- 0.06, N = 5SE +/- 0.06, N = 514.0314.0114.041. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212348121620SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 315.7915.7915.761. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second123200K400K600K800K1000KSE +/- 2731.02, N = 3SE +/- 686.93, N = 3SE +/- 1713.53, N = 31146059.291148000.041146667.411. (CC) gcc options: -O2 -lrt" -lrt

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavy1234080120160200SE +/- 0.36, N = 3SE +/- 0.37, N = 3SE +/- 0.29, N = 3192.54192.43192.221. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast1231428425670SE +/- 0.21, N = 3SE +/- 0.20, N = 3SE +/- 0.26, N = 360.6960.6960.791. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency1230.34580.69161.03741.38321.729SE +/- 0.00435, N = 3SE +/- 0.00474, N = 3SE +/- 0.00358, N = 31.536811.534531.535091. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile1220406080100SE +/- 0.24, N = 3SE +/- 0.21, N = 396.2096.34

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack123691215SE +/- 0.01, N = 5SE +/- 0.01, N = 513.2013.221. (CXX) g++ options: -rdynamic

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single1231122334455SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 349.0149.0049.021. (CXX) g++ options: -O3 -pthread

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetect123400800120016002000SE +/- 0.03, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 31762.981762.781762.551. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.1170.2340.3510.4680.585SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.520.520.521. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.0990.1980.2970.3960.495SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.440.440.441. (CXX) g++ options: -O3 -pthread

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: No10.46170.92341.38511.84682.30852.052

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m12320406080100SE +/- 1.78, N = 12SE +/- 2.13, N = 12SE +/- 2.24, N = 9101.01100.26101.27MIN: 90.72 / MAX: 1587.31MIN: 90.99 / MAX: 1833.25MIN: 89.99 / MAX: 2458.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd123918273645SE +/- 1.34, N = 12SE +/- 1.39, N = 12SE +/- 1.29, N = 941.2840.0139.14MIN: 31.56 / MAX: 448.33MIN: 31.52 / MAX: 514.49MIN: 31.19 / MAX: 435.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1231122334455SE +/- 1.15, N = 12SE +/- 1.28, N = 12SE +/- 1.42, N = 950.2550.5748.23MIN: 39.88 / MAX: 213.87MIN: 39.57 / MAX: 214.93MIN: 39.54 / MAX: 230.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501231632486480SE +/- 3.10, N = 12SE +/- 5.68, N = 12SE +/- 6.02, N = 972.0673.8670.92MIN: 39.39 / MAX: 562.21MIN: 40.4 / MAX: 546.35MIN: 39.2 / MAX: 557.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet123816243240SE +/- 1.15, N = 12SE +/- 1.09, N = 12SE +/- 1.85, N = 933.5632.3935.93MIN: 15.36 / MAX: 104.32MIN: 17.56 / MAX: 91.67MIN: 17.55 / MAX: 96.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet181231326395265SE +/- 3.22, N = 12SE +/- 5.95, N = 12SE +/- 3.64, N = 955.8953.1956.35MIN: 23.8 / MAX: 226.22MIN: 21.74 / MAX: 222.51MIN: 22.77 / MAX: 219.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1612320406080100SE +/- 2.14, N = 12SE +/- 2.43, N = 12SE +/- 2.35, N = 9103.41100.73100.17MIN: 63 / MAX: 216.59MIN: 65.03 / MAX: 221.49MIN: 63.35 / MAX: 242.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet123918273645SE +/- 2.33, N = 12SE +/- 1.77, N = 12SE +/- 1.78, N = 941.0838.9038.32MIN: 28.69 / MAX: 513.63MIN: 28.91 / MAX: 532.2MIN: 28.17 / MAX: 505.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1233691215SE +/- 1.94, N = 12SE +/- 0.76, N = 12SE +/- 0.51, N = 99.827.407.09MIN: 6.19 / MAX: 229.58MIN: 6.14 / MAX: 215.25MIN: 6.15 / MAX: 204.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b0123510152025SE +/- 0.25, N = 12SE +/- 0.70, N = 12SE +/- 0.66, N = 919.1419.2820.49MIN: 17.51 / MAX: 352.05MIN: 16.96 / MAX: 430.49MIN: 17.45 / MAX: 438.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet12348121620SE +/- 0.70, N = 11SE +/- 1.06, N = 12SE +/- 0.35, N = 915.6416.2613.77MIN: 12.3 / MAX: 352.98MIN: 12.28 / MAX: 390.46MIN: 12.4 / MAX: 378.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v212348121620SE +/- 0.51, N = 12SE +/- 0.14, N = 11SE +/- 0.93, N = 914.9214.6515.75MIN: 13.15 / MAX: 283.76MIN: 13.43 / MAX: 114.64MIN: 12.97 / MAX: 295.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v312348121620SE +/- 0.55, N = 12SE +/- 0.34, N = 12SE +/- 1.16, N = 914.5814.2215.95MIN: 12.63 / MAX: 382.37MIN: 12.46 / MAX: 387.96MIN: 12.47 / MAX: 359.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v212348121620SE +/- 0.35, N = 12SE +/- 0.63, N = 12SE +/- 0.24, N = 915.5916.3014.97MIN: 12.98 / MAX: 359.55MIN: 13.31 / MAX: 389.24MIN: 13.4 / MAX: 357.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet123918273645SE +/- 1.11, N = 12SE +/- 0.72, N = 12SE +/- 1.11, N = 937.3033.7834.39MIN: 29.33 / MAX: 427.64MIN: 29.45 / MAX: 408.23MIN: 29.33 / MAX: 412.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m12320406080100SE +/- 1.12, N = 12SE +/- 1.29, N = 12SE +/- 3.17, N = 12102.93101.47103.89MIN: 90.68 / MAX: 1519.43MIN: 90.53 / MAX: 2458.71MIN: 90.75 / MAX: 3380.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd1231020304050SE +/- 1.53, N = 12SE +/- 1.74, N = 12SE +/- 1.90, N = 1241.8940.8039.13MIN: 31.38 / MAX: 429.4MIN: 31.26 / MAX: 438.66MIN: 32.02 / MAX: 459.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1231122334455SE +/- 0.68, N = 12SE +/- 1.04, N = 12SE +/- 0.86, N = 1250.1949.6949.25MIN: 39.36 / MAX: 224.81MIN: 39.24 / MAX: 224.21MIN: 39.85 / MAX: 267.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet5012320406080100SE +/- 5.46, N = 12SE +/- 2.78, N = 12SE +/- 4.79, N = 1279.5375.0373.65MIN: 38.42 / MAX: 565.15MIN: 38.58 / MAX: 559.97MIN: 39.25 / MAX: 638.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet123918273645SE +/- 1.89, N = 12SE +/- 2.63, N = 12SE +/- 1.62, N = 1237.7234.3435.69MIN: 15.66 / MAX: 106.73MIN: 15.18 / MAX: 103.77MIN: 16.27 / MAX: 106.471. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet181231428425670SE +/- 4.97, N = 12SE +/- 5.28, N = 12SE +/- 4.57, N = 1259.9154.2964.49MIN: 23.15 / MAX: 227.75MIN: 22 / MAX: 230.1MIN: 21.25 / MAX: 228.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1612320406080100SE +/- 3.95, N = 12SE +/- 1.61, N = 12SE +/- 2.05, N = 12102.3892.9895.61MIN: 62.04 / MAX: 220.23MIN: 61.58 / MAX: 223.7MIN: 64.1 / MAX: 227.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet1231122334455SE +/- 2.92, N = 12SE +/- 2.39, N = 12SE +/- 3.13, N = 1247.8742.0442.95MIN: 27.73 / MAX: 542.74MIN: 28.93 / MAX: 517.65MIN: 27.93 / MAX: 530.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface123246810SE +/- 0.16, N = 12SE +/- 0.10, N = 12SE +/- 0.60, N = 126.836.677.65MIN: 6.11 / MAX: 191.61MIN: 6.12 / MAX: 175.66MIN: 6.15 / MAX: 211.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0123510152025SE +/- 0.67, N = 12SE +/- 1.79, N = 12SE +/- 0.37, N = 1220.0120.9218.97MIN: 17.38 / MAX: 456.25MIN: 17.12 / MAX: 465.23MIN: 17.3 / MAX: 414.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet12348121620SE +/- 0.16, N = 12SE +/- 0.64, N = 12SE +/- 0.31, N = 1213.6014.5014.17MIN: 12.42 / MAX: 189.74MIN: 12.55 / MAX: 348.4MIN: 12.78 / MAX: 347.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v212348121620SE +/- 0.85, N = 12SE +/- 0.15, N = 12SE +/- 0.64, N = 1215.2814.4215.30MIN: 13.6 / MAX: 309.56MIN: 13.2 / MAX: 104.86MIN: 12.9 / MAX: 306.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v312348121620SE +/- 0.50, N = 12SE +/- 1.39, N = 12SE +/- 1.31, N = 1214.4415.7015.21MIN: 12.49 / MAX: 358.11MIN: 12.6 / MAX: 382.98MIN: 12.52 / MAX: 378.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v212348121620SE +/- 0.44, N = 12SE +/- 0.34, N = 12SE +/- 0.73, N = 1215.6715.6015.78MIN: 13.15 / MAX: 357.88MIN: 13.33 / MAX: 383.7MIN: 13.2 / MAX: 388.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123918273645SE +/- 1.43, N = 12SE +/- 0.55, N = 12SE +/- 0.99, N = 1237.2434.5634.91MIN: 29.37 / MAX: 419.38MIN: 30.05 / MAX: 404.5MIN: 29.38 / MAX: 419.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 44065.81, N = 15SE +/- 132410.87, N = 12SE +/- 139985.77, N = 122431710.352058658.411998586.161. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Timed Clash Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To Compile123110220330440550SE +/- 13.67, N = 9SE +/- 4.96, N = 9SE +/- 0.58, N = 3463.22477.08485.49

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj12348121620SE +/- 0.39, N = 12SE +/- 0.39, N = 12SE +/- 0.30, N = 1517.6817.2417.40MIN: 15.66 / MAX: 20.6MIN: 14.95 / MAX: 20.29MIN: 15.12 / MAX: 19.79

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj123510152025SE +/- 0.25, N = 15SE +/- 0.37, N = 15SE +/- 0.44, N = 1218.2419.3019.40MIN: 16.42 / MAX: 21.1MIN: 16.68 / MAX: 22.31MIN: 16.59 / MAX: 22.2

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.47150.9431.41451.8862.3575SE +/- 0.05725, N = 15SE +/- 0.05017, N = 15SE +/- 0.03417, N = 151.657672.095672.00984MIN: 1.11MIN: 1.4MIN: 1.391. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.84641.69282.53923.38564.232SE +/- 0.07703, N = 15SE +/- 0.07933, N = 15SE +/- 0.01146, N = 33.696573.761643.41699MIN: 3.25MIN: 3.26MIN: 3.221. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Zstd Compression

Compression Level: 3

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 31238001600240032004000SE +/- 88.27, N = 12SE +/- 129.73, N = 12SE +/- 107.86, N = 153606.73798.93781.21. (CC) gcc options: -O3 -pthread -lz -llzma

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1233691215SE +/- 0.36, N = 12SE +/- 0.30, N = 15SE +/- 0.15, N = 313.4012.9112.551. (CXX) g++ options: -O3 -pthread -lm

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad1230.30110.60220.90331.20441.5055SE +/- 0.05676, N = 3SE +/- 0.03714, N = 3SE +/- 0.07867, N = 31.309661.307711.338201. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans1230.97671.95342.93013.90684.8835SE +/- 0.25728, N = 3SE +/- 0.03755, N = 3SE +/- 0.08946, N = 34.001644.341064.174281. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM1233691215SE +/- 0.86774, N = 3SE +/- 0.14154, N = 3SE +/- 0.34471, N = 312.617278.8030313.034171. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highest1233691215SE +/- 0.24, N = 15SE +/- 0.02, N = 3SE +/- 0.04, N = 312.7012.5412.551. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highest1233691215SE +/- 0.23, N = 15SE +/- 0.01, N = 3SE +/- 0.01, N = 310.7510.5210.531. (CXX) g++ options: -O3 -O2 -lpthread -ldl


Phoronix Test Suite v10.8.4