xeon auggy

Tests for a future article. 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 22.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2308065-NE-XEONAUGGY78&grs&export=pdf&sor&rro.

xeon auggyProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen Resolutionab2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)Intel Ice Lake IEH512GB7682GB INTEL SSDPF2KX076TZASPEEDVE2282 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFPUbuntu 22.106.2.0-rc5-phx-dodt (x86_64)GNOME Shell 43.0X Server 1.21.1.31.3.224GCC 12.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xd000389 Java Details- OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu122.10.1)Python Details- Python 3.10.7Security Details- dodt: Mitigation of DOITM + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

xeon auggylibxsmm: 128stress-ng: Cloningstress-ng: Pipeapache-iotdb: 100 - 100 - 500apache-iotdb: 100 - 100 - 500ncnn: CPU - resnet18apache-iotdb: 500 - 1 - 200heffte: r2c - FFTW - double - 128apache-iotdb: 500 - 1 - 200ncnn: CPU - FastestDetncnn: CPU - alexnetliquid-dsp: 128 - 256 - 57heffte: r2c - FFTW - float - 256heffte: c2c - FFTW - float - 256ncnn: CPU - resnet50ncnn: CPU - regnety_400mapache-iotdb: 200 - 1 - 200apache-iotdb: 100 - 1 - 200heffte: c2c - FFTW - double - 128ncnn: CPU - blazefaceheffte: c2c - Stock - float - 256ncnn: CPU - mnasnetheffte: c2c - Stock - double - 128ncnn: CPU - squeezenet_ssdapache-iotdb: 500 - 1 - 500ncnn: CPU - vision_transformerncnn: CPU - googlenetstress-ng: Pthreadapache-iotdb: 500 - 1 - 500heffte: r2c - Stock - float - 128z3: 1.smt2apache-iotdb: 200 - 1 - 200embree: Pathtracer - Crownospray: particle_volume/scivis/real_timeheffte: r2c - Stock - double - 512apache-iotdb: 100 - 1 - 200ospray: gravity_spheres_volume/dim_512/scivis/real_timeapache-iotdb: 100 - 100 - 200heffte: c2c - FFTW - double - 512heffte: r2c - Stock - float - 512apache-iotdb: 100 - 100 - 200liquid-dsp: 16 - 256 - 512vvenc: Bosphorus 4K - Fasterremhos: Sample Remap Examplencnn: CPU - efficientnet-b0liquid-dsp: 16 - 256 - 57heffte: c2c - FFTW - double - 256liquid-dsp: 160 - 256 - 57ospray: gravity_spheres_volume/dim_512/ao/real_timelibxsmm: 256ncnn: CPU - yolov4-tinyheffte: c2c - Stock - float - 512liquid-dsp: 64 - 256 - 32liquid-dsp: 32 - 256 - 512stress-ng: Vector Floating Pointliquid-dsp: 32 - 256 - 57ncnn: CPU - mobilenetoidn: RT.hdr_alb_nrm.3840x2160 - CPU-Onlylibxsmm: 32liquid-dsp: 16 - 256 - 32z3: 2.smt2vvenc: Bosphorus 4K - Fastncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3oidn: RTLightmap.hdr.4096x4096 - CPU-Onlyoidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyliquid-dsp: 64 - 256 - 512blender: Fishy Cat - CPU-Onlyquantlib: ospray: gravity_spheres_volume/dim_512/pathtracer/real_timeliquid-dsp: 128 - 256 - 32heffte: c2c - FFTW - float - 512heffte: r2c - Stock - double - 256apache-iotdb: 100 - 1 - 500heffte: c2c - Stock - float - 128srsran: PUSCH Processor Benchmark, Throughput Totalliquid-dsp: 128 - 256 - 512gpaw: Carbon Nanotubeheffte: c2c - FFTW - float - 128embree: Pathtracer - Asian Dragon Objncnn: CPU-v2-v2 - mobilenet-v2heffte: r2c - FFTW - double - 512liquid-dsp: 64 - 256 - 57heffte: c2c - Stock - double - 256vvenc: Bosphorus 1080p - Fasterlibxsmm: 64ncnn: CPU - vgg16blender: BMW27 - CPU-Onlyliquid-dsp: 160 - 256 - 32blender: Classroom - CPU-Onlyapache-iotdb: 200 - 1 - 500liquid-dsp: 1 - 256 - 512apache-iotdb: 100 - 1 - 500heffte: r2c - Stock - float - 256heffte: c2c - Stock - double - 512liquid-dsp: 1 - 256 - 32apache-iotdb: 200 - 1 - 500blender: Barbershop - CPU-Onlyheffte: r2c - FFTW - double - 256liquid-dsp: 160 - 256 - 512build-gcc: Time To Compileembree: Pathtracer ISPC - Asian Dragon Objheffte: r2c - Stock - double - 128heffte: r2c - FFTW - float - 512embree: Pathtracer ISPC - Asian Dragonembree: Pathtracer - Asian Dragonstress-ng: Fused Multiply-Adddav1d: Summer Nature 1080pheffte: r2c - FFTW - float - 128vvenc: Bosphorus 1080p - Fastliquid-dsp: 32 - 256 - 32ospray: particle_volume/pathtracer/real_timeospray: particle_volume/ao/real_timedav1d: Chimera 1080psrsran: PUSCH Processor Benchmark, Throughput Threadsrsran: Downlink Processor Benchmarkstress-ng: Vector Shuffledav1d: Summer Nature 4Kstress-ng: Wide Vector Mathencode-opus: WAV To Opus Encodestress-ng: AVL Treeliquid-dsp: 1 - 256 - 57dav1d: Chimera 1080p 10-bitstress-ng: Matrix 3D Mathstress-ng: Floating Pointstress-ng: Zlibembree: Pathtracer ISPC - Crowncouchdb: 500 - 1000 - 30couchdb: 300 - 1000 - 30couchdb: 100 - 1000 - 30apache-iotdb: 100 - 1 - 200ab1055.316195.0340500166.81109.439562245.2210.3113.25156.2241199743.229.625.652519200000222.215102.27817.5738.2014.8417.5494.45444.49101.87.6169.481615.781343156.5646.5017.0692131.5433.38185.45325.713904320.672.041924.359294.2637638644.3520.81134266143.8549.4363176.63042.7920161500010.28412.19511.4861510500045.8509260230000021.2056599.824.7793.33491805000000400730000132479.08119770000016.063.04633.249366000087.9985.6729.828.711.473.0572584000030.592622.922.6977296110000094.8348101.93835.99107.4529800.594940000045.824159.34476.96087.9190.5745206920000046.663629.0771219.926.2723.69339070000062.351134736.5413323000995259.68236.66647.28013233800036.74239.5593.00981013200000957.94689.8447117.006170.906104.414885.2423181083180.47699.97199.10315.708992540000151.13824.637516.17164.8556.548054.48282.532195391.4136.736610.6953918500476.8212743.8121134.816879.8687.93061090.424152.45694.8341946.713172.8149325396.9796.1843021501.49.6314.12148.1771141859.2510.015.442426350000230.54198.831518.1539.3714.4218.0491.90394.61104.3457.4367.948316.131372429.5845.5616.7290361.732.75182.0325.251920435.7770.790524.782792.7173628202.5520.481834807016.8548.7210174.11442.1919879000010.43012.36511.6462353500046.4607263645000020.9459592.524.4892.25681825450000396265000131100.25118550000015.903.01639.349841000087.1785.7179.758.771.463.0373031000030.772607.622.5752294515000094.3442102.42636.16107.9529756.794519000045.636158.71177.26337.9490.2388207665000046.505229.1761216.026.1923.62338195000062.511137612.6113291000992909.69236.11947.38563226700036.82239.0392.81611011200000956.12790.0132116.839171.134104.553985.1315181314757.42699.09199.33315.723993445000151.27324.6207516.50164.7556.848076.78282.652196242.2136.726610.8353926500476.7712742.7021133.026880.2287.9319OpenBenchmarking.org

libxsmm

M N K: 128

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128ab400800120016002000SE +/- 54.55, N = 21055.31946.71. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Stress-NG

Test: Cloning

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Cloningba3K6K9K12K15KSE +/- 654.66, N = 2SE +/- 3270.70, N = 213172.8116195.031. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Pipe

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Pipeab11M22M33M44M55MSE +/- 2523742.74, N = 2SE +/- 6369572.71, N = 240500166.8149325396.971. (CXX) g++ options: -O2 -std=gnu99 -lc

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500ab20406080100109.4096.18MAX: 2142.92MAX: 1249.92

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500ab9M18M27M36M45M39562245.2243021501.40

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18ab3691215SE +/- 1.04, N = 2SE +/- 0.31, N = 210.319.63MIN: 9.03 / MAX: 33.3MIN: 9.16 / MAX: 26.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200ba4812162014.1213.25MAX: 878.17MAX: 896.77

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128ba306090120150SE +/- 2.45, N = 2148.18156.221. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200ba300K600K900K1200K1500K1141859.251199743.22

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetba3691215SE +/- 0.28, N = 2SE +/- 0.05, N = 210.019.62MIN: 9.4 / MAX: 59.43MIN: 9.35 / MAX: 10.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetab1.27132.54263.81395.08526.3565SE +/- 0.43, N = 2SE +/- 0.22, N = 25.655.44MIN: 5.03 / MAX: 6.71MIN: 5.08 / MAX: 7.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 57ba500M1000M1500M2000M2500MSE +/- 13350000.00, N = 2242635000025192000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256ab50100150200250SE +/- 3.31, N = 2222.22230.541. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256ba20406080100SE +/- 0.00, N = 298.83102.281. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50ba48121620SE +/- 0.83, N = 2SE +/- 0.36, N = 218.1517.57MIN: 16.98 / MAX: 42.81MIN: 16.92 / MAX: 18.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mba918273645SE +/- 1.10, N = 2SE +/- 0.86, N = 239.3738.20MIN: 37.07 / MAX: 103.97MIN: 36.18 / MAX: 62.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200ab4812162014.8414.42MAX: 605.55MAX: 596.84

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200ba4812162018.0417.54MAX: 597.99MAX: 680.16

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128ba20406080100SE +/- 1.46, N = 291.9094.451. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefaceba1.03732.07463.11194.14925.1865SE +/- 0.01, N = 2SE +/- 0.09, N = 24.614.49MIN: 4.49 / MAX: 5.33MIN: 4.31 / MAX: 5.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 256ab20406080100SE +/- 0.34, N = 2101.80104.351. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetab246810SE +/- 0.07, N = 2SE +/- 0.01, N = 27.617.43MIN: 7.33 / MAX: 43.29MIN: 7.16 / MAX: 15.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 128ba1530456075SE +/- 1.01, N = 267.9569.481. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssdba48121620SE +/- 0.41, N = 2SE +/- 0.07, N = 216.1315.78MIN: 15.35 / MAX: 39.36MIN: 15.4 / MAX: 43.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500ab300K600K900K1200K1500K1343156.561372429.58

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformerab1122334455SE +/- 2.46, N = 2SE +/- 1.19, N = 246.5045.56MIN: 42.6 / MAX: 72.28MIN: 43.24 / MAX: 70.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetab48121620SE +/- 1.05, N = 2SE +/- 0.37, N = 217.0616.72MIN: 15.5 / MAX: 66.12MIN: 15.67 / MAX: 100.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stress-NG

Test: Pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Pthreadba20K40K60K80K100KSE +/- 894.90, N = 2SE +/- 279.15, N = 290361.7092131.541. (CXX) g++ options: -O2 -std=gnu99 -lc

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500ab81624324033.3832.75MAX: 934.86MAX: 992.49

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 128ba4080120160200SE +/- 0.86, N = 2182.03185.451. (CXX) g++ options: -O3

Z3 Theorem Prover

SMT File: 1.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 1.smt2ab612182430SE +/- 0.03, N = 2SE +/- 0.07, N = 225.7125.251. (CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200ab200K400K600K800K1000K904320.60920435.77

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Crownba1632486480SE +/- 0.12, N = 270.7972.04MIN: 67 / MAX: 79.71MIN: 68.2 / MAX: 79.55

OSPRay

Benchmark: particle_volume/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/scivis/real_timeab612182430SE +/- 0.03, N = 224.3624.78

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 512ba20406080100SE +/- 0.73, N = 2SE +/- 0.13, N = 292.7294.261. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200ba140K280K420K560K700K628202.55638644.35

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeba510152025SE +/- 0.05, N = 220.4820.81

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200ab7M14M21M28M35M34266143.8534807016.85

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512ba1122334455SE +/- 0.39, N = 2SE +/- 0.29, N = 248.7249.441. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 512ba4080120160200SE +/- 0.12, N = 2SE +/- 0.68, N = 2174.11176.631. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200ab102030405042.7942.19MAX: 855.16MAX: 784.56

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 512ba40M80M120M160M200MSE +/- 650000.00, N = 2SE +/- 795000.00, N = 21987900002016150001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fasterab3691215SE +/- 0.03, N = 2SE +/- 0.10, N = 210.2810.431. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Exampleba3691215SE +/- 0.10, N = 212.3712.201. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0ba3691215SE +/- 0.26, N = 2SE +/- 0.23, N = 211.6411.48MIN: 10.85 / MAX: 37.54MIN: 10.9 / MAX: 56.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 57ab130M260M390M520M650MSE +/- 3155000.00, N = 2SE +/- 11305000.00, N = 26151050006235350001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256ab1122334455SE +/- 0.55, N = 245.8546.461. (CXX) g++ options: -O3

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 160 - Buffer Length: 256 - Filter Length: 57ab600M1200M1800M2400M3000MSE +/- 17250000.00, N = 2260230000026364500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timeba510152025SE +/- 0.22, N = 220.9521.21

libxsmm

M N K: 256

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256ba130260390520650SE +/- 2.65, N = 2592.5599.81. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinyab612182430SE +/- 0.65, N = 2SE +/- 0.64, N = 224.7724.48MIN: 22.68 / MAX: 208.18MIN: 22.66 / MAX: 47.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 512ba20406080100SE +/- 0.24, N = 2SE +/- 0.40, N = 292.2693.331. (CXX) g++ options: -O3

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32ab400M800M1200M1600M2000MSE +/- 2750000.00, N = 2180500000018254500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 512ba90M180M270M360M450MSE +/- 2775000.00, N = 23962650004007300001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: Vector Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Vector Floating Pointba30K60K90K120K150KSE +/- 235.00, N = 2SE +/- 872.22, N = 2131100.25132479.081. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 57ba300M600M900M1200M1500MSE +/- 20600000.00, N = 2118550000011977000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetab48121620SE +/- 0.82, N = 2SE +/- 0.28, N = 216.0615.90MIN: 14.92 / MAX: 25.43MIN: 15.2 / MAX: 39.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Onlyba0.6841.3682.0522.7363.42SE +/- 0.00, N = 23.013.04

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32ab140280420560700SE +/- 2.35, N = 2633.2639.31. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 32ab110M220M330M440M550MSE +/- 1500000.00, N = 2SE +/- 830000.00, N = 24936600004984100001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Z3 Theorem Prover

SMT File: 2.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 2.smt2ab20406080100SE +/- 0.02, N = 2SE +/- 0.05, N = 288.0087.181. (CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fastab1.28632.57263.85895.14526.4315SE +/- 0.019, N = 2SE +/- 0.063, N = 25.6725.7171. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2ab3691215SE +/- 0.03, N = 2SE +/- 0.06, N = 29.829.75MIN: 9.6 / MAX: 12.61MIN: 9.56 / MAX: 13.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3ba246810SE +/- 0.03, N = 2SE +/- 0.14, N = 28.778.71MIN: 8.59 / MAX: 32.78MIN: 8.43 / MAX: 9.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RTLightmap.hdr.4096x4096 - Device: CPU-Onlyba0.33080.66160.99241.32321.654SE +/- 0.00, N = 21.461.47

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Onlyba0.68631.37262.05892.74523.4315SE +/- 0.01, N = 23.033.05

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512ab160M320M480M640M800MSE +/- 1250000.00, N = 27258400007303100001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlyba714212835SE +/- 0.02, N = 2SE +/- 0.01, N = 230.7730.59

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.30ba6001200180024003000SE +/- 1.80, N = 2SE +/- 2.00, N = 22607.62622.91. (CXX) g++ options: -O3 -march=native -fPIE -pie

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timeba510152025SE +/- 0.00, N = 222.5822.70

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 32ba600M1200M1800M2400M3000MSE +/- 6350000.00, N = 2294515000029611000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512ba20406080100SE +/- 0.95, N = 2SE +/- 1.11, N = 294.3494.831. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 256ab20406080100SE +/- 0.72, N = 2101.94102.431. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500ba81624324036.1635.99MAX: 769.5MAX: 724.8

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 128ab20406080100SE +/- 1.54, N = 2107.45107.951. (CXX) g++ options: -O3

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput Totalba2K4K6K8K10KSE +/- 33.95, N = 2SE +/- 47.35, N = 29756.79800.51. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512ba200M400M600M800M1000MSE +/- 2180000.00, N = 29451900009494000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon Nanotubeab1020304050SE +/- 0.02, N = 2SE +/- 0.03, N = 245.8245.641. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128ba4080120160200SE +/- 0.21, N = 2158.71159.341. (CXX) g++ options: -O3

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragon Objab20406080100SE +/- 0.03, N = 2SE +/- 0.03, N = 276.9677.26MIN: 75.53 / MAX: 82.14MIN: 75.78 / MAX: 81.08

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2ba246810SE +/- 0.02, N = 2SE +/- 0.13, N = 27.947.91MIN: 7.81 / MAX: 10.99MIN: 7.68 / MAX: 9.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512ba20406080100SE +/- 1.17, N = 2SE +/- 1.08, N = 290.2490.571. (CXX) g++ options: -O3

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57ab400M800M1200M1600M2000MSE +/- 13650000.00, N = 2206920000020766500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 256ba1122334455SE +/- 0.08, N = 246.5146.661. (CXX) g++ options: -O3

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fasterab714212835SE +/- 0.37, N = 2SE +/- 0.23, N = 229.0829.181. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64ba30060090012001500SE +/- 1.25, N = 21216.01219.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16ab612182430SE +/- 0.84, N = 2SE +/- 0.34, N = 226.2726.19MIN: 24.05 / MAX: 301.35MIN: 24.19 / MAX: 341.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlyab612182430SE +/- 0.09, N = 2SE +/- 0.06, N = 223.6923.62

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 160 - Buffer Length: 256 - Filter Length: 32ba700M1400M2100M2800M3500MSE +/- 9150000.00, N = 2338195000033907000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlyba1428425670SE +/- 0.19, N = 2SE +/- 0.04, N = 262.5162.35

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500ab200K400K600K800K1000K1134736.541137612.61

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 512ba3M6M9M12M15MSE +/- 34000.00, N = 2SE +/- 1000.00, N = 213291000133230001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500ba200K400K600K800K1000K992909.69995259.68

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 256ba50100150200250SE +/- 2.75, N = 2236.12236.671. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 512ab1122334455SE +/- 0.14, N = 2SE +/- 0.07, N = 247.2847.391. (CXX) g++ options: -O3

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 32ba7M14M21M28M35MSE +/- 0.00, N = 2SE +/- 0.00, N = 232267000323380001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500ba81624324036.8236.74MAX: 691.5MAX: 793.88

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlyab50100150200250SE +/- 0.32, N = 2SE +/- 1.32, N = 2239.55239.03

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256ba20406080100SE +/- 1.52, N = 292.8293.011. (CXX) g++ options: -O3

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 160 - Buffer Length: 256 - Filter Length: 512ba200M400M600M800M1000MSE +/- 1800000.00, N = 2101120000010132000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 13.2Time To Compileab2004006008001000SE +/- 1.97, N = 2957.95956.13

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragon Objab20406080100SE +/- 0.06, N = 2SE +/- 0.05, N = 289.8490.01MIN: 87.68 / MAX: 94.71MIN: 87.6 / MAX: 94.43

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 128ba306090120150SE +/- 4.04, N = 2116.84117.011. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512ab4080120160200SE +/- 1.25, N = 2SE +/- 1.39, N = 2170.91171.131. (CXX) g++ options: -O3

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragonab20406080100SE +/- 0.32, N = 2SE +/- 0.24, N = 2104.41104.55MIN: 101.88 / MAX: 109.22MIN: 102.2 / MAX: 108.91

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragonba20406080100SE +/- 0.14, N = 2SE +/- 0.04, N = 285.1385.24MIN: 83.65 / MAX: 90.45MIN: 83.75 / MAX: 89.99

Stress-NG

Test: Fused Multiply-Add

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Fused Multiply-Addab40M80M120M160M200MSE +/- 118686.48, N = 2SE +/- 92010.25, N = 2181083180.47181314757.421. (CXX) g++ options: -O2 -std=gnu99 -lc

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Summer Nature 1080pba150300450600750SE +/- 0.50, N = 2699.09699.971. (CC) gcc options: -pthread -lm

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128ab4080120160200SE +/- 0.39, N = 2199.10199.331. (CXX) g++ options: -O3

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fastab48121620SE +/- 0.06, N = 2SE +/- 0.04, N = 215.7115.721. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 32ab200M400M600M800M1000MSE +/- 2085000.00, N = 29925400009934450001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/pathtracer/real_timeab306090120150SE +/- 0.63, N = 2151.14151.27

OSPRay

Benchmark: particle_volume/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/ao/real_timeba612182430SE +/- 0.09, N = 224.6224.64

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Chimera 1080pab110220330440550SE +/- 0.06, N = 2516.17516.501. (CC) gcc options: -pthread -lm

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput Threadba4080120160200SE +/- 0.90, N = 2SE +/- 1.70, N = 2164.7164.81. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: Downlink Processor Benchmark

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: Downlink Processor Benchmarkab120240360480600SE +/- 0.70, N = 2SE +/- 1.25, N = 2556.5556.81. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Stress-NG

Test: Vector Shuffle

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Vector Shuffleab10K20K30K40K50KSE +/- 1.26, N = 2SE +/- 42.22, N = 248054.4848076.781. (CXX) g++ options: -O2 -std=gnu99 -lc

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Summer Nature 4Kab60120180240300SE +/- 0.08, N = 2282.53282.651. (CC) gcc options: -pthread -lm

Stress-NG

Test: Wide Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Wide Vector Mathab500K1000K1500K2000K2500KSE +/- 1200.16, N = 2SE +/- 497.69, N = 22195391.412196242.211. (CXX) g++ options: -O2 -std=gnu99 -lc

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.4WAV To Opus Encodeab816243240SE +/- 0.01, N = 236.7436.731. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

Stress-NG

Test: AVL Tree

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: AVL Treeab130260390520650SE +/- 0.08, N = 2SE +/- 0.44, N = 2610.69610.831. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 57ab12M24M36M48M60MSE +/- 500.00, N = 2SE +/- 1500.00, N = 253918500539265001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Chimera 1080p 10-bitba100200300400500SE +/- 0.41, N = 2476.77476.821. (CC) gcc options: -pthread -lm

Stress-NG

Test: Matrix 3D Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Matrix 3D Mathba3K6K9K12K15KSE +/- 5.06, N = 2SE +/- 9.80, N = 212742.7012743.811. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Floating Pointba5K10K15K20K25KSE +/- 9.14, N = 2SE +/- 6.86, N = 221133.0221134.811. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Zlib

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Zlibab15003000450060007500SE +/- 8.83, N = 2SE +/- 4.39, N = 26879.866880.221. (CXX) g++ options: -O2 -std=gnu99 -lc

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Crownab20406080100SE +/- 0.10, N = 287.9387.93MIN: 85.27 / MAX: 92.58MIN: 84.73 / MAX: 92.37

Apache CouchDB

Bulk Size: 500 - Inserts: 1000 - Rounds: 30

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 500 - Inserts: 1000 - Rounds: 30a20040060080010001090.421. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

Apache CouchDB

Bulk Size: 300 - Inserts: 1000 - Rounds: 30

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 300 - Inserts: 1000 - Rounds: 30a306090120150152.461. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

Apache CouchDB

Bulk Size: 100 - Inserts: 1000 - Rounds: 30

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 100 - Inserts: 1000 - Rounds: 30a2040608010094.831. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD


Phoronix Test Suite v10.8.5