xeon auggy

Tests for a future article. 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 22.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2308065-NE-XEONAUGGY78&grs&sor.

xeon auggyProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen Resolutionab2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)Intel Ice Lake IEH512GB7682GB INTEL SSDPF2KX076TZASPEEDVE2282 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFPUbuntu 22.106.2.0-rc5-phx-dodt (x86_64)GNOME Shell 43.0X Server 1.21.1.31.3.224GCC 12.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xd000389 Java Details- OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu122.10.1)Python Details- Python 3.10.7Security Details- dodt: Mitigation of DOITM + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

xeon auggylibxsmm: 128stress-ng: Cloningstress-ng: Pipeapache-iotdb: 100 - 100 - 500apache-iotdb: 100 - 100 - 500ncnn: CPU - resnet18apache-iotdb: 500 - 1 - 200heffte: r2c - FFTW - double - 128apache-iotdb: 500 - 1 - 200ncnn: CPU - FastestDetncnn: CPU - alexnetliquid-dsp: 128 - 256 - 57heffte: r2c - FFTW - float - 256heffte: c2c - FFTW - float - 256ncnn: CPU - resnet50ncnn: CPU - regnety_400mapache-iotdb: 200 - 1 - 200apache-iotdb: 100 - 1 - 200heffte: c2c - FFTW - double - 128ncnn: CPU - blazefaceheffte: c2c - Stock - float - 256ncnn: CPU - mnasnetheffte: c2c - Stock - double - 128ncnn: CPU - squeezenet_ssdapache-iotdb: 500 - 1 - 500ncnn: CPU - vision_transformerncnn: CPU - googlenetstress-ng: Pthreadapache-iotdb: 500 - 1 - 500heffte: r2c - Stock - float - 128z3: 1.smt2apache-iotdb: 200 - 1 - 200embree: Pathtracer - Crownospray: particle_volume/scivis/real_timeheffte: r2c - Stock - double - 512apache-iotdb: 100 - 1 - 200ospray: gravity_spheres_volume/dim_512/scivis/real_timeapache-iotdb: 100 - 100 - 200heffte: c2c - FFTW - double - 512heffte: r2c - Stock - float - 512apache-iotdb: 100 - 100 - 200liquid-dsp: 16 - 256 - 512vvenc: Bosphorus 4K - Fasterremhos: Sample Remap Examplencnn: CPU - efficientnet-b0liquid-dsp: 16 - 256 - 57heffte: c2c - FFTW - double - 256liquid-dsp: 160 - 256 - 57ospray: gravity_spheres_volume/dim_512/ao/real_timelibxsmm: 256ncnn: CPU - yolov4-tinyheffte: c2c - Stock - float - 512liquid-dsp: 64 - 256 - 32liquid-dsp: 32 - 256 - 512stress-ng: Vector Floating Pointliquid-dsp: 32 - 256 - 57ncnn: CPU - mobilenetoidn: RT.hdr_alb_nrm.3840x2160 - CPU-Onlylibxsmm: 32liquid-dsp: 16 - 256 - 32z3: 2.smt2vvenc: Bosphorus 4K - Fastncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3oidn: RTLightmap.hdr.4096x4096 - CPU-Onlyoidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyliquid-dsp: 64 - 256 - 512blender: Fishy Cat - CPU-Onlyquantlib: ospray: gravity_spheres_volume/dim_512/pathtracer/real_timeliquid-dsp: 128 - 256 - 32heffte: c2c - FFTW - float - 512heffte: r2c - Stock - double - 256apache-iotdb: 100 - 1 - 500heffte: c2c - Stock - float - 128srsran: PUSCH Processor Benchmark, Throughput Totalliquid-dsp: 128 - 256 - 512gpaw: Carbon Nanotubeheffte: c2c - FFTW - float - 128embree: Pathtracer - Asian Dragon Objncnn: CPU-v2-v2 - mobilenet-v2heffte: r2c - FFTW - double - 512liquid-dsp: 64 - 256 - 57heffte: c2c - Stock - double - 256vvenc: Bosphorus 1080p - Fasterlibxsmm: 64ncnn: CPU - vgg16blender: BMW27 - CPU-Onlyliquid-dsp: 160 - 256 - 32blender: Classroom - CPU-Onlyapache-iotdb: 200 - 1 - 500liquid-dsp: 1 - 256 - 512apache-iotdb: 100 - 1 - 500heffte: r2c - Stock - float - 256heffte: c2c - Stock - double - 512liquid-dsp: 1 - 256 - 32apache-iotdb: 200 - 1 - 500blender: Barbershop - CPU-Onlyheffte: r2c - FFTW - double - 256liquid-dsp: 160 - 256 - 512build-gcc: Time To Compileembree: Pathtracer ISPC - Asian Dragon Objheffte: r2c - Stock - double - 128heffte: r2c - FFTW - float - 512embree: Pathtracer ISPC - Asian Dragonembree: Pathtracer - Asian Dragonstress-ng: Fused Multiply-Adddav1d: Summer Nature 1080pheffte: r2c - FFTW - float - 128vvenc: Bosphorus 1080p - Fastliquid-dsp: 32 - 256 - 32ospray: particle_volume/pathtracer/real_timeospray: particle_volume/ao/real_timedav1d: Chimera 1080psrsran: PUSCH Processor Benchmark, Throughput Threadsrsran: Downlink Processor Benchmarkstress-ng: Vector Shuffledav1d: Summer Nature 4Kstress-ng: Wide Vector Mathencode-opus: WAV To Opus Encodestress-ng: AVL Treeliquid-dsp: 1 - 256 - 57dav1d: Chimera 1080p 10-bitstress-ng: Matrix 3D Mathstress-ng: Floating Pointstress-ng: Zlibembree: Pathtracer ISPC - Crowncouchdb: 500 - 1000 - 30couchdb: 300 - 1000 - 30couchdb: 100 - 1000 - 30apache-iotdb: 100 - 1 - 200ab1055.316195.0340500166.81109.439562245.2210.3113.25156.2241199743.229.625.652519200000222.215102.27817.5738.2014.8417.5494.45444.49101.87.6169.481615.781343156.5646.5017.0692131.5433.38185.45325.713904320.672.041924.359294.2637638644.3520.81134266143.8549.4363176.63042.7920161500010.28412.19511.4861510500045.8509260230000021.2056599.824.7793.33491805000000400730000132479.08119770000016.063.04633.249366000087.9985.6729.828.711.473.0572584000030.592622.922.6977296110000094.8348101.93835.99107.4529800.594940000045.824159.34476.96087.9190.5745206920000046.663629.0771219.926.2723.69339070000062.351134736.5413323000995259.68236.66647.28013233800036.74239.5593.00981013200000957.94689.8447117.006170.906104.414885.2423181083180.47699.97199.10315.708992540000151.13824.637516.17164.8556.548054.48282.532195391.4136.736610.6953918500476.8212743.8121134.816879.8687.93061090.424152.45694.8341946.713172.8149325396.9796.1843021501.49.6314.12148.1771141859.2510.015.442426350000230.54198.831518.1539.3714.4218.0491.90394.61104.3457.4367.948316.131372429.5845.5616.7290361.732.75182.0325.251920435.7770.790524.782792.7173628202.5520.481834807016.8548.7210174.11442.1919879000010.43012.36511.6462353500046.4607263645000020.9459592.524.4892.25681825450000396265000131100.25118550000015.903.01639.349841000087.1785.7179.758.771.463.0373031000030.772607.622.5752294515000094.3442102.42636.16107.9529756.794519000045.636158.71177.26337.9490.2388207665000046.505229.1761216.026.1923.62338195000062.511137612.6113291000992909.69236.11947.38563226700036.82239.0392.81611011200000956.12790.0132116.839171.134104.553985.1315181314757.42699.09199.33315.723993445000151.27324.6207516.50164.7556.848076.78282.652196242.2136.726610.8353926500476.7712742.7021133.026880.2287.9319OpenBenchmarking.org

libxsmm

M N K: 128

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128ba400800120016002000SE +/- 54.55, N = 21946.71055.31. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Stress-NG

Test: Cloning

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Cloningab3K6K9K12K15KSE +/- 3270.70, N = 2SE +/- 654.66, N = 216195.0313172.811. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Pipe

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Pipeba11M22M33M44M55MSE +/- 6369572.71, N = 2SE +/- 2523742.74, N = 249325396.9740500166.811. (CXX) g++ options: -O2 -std=gnu99 -lc

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500ba2040608010096.18109.40MAX: 1249.92MAX: 2142.92

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500ba9M18M27M36M45M43021501.4039562245.22

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18ba3691215SE +/- 0.31, N = 2SE +/- 1.04, N = 29.6310.31MIN: 9.16 / MAX: 26.03MIN: 9.03 / MAX: 33.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200ab4812162013.2514.12MAX: 896.77MAX: 878.17

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128ab306090120150SE +/- 2.45, N = 2156.22148.181. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200ab300K600K900K1200K1500K1199743.221141859.25

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetab3691215SE +/- 0.05, N = 2SE +/- 0.28, N = 29.6210.01MIN: 9.35 / MAX: 10.52MIN: 9.4 / MAX: 59.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetba1.27132.54263.81395.08526.3565SE +/- 0.22, N = 2SE +/- 0.43, N = 25.445.65MIN: 5.08 / MAX: 7.52MIN: 5.03 / MAX: 6.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 57ab500M1000M1500M2000M2500MSE +/- 13350000.00, N = 2251920000024263500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256ba50100150200250SE +/- 3.31, N = 2230.54222.221. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256ab20406080100SE +/- 0.00, N = 2102.2898.831. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50ab48121620SE +/- 0.36, N = 2SE +/- 0.83, N = 217.5718.15MIN: 16.92 / MAX: 18.88MIN: 16.98 / MAX: 42.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mab918273645SE +/- 0.86, N = 2SE +/- 1.10, N = 238.2039.37MIN: 36.18 / MAX: 62.76MIN: 37.07 / MAX: 103.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200ba4812162014.4214.84MAX: 596.84MAX: 605.55

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200ab4812162017.5418.04MAX: 680.16MAX: 597.99

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128ab20406080100SE +/- 1.46, N = 294.4591.901. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefaceab1.03732.07463.11194.14925.1865SE +/- 0.09, N = 2SE +/- 0.01, N = 24.494.61MIN: 4.31 / MAX: 5.13MIN: 4.49 / MAX: 5.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 256ba20406080100SE +/- 0.34, N = 2104.35101.801. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetba246810SE +/- 0.01, N = 2SE +/- 0.07, N = 27.437.61MIN: 7.16 / MAX: 15.37MIN: 7.33 / MAX: 43.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 128ab1530456075SE +/- 1.01, N = 269.4867.951. (CXX) g++ options: -O3

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssdab48121620SE +/- 0.07, N = 2SE +/- 0.41, N = 215.7816.13MIN: 15.4 / MAX: 43.08MIN: 15.35 / MAX: 39.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500ba300K600K900K1200K1500K1372429.581343156.56

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformerba1122334455SE +/- 1.19, N = 2SE +/- 2.46, N = 245.5646.50MIN: 43.24 / MAX: 70.59MIN: 42.6 / MAX: 72.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetba48121620SE +/- 0.37, N = 2SE +/- 1.05, N = 216.7217.06MIN: 15.67 / MAX: 100.92MIN: 15.5 / MAX: 66.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stress-NG

Test: Pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Pthreadab20K40K60K80K100KSE +/- 279.15, N = 2SE +/- 894.90, N = 292131.5490361.701. (CXX) g++ options: -O2 -std=gnu99 -lc

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500ba81624324032.7533.38MAX: 992.49MAX: 934.86

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 128ab4080120160200SE +/- 0.86, N = 2185.45182.031. (CXX) g++ options: -O3

Z3 Theorem Prover

SMT File: 1.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 1.smt2ba612182430SE +/- 0.07, N = 2SE +/- 0.03, N = 225.2525.711. (CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200ba200K400K600K800K1000K920435.77904320.60

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Crownab1632486480SE +/- 0.12, N = 272.0470.79MIN: 68.2 / MAX: 79.55MIN: 67 / MAX: 79.71

OSPRay

Benchmark: particle_volume/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/scivis/real_timeba612182430SE +/- 0.03, N = 224.7824.36

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 512ab20406080100SE +/- 0.13, N = 2SE +/- 0.73, N = 294.2692.721. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200ab140K280K420K560K700K638644.35628202.55

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeab510152025SE +/- 0.05, N = 220.8120.48

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200ba7M14M21M28M35M34807016.8534266143.85

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512ab1122334455SE +/- 0.29, N = 2SE +/- 0.39, N = 249.4448.721. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 512ab4080120160200SE +/- 0.68, N = 2SE +/- 0.12, N = 2176.63174.111. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200ba102030405042.1942.79MAX: 784.56MAX: 855.16

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 512ab40M80M120M160M200MSE +/- 795000.00, N = 2SE +/- 650000.00, N = 22016150001987900001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fasterba3691215SE +/- 0.10, N = 2SE +/- 0.03, N = 210.4310.281. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Exampleab3691215SE +/- 0.10, N = 212.2012.371. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0ab3691215SE +/- 0.23, N = 2SE +/- 0.26, N = 211.4811.64MIN: 10.9 / MAX: 56.34MIN: 10.85 / MAX: 37.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 57ba130M260M390M520M650MSE +/- 11305000.00, N = 2SE +/- 3155000.00, N = 26235350006151050001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256ba1122334455SE +/- 0.55, N = 246.4645.851. (CXX) g++ options: -O3

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 160 - Buffer Length: 256 - Filter Length: 57ba600M1200M1800M2400M3000MSE +/- 17250000.00, N = 2263645000026023000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timeab510152025SE +/- 0.22, N = 221.2120.95

libxsmm

M N K: 256

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256ab130260390520650SE +/- 2.65, N = 2599.8592.51. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinyba612182430SE +/- 0.64, N = 2SE +/- 0.65, N = 224.4824.77MIN: 22.66 / MAX: 47.54MIN: 22.68 / MAX: 208.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 512ab20406080100SE +/- 0.40, N = 2SE +/- 0.24, N = 293.3392.261. (CXX) g++ options: -O3

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32ba400M800M1200M1600M2000MSE +/- 2750000.00, N = 2182545000018050000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 512ab90M180M270M360M450MSE +/- 2775000.00, N = 24007300003962650001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: Vector Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Vector Floating Pointab30K60K90K120K150KSE +/- 872.22, N = 2SE +/- 235.00, N = 2132479.08131100.251. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 57ab300M600M900M1200M1500MSE +/- 20600000.00, N = 2119770000011855000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetba48121620SE +/- 0.28, N = 2SE +/- 0.82, N = 215.9016.06MIN: 15.2 / MAX: 39.56MIN: 14.92 / MAX: 25.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Onlyab0.6841.3682.0522.7363.42SE +/- 0.00, N = 23.043.01

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32ba140280420560700SE +/- 2.35, N = 2639.3633.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 32ba110M220M330M440M550MSE +/- 830000.00, N = 2SE +/- 1500000.00, N = 24984100004936600001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Z3 Theorem Prover

SMT File: 2.smt2

OpenBenchmarking.orgSeconds, Fewer Is BetterZ3 Theorem Prover 4.12.1SMT File: 2.smt2ba20406080100SE +/- 0.05, N = 2SE +/- 0.02, N = 287.1888.001. (CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fastba1.28632.57263.85895.14526.4315SE +/- 0.063, N = 2SE +/- 0.019, N = 25.7175.6721. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2ba3691215SE +/- 0.06, N = 2SE +/- 0.03, N = 29.759.82MIN: 9.56 / MAX: 13.68MIN: 9.6 / MAX: 12.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3ab246810SE +/- 0.14, N = 2SE +/- 0.03, N = 28.718.77MIN: 8.43 / MAX: 9.8MIN: 8.59 / MAX: 32.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RTLightmap.hdr.4096x4096 - Device: CPU-Onlyab0.33080.66160.99241.32321.654SE +/- 0.00, N = 21.471.46

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Onlyab0.68631.37262.05892.74523.4315SE +/- 0.01, N = 23.053.03

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512ba160M320M480M640M800MSE +/- 1250000.00, N = 27303100007258400001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlyab714212835SE +/- 0.01, N = 2SE +/- 0.02, N = 230.5930.77

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.30ab6001200180024003000SE +/- 2.00, N = 2SE +/- 1.80, N = 22622.92607.61. (CXX) g++ options: -O3 -march=native -fPIE -pie

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timeab510152025SE +/- 0.00, N = 222.7022.58

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 32ab600M1200M1800M2400M3000MSE +/- 6350000.00, N = 2296110000029451500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512ab20406080100SE +/- 1.11, N = 2SE +/- 0.95, N = 294.8394.341. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 256ba20406080100SE +/- 0.72, N = 2102.43101.941. (CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500ab81624324035.9936.16MAX: 724.8MAX: 769.5

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 128ba20406080100SE +/- 1.54, N = 2107.95107.451. (CXX) g++ options: -O3

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput Totalab2K4K6K8K10KSE +/- 47.35, N = 2SE +/- 33.95, N = 29800.59756.71. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512ab200M400M600M800M1000MSE +/- 2180000.00, N = 29494000009451900001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon Nanotubeba1020304050SE +/- 0.03, N = 2SE +/- 0.02, N = 245.6445.821. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128ab4080120160200SE +/- 0.21, N = 2159.34158.711. (CXX) g++ options: -O3

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragon Objba20406080100SE +/- 0.03, N = 2SE +/- 0.03, N = 277.2676.96MIN: 75.78 / MAX: 81.08MIN: 75.53 / MAX: 82.14

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2ab246810SE +/- 0.13, N = 2SE +/- 0.02, N = 27.917.94MIN: 7.68 / MAX: 9.6MIN: 7.81 / MAX: 10.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512ab20406080100SE +/- 1.08, N = 2SE +/- 1.17, N = 290.5790.241. (CXX) g++ options: -O3

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57ba400M800M1200M1600M2000MSE +/- 13650000.00, N = 2207665000020692000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 256ab1122334455SE +/- 0.08, N = 246.6646.511. (CXX) g++ options: -O3

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fasterba714212835SE +/- 0.23, N = 2SE +/- 0.37, N = 229.1829.081. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64ab30060090012001500SE +/- 1.25, N = 21219.91216.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16ba612182430SE +/- 0.34, N = 2SE +/- 0.84, N = 226.1926.27MIN: 24.19 / MAX: 341.6MIN: 24.05 / MAX: 301.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlyba612182430SE +/- 0.06, N = 2SE +/- 0.09, N = 223.6223.69

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 160 - Buffer Length: 256 - Filter Length: 32ab700M1400M2100M2800M3500MSE +/- 9150000.00, N = 2339070000033819500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlyab1428425670SE +/- 0.04, N = 2SE +/- 0.19, N = 262.3562.51

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500ba200K400K600K800K1000K1137612.611134736.54

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 512ab3M6M9M12M15MSE +/- 1000.00, N = 2SE +/- 34000.00, N = 213323000132910001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.1.2Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500ab200K400K600K800K1000K995259.68992909.69

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 256ab50100150200250SE +/- 2.75, N = 2236.67236.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 512ba1122334455SE +/- 0.07, N = 2SE +/- 0.14, N = 247.3947.281. (CXX) g++ options: -O3

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 32ab7M14M21M28M35MSE +/- 0.00, N = 2SE +/- 0.00, N = 232338000322670001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.1.2Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500ab81624324036.7436.82MAX: 793.88MAX: 691.5

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlyba50100150200250SE +/- 1.32, N = 2SE +/- 0.32, N = 2239.03239.55

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256ab20406080100SE +/- 1.52, N = 293.0192.821. (CXX) g++ options: -O3

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 160 - Buffer Length: 256 - Filter Length: 512ab200M400M600M800M1000MSE +/- 1800000.00, N = 2101320000010112000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Timed GCC Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 13.2Time To Compileba2004006008001000SE +/- 1.97, N = 2956.13957.95

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragon Objba20406080100SE +/- 0.05, N = 2SE +/- 0.06, N = 290.0189.84MIN: 87.6 / MAX: 94.43MIN: 87.68 / MAX: 94.71

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 128ab306090120150SE +/- 4.04, N = 2117.01116.841. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512ba4080120160200SE +/- 1.39, N = 2SE +/- 1.25, N = 2171.13170.911. (CXX) g++ options: -O3

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragonba20406080100SE +/- 0.24, N = 2SE +/- 0.32, N = 2104.55104.41MIN: 102.2 / MAX: 108.91MIN: 101.88 / MAX: 109.22

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragonab20406080100SE +/- 0.04, N = 2SE +/- 0.14, N = 285.2485.13MIN: 83.75 / MAX: 89.99MIN: 83.65 / MAX: 90.45

Stress-NG

Test: Fused Multiply-Add

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Fused Multiply-Addba40M80M120M160M200MSE +/- 92010.25, N = 2SE +/- 118686.48, N = 2181314757.42181083180.471. (CXX) g++ options: -O2 -std=gnu99 -lc

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Summer Nature 1080pab150300450600750SE +/- 0.50, N = 2699.97699.091. (CC) gcc options: -pthread -lm

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128ba4080120160200SE +/- 0.39, N = 2199.33199.101. (CXX) g++ options: -O3

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fastba48121620SE +/- 0.04, N = 2SE +/- 0.06, N = 215.7215.711. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 32ba200M400M600M800M1000MSE +/- 2085000.00, N = 29934450009925400001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/pathtracer/real_timeba306090120150SE +/- 0.63, N = 2151.27151.14

OSPRay

Benchmark: particle_volume/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/ao/real_timeab612182430SE +/- 0.09, N = 224.6424.62

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Chimera 1080pba110220330440550SE +/- 0.06, N = 2516.50516.171. (CC) gcc options: -pthread -lm

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput Threadab4080120160200SE +/- 1.70, N = 2SE +/- 0.90, N = 2164.8164.71. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: Downlink Processor Benchmark

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: Downlink Processor Benchmarkba120240360480600SE +/- 1.25, N = 2SE +/- 0.70, N = 2556.8556.51. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Stress-NG

Test: Vector Shuffle

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Vector Shuffleba10K20K30K40K50KSE +/- 42.22, N = 2SE +/- 1.26, N = 248076.7848054.481. (CXX) g++ options: -O2 -std=gnu99 -lc

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Summer Nature 4Kba60120180240300SE +/- 0.08, N = 2282.65282.531. (CC) gcc options: -pthread -lm

Stress-NG

Test: Wide Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Wide Vector Mathba500K1000K1500K2000K2500KSE +/- 497.69, N = 2SE +/- 1200.16, N = 22196242.212195391.411. (CXX) g++ options: -O2 -std=gnu99 -lc

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.4WAV To Opus Encodeba816243240SE +/- 0.01, N = 236.7336.741. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

Stress-NG

Test: AVL Tree

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: AVL Treeba130260390520650SE +/- 0.44, N = 2SE +/- 0.08, N = 2610.83610.691. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 57ba12M24M36M48M60MSE +/- 1500.00, N = 2SE +/- 500.00, N = 253926500539185001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 1.2.1Video Input: Chimera 1080p 10-bitab100200300400500SE +/- 0.41, N = 2476.82476.771. (CC) gcc options: -pthread -lm

Stress-NG

Test: Matrix 3D Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Matrix 3D Mathab3K6K9K12K15KSE +/- 9.80, N = 2SE +/- 5.06, N = 212743.8112742.701. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Floating Pointab5K10K15K20K25KSE +/- 6.86, N = 2SE +/- 9.14, N = 221134.8121133.021. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Zlib

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Zlibba15003000450060007500SE +/- 4.39, N = 2SE +/- 8.83, N = 26880.226879.861. (CXX) g++ options: -O2 -std=gnu99 -lc

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Crownba20406080100SE +/- 0.10, N = 287.9387.93MIN: 84.73 / MAX: 92.37MIN: 85.27 / MAX: 92.58

Apache CouchDB

Bulk Size: 500 - Inserts: 1000 - Rounds: 30

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 500 - Inserts: 1000 - Rounds: 30a20040060080010001090.421. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

Apache CouchDB

Bulk Size: 300 - Inserts: 1000 - Rounds: 30

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 300 - Inserts: 1000 - Rounds: 30a306090120150152.461. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

Apache CouchDB

Bulk Size: 100 - Inserts: 1000 - Rounds: 30

OpenBenchmarking.orgSeconds, Fewer Is BetterApache CouchDB 3.3.2Bulk Size: 100 - Inserts: 1000 - Rounds: 30a2040608010094.831. (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD


Phoronix Test Suite v10.8.5