xeon auggy

Tests for a future article. 2 x Intel Xeon Platinum 8380 testing with an Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 22.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2308065-NE-XEONAUGGY78
Run Management

Run a: August 06 2023, test duration 3 Hours, 24 Minutes
Run b: August 06 2023, test duration 3 Hours, 43 Minutes
Average test duration: 3 Hours, 33 Minutes


HTML result view exported from: https://openbenchmarking.org/result/2308065-NE-XEONAUGGY78&grs&export=pdf&rdt&rro.

System details (runs a and b)

Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)
Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)
Chipset: Intel Ice Lake IEH
Memory: 512GB
Disk: 7682GB INTEL SSDPF2KX076TZ
Graphics: ASPEED
Monitor: VE228
Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP
OS: Ubuntu 22.10
Kernel: 6.2.0-rc5-phx-dodt (x86_64)
Desktop: GNOME Shell 43.0
Display Server: X Server 1.21.1.3
Vulkan: 1.3.224
Compiler: GCC 12.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0xd000389

Java Details: OpenJDK Runtime Environment (build 11.0.19+7-post-Ubuntu-0ubuntu122.10.1)

Python Details: Python 3.10.7

Security Details: dodt: Mitigation of DOITM + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

[Result overview table: all tests, runs a and b. The individual results are listed per test below.]

libxsmm

M N K: 128

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
b: 1946.7
a: 1055.3
SE +/- 54.55, N = 2
(CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Stress-NG

Test: Cloning

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
b: 13172.81 (SE +/- 654.66, N = 2)
a: 16195.03 (SE +/- 3270.70, N = 2)
(CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Pipe

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
b: 49325396.97 (SE +/- 6369572.71, N = 2)
a: 40500166.81 (SE +/- 2523742.74, N = 2)
(CXX) g++ options: -O2 -std=gnu99 -lc

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

Apache IoTDB 1.1.2 (Average Latency, Fewer Is Better)
b: 96.18 (MAX: 1249.92)
a: 109.40 (MAX: 2142.92)

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500

Apache IoTDB 1.1.2 (point/sec, More Is Better)
b: 43021501.40
a: 39562245.22
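
The two Apache IoTDB results above use opposite scales (latency is "Fewer Is Better", point/sec is "More Is Better"), so the gap between runs a and b has to be read in opposite directions. The following is a minimal sketch of one way to put both on a common footing. The helper name `relative_performance` is hypothetical and not part of the Phoronix Test Suite; only the four input values come from this result file.

```python
def relative_performance(b: float, a: float, more_is_better: bool) -> float:
    """Return run b's performance relative to run a (1.0 means equal).

    Hypothetical helper for illustration: for "More Is Better" results the
    ratio is b / a, for "Fewer Is Better" results it is inverted to a / b.
    """
    if a <= 0 or b <= 0:
        raise ValueError("results must be positive")
    return b / a if more_is_better else a / b


# Apache IoTDB 1.1.2, Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 500
throughput = relative_performance(43021501.40, 39562245.22, more_is_better=True)
latency = relative_performance(96.18, 109.40, more_is_better=False)

print(f"point/sec: run b is {100 * (throughput - 1):+.1f}% relative to run a")
print(f"average latency: run b is {100 * (latency - 1):+.1f}% relative to run a")
```

A value above 1.0 means run b performed better than run a on that test, regardless of which direction the original unit favours.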

NCNN

Target: CPU - Model: resnet18

NCNN 20230517 (ms, Fewer Is Better)
b: 9.63 (SE +/- 0.31, N = 2; MIN: 9.16 / MAX: 26.03)
a: 10.31 (SE +/- 1.04, N = 2; MIN: 9.03 / MAX: 33.3)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2 (Average Latency, Fewer Is Better)
b: 14.12 (MAX: 878.17)
a: 13.25 (MAX: 896.77)

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 148.18
a: 156.22
SE +/- 2.45, N = 2
(CXX) g++ options: -O3

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2 (point/sec, More Is Better)
b: 1141859.25
a: 1199743.22

NCNN

Target: CPU - Model: FastestDet

NCNN 20230517 (ms, Fewer Is Better)
b: 10.01 (SE +/- 0.28, N = 2; MIN: 9.4 / MAX: 59.43)
a: 9.62 (SE +/- 0.05, N = 2; MIN: 9.35 / MAX: 10.52)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20230517 (ms, Fewer Is Better)
b: 5.44 (SE +/- 0.22, N = 2; MIN: 5.08 / MAX: 7.52)
a: 5.65 (SE +/- 0.43, N = 2; MIN: 5.03 / MAX: 6.71)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 2426350000
a: 2519200000
SE +/- 13350000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 230.54
a: 222.22
SE +/- 3.31, N = 2
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 98.83
a: 102.28
SE +/- 0.00, N = 2
(CXX) g++ options: -O3

NCNN

Target: CPU - Model: resnet50

NCNN 20230517 (ms, Fewer Is Better)
b: 18.15 (SE +/- 0.83, N = 2; MIN: 16.98 / MAX: 42.81)
a: 17.57 (SE +/- 0.36, N = 2; MIN: 16.92 / MAX: 18.88)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20230517 (ms, Fewer Is Better)
b: 39.37 (SE +/- 1.10, N = 2; MIN: 37.07 / MAX: 103.97)
a: 38.20 (SE +/- 0.86, N = 2; MIN: 36.18 / MAX: 62.76)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2 (Average Latency, Fewer Is Better)
b: 14.42 (MAX: 596.84)
a: 14.84 (MAX: 605.55)

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2 (Average Latency, Fewer Is Better)
b: 18.04 (MAX: 597.99)
a: 17.54 (MAX: 680.16)

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 91.90
a: 94.45
SE +/- 1.46, N = 2
(CXX) g++ options: -O3

NCNN

Target: CPU - Model: blazeface

NCNN 20230517 (ms, Fewer Is Better)
b: 4.61 (SE +/- 0.01, N = 2; MIN: 4.49 / MAX: 5.33)
a: 4.49 (SE +/- 0.09, N = 2; MIN: 4.31 / MAX: 5.13)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 104.35
a: 101.80
SE +/- 0.34, N = 2
(CXX) g++ options: -O3

NCNN

Target: CPU - Model: mnasnet

NCNN 20230517 (ms, Fewer Is Better)
b: 7.43 (SE +/- 0.01, N = 2; MIN: 7.16 / MAX: 15.37)
a: 7.61 (SE +/- 0.07, N = 2; MIN: 7.33 / MAX: 43.29)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 67.95
a: 69.48
SE +/- 1.01, N = 2
(CXX) g++ options: -O3

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20230517 (ms, Fewer Is Better)
b: 16.13 (SE +/- 0.41, N = 2; MIN: 15.35 / MAX: 39.36)
a: 15.78 (SE +/- 0.07, N = 2; MIN: 15.4 / MAX: 43.08)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2 (point/sec, More Is Better)
b: 1372429.58
a: 1343156.56

NCNN

Target: CPU - Model: vision_transformer

NCNN 20230517 (ms, Fewer Is Better)
b: 45.56 (SE +/- 1.19, N = 2; MIN: 43.24 / MAX: 70.59)
a: 46.50 (SE +/- 2.46, N = 2; MIN: 42.6 / MAX: 72.28)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20230517 (ms, Fewer Is Better)
b: 16.72 (SE +/- 0.37, N = 2; MIN: 15.67 / MAX: 100.92)
a: 17.06 (SE +/- 1.05, N = 2; MIN: 15.5 / MAX: 66.12)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stress-NG

Test: Pthread

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
b: 90361.70 (SE +/- 894.90, N = 2)
a: 92131.54 (SE +/- 279.15, N = 2)
(CXX) g++ options: -O2 -std=gnu99 -lc

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2 (Average Latency, Fewer Is Better)
b: 32.75 (MAX: 992.49)
a: 33.38 (MAX: 934.86)

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 182.03
a: 185.45
SE +/- 0.86, N = 2
(CXX) g++ options: -O3

Z3 Theorem Prover

SMT File: 1.smt2

Z3 Theorem Prover 4.12.1 (Seconds, Fewer Is Better)
b: 25.25 (SE +/- 0.07, N = 2)
a: 25.71 (SE +/- 0.03, N = 2)
(CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2 (point/sec, More Is Better)
b: 920435.77
a: 904320.60

Embree

Binary: Pathtracer - Model: Crown

Embree 4.1 (Frames Per Second, More Is Better)
b: 70.79 (MIN: 67 / MAX: 79.71)
a: 72.04 (MIN: 68.2 / MAX: 79.55)
SE +/- 0.12, N = 2

OSPRay

Benchmark: particle_volume/scivis/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
b: 24.78
a: 24.36
SE +/- 0.03, N = 2

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 92.72 (SE +/- 0.73, N = 2)
a: 94.26 (SE +/- 0.13, N = 2)
(CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 200

Apache IoTDB 1.1.2 (point/sec, More Is Better)
b: 628202.55
a: 638644.35

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
b: 20.48
a: 20.81
SE +/- 0.05, N = 2

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

Apache IoTDB 1.1.2 (point/sec, More Is Better)
b: 34807016.85
a: 34266143.85

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 48.72 (SE +/- 0.39, N = 2)
a: 49.44 (SE +/- 0.29, N = 2)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 174.11 (SE +/- 0.12, N = 2)
a: 176.63 (SE +/- 0.68, N = 2)
(CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 100 - Sensor Count: 200

Apache IoTDB 1.1.2 (Average Latency, Fewer Is Better)
b: 42.19 (MAX: 784.56)
a: 42.79 (MAX: 855.16)

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 198790000 (SE +/- 650000.00, N = 2)
a: 201615000 (SE +/- 795000.00, N = 2)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

VVenC 1.9 (Frames Per Second, More Is Better)
b: 10.43 (SE +/- 0.10, N = 2)
a: 10.28 (SE +/- 0.03, N = 2)
(CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Remhos

Test: Sample Remap Example

Remhos 1.0 (Seconds, Fewer Is Better)
b: 12.37
a: 12.20
SE +/- 0.10, N = 2
(CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20230517 (ms, Fewer Is Better)
b: 11.64 (SE +/- 0.26, N = 2; MIN: 10.85 / MAX: 37.54)
a: 11.48 (SE +/- 0.23, N = 2; MIN: 10.9 / MAX: 56.34)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 623535000 (SE +/- 11305000.00, N = 2)
a: 615105000 (SE +/- 3155000.00, N = 2)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 46.46
a: 45.85
SE +/- 0.55, N = 2
(CXX) g++ options: -O3

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 2636450000
a: 2602300000
SE +/- 17250000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
b: 20.95
a: 21.21
SE +/- 0.22, N = 2

libxsmm

M N K: 256

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
b: 592.5
a: 599.8
SE +/- 2.65, N = 2
(CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20230517 (ms, Fewer Is Better)
b: 24.48 (SE +/- 0.64, N = 2; MIN: 22.66 / MAX: 47.54)
a: 24.77 (SE +/- 0.65, N = 2; MIN: 22.68 / MAX: 208.18)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 92.26 (SE +/- 0.24, N = 2)
a: 93.33 (SE +/- 0.40, N = 2)
(CXX) g++ options: -O3

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 1825450000
a: 1805000000
SE +/- 2750000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 396265000
a: 400730000
SE +/- 2775000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: Vector Floating Point

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
b: 131100.25 (SE +/- 235.00, N = 2)
a: 132479.08 (SE +/- 872.22, N = 2)
(CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 1185500000
a: 1197700000
SE +/- 20600000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

NCNN

Target: CPU - Model: mobilenet

NCNN 20230517 (ms, Fewer Is Better)
b: 15.90 (SE +/- 0.28, N = 2; MIN: 15.2 / MAX: 39.56)
a: 16.06 (SE +/- 0.82, N = 2; MIN: 14.92 / MAX: 25.43)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only

Intel Open Image Denoise 2.0 (Images / Sec, More Is Better)
b: 3.01
a: 3.04
SE +/- 0.00, N = 2

libxsmm

M N K: 32

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
b: 639.3
a: 633.2
SE +/- 2.35, N = 2
(CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 498410000 (SE +/- 830000.00, N = 2)
a: 493660000 (SE +/- 1500000.00, N = 2)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Z3 Theorem Prover

SMT File: 2.smt2

Z3 Theorem Prover 4.12.1 (Seconds, Fewer Is Better)
b: 87.18 (SE +/- 0.05, N = 2)
a: 88.00 (SE +/- 0.02, N = 2)
(CXX) g++ options: -lpthread -std=c++17 -fvisibility=hidden -mfpmath=sse -msse -msse2 -O3 -fPIC

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

VVenC 1.9 (Frames Per Second, More Is Better)
b: 5.717 (SE +/- 0.063, N = 2)
a: 5.672 (SE +/- 0.019, N = 2)
(CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20230517 (ms, Fewer Is Better)
b: 9.75 (SE +/- 0.06, N = 2; MIN: 9.56 / MAX: 13.68)
a: 9.82 (SE +/- 0.03, N = 2; MIN: 9.6 / MAX: 12.61)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517 (ms, Fewer Is Better)
b: 8.77 (SE +/- 0.03, N = 2; MIN: 8.59 / MAX: 32.78)
a: 8.71 (SE +/- 0.14, N = 2; MIN: 8.43 / MAX: 9.8)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only

Intel Open Image Denoise 2.0 (Images / Sec, More Is Better)
b: 1.46
a: 1.47
SE +/- 0.00, N = 2

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only

Intel Open Image Denoise 2.0 (Images / Sec, More Is Better)
b: 3.03
a: 3.05
SE +/- 0.01, N = 2

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 730310000
a: 725840000
SE +/- 1250000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 3.6 (Seconds, Fewer Is Better)
b: 30.77 (SE +/- 0.02, N = 2)
a: 30.59 (SE +/- 0.01, N = 2)

QuantLib

QuantLib 1.30 (MFLOPS, More Is Better)
b: 2607.6 (SE +/- 1.80, N = 2)
a: 2622.9 (SE +/- 2.00, N = 2)
(CXX) g++ options: -O3 -march=native -fPIE -pie

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OSPRay 2.12 (Items Per Second, More Is Better)
b: 22.58
a: 22.70
SE +/- 0.00, N = 2

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 2945150000
a: 2961100000
SE +/- 6350000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 94.34 (SE +/- 0.95, N = 2)
a: 94.83 (SE +/- 1.11, N = 2)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 102.43
a: 101.94
SE +/- 0.72, N = 2
(CXX) g++ options: -O3

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2 (Average Latency, Fewer Is Better)
b: 36.16 (MAX: 769.5)
a: 35.99 (MAX: 724.8)

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 107.95
a: 107.45
SE +/- 1.54, N = 2
(CXX) g++ options: -O3

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.5 (Mbps, More Is Better)
b: 9756.7 (SE +/- 33.95, N = 2)
a: 9800.5 (SE +/- 47.35, N = 2)
(CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
b: 945190000
a: 949400000
SE +/- 2180000.00, N = 2
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

GPAW

Input: Carbon Nanotube

GPAW 23.6 (Seconds, Fewer Is Better)
b: 45.64 (SE +/- 0.03, N = 2)
a: 45.82 (SE +/- 0.02, N = 2)
(CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
b: 158.71
a: 159.34
SE +/- 0.21, N = 2
(CXX) g++ options: -O3

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

Embree 4.1; Frames Per Second, More Is Better; b: 77.26, a: 76.96; SE +/- 0.03, N = 2 and SE +/- 0.03, N = 2; MIN: 75.78 / MAX: 81.08, MIN: 75.53 / MAX: 82.14

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517; ms, Fewer Is Better; b: 7.94, a: 7.91; SE +/- 0.02, N = 2 and SE +/- 0.13, N = 2; MIN: 7.81 / MAX: 10.99, MIN: 7.68 / MAX: 9.6; (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 90.24, a: 90.57; SE +/- 1.17, N = 2 and SE +/- 1.08, N = 2; (CXX) g++ options: -O3

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6; samples/s, More Is Better; b: 2076650000, a: 2069200000; SE +/- 13650000.00, N = 2; (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 46.51, a: 46.66; SE +/- 0.08, N = 2; (CXX) g++ options: -O3

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

VVenC 1.9; Frames Per Second, More Is Better; b: 29.18, a: 29.08; SE +/- 0.23, N = 2 and SE +/- 0.37, N = 2; (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

libxsmm

M N K: 64

libxsmm 2-1.17-3645; GFLOPS/s, More Is Better; b: 1216.0, a: 1219.9; SE +/- 1.25, N = 2; (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

NCNN

Target: CPU - Model: vgg16

NCNN 20230517; ms, Fewer Is Better; b: 26.19, a: 26.27; SE +/- 0.34, N = 2 and SE +/- 0.84, N = 2; MIN: 24.19 / MAX: 341.6, MIN: 24.05 / MAX: 301.35; (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 3.6; Seconds, Fewer Is Better; b: 23.62, a: 23.69; SE +/- 0.06, N = 2 and SE +/- 0.09, N = 2

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6; samples/s, More Is Better; b: 3381950000, a: 3390700000; SE +/- 9150000.00, N = 2; (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 3.6; Seconds, Fewer Is Better; b: 62.51, a: 62.35; SE +/- 0.19, N = 2 and SE +/- 0.04, N = 2

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2; point/sec, More Is Better; b: 1137612.61, a: 1134736.54

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6; samples/s, More Is Better; b: 13291000, a: 13323000; SE +/- 34000.00, N = 2 and SE +/- 1000.00, N = 2; (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Apache IoTDB

Device Count: 100 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2; point/sec, More Is Better; b: 992909.69, a: 995259.68

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 236.12, a: 236.67; SE +/- 2.75, N = 2; (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 47.39, a: 47.28; SE +/- 0.07, N = 2 and SE +/- 0.14, N = 2; (CXX) g++ options: -O3

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6; samples/s, More Is Better; b: 32267000, a: 32338000; SE +/- 0.00, N = 2 and SE +/- 0.00, N = 2; (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Apache IoTDB

Device Count: 200 - Batch Size Per Write: 1 - Sensor Count: 500

Apache IoTDB 1.1.2; Average Latency, Fewer Is Better; b: 36.82, a: 36.74; MAX: 691.5, MAX: 793.88

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 3.6; Seconds, Fewer Is Better; b: 239.03, a: 239.55; SE +/- 1.32, N = 2 and SE +/- 0.32, N = 2

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 92.82, a: 93.01; SE +/- 1.52, N = 2; (CXX) g++ options: -O3

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6; samples/s, More Is Better; b: 1011200000, a: 1013200000; SE +/- 1800000.00, N = 2; (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Timed GCC Compilation

Time To Compile

Timed GCC Compilation 13.2; Seconds, Fewer Is Better; b: 956.13, a: 957.95; SE +/- 1.97, N = 2

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

Embree 4.1; Frames Per Second, More Is Better; b: 90.01, a: 89.84; SE +/- 0.05, N = 2 and SE +/- 0.06, N = 2; MIN: 87.6 / MAX: 94.43, MIN: 87.68 / MAX: 94.71

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 116.84, a: 117.01; SE +/- 4.04, N = 2; (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 171.13, a: 170.91; SE +/- 1.39, N = 2 and SE +/- 1.25, N = 2; (CXX) g++ options: -O3

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 4.1; Frames Per Second, More Is Better; b: 104.55, a: 104.41; SE +/- 0.24, N = 2 and SE +/- 0.32, N = 2; MIN: 102.2 / MAX: 108.91, MIN: 101.88 / MAX: 109.22

Embree

Binary: Pathtracer - Model: Asian Dragon

Embree 4.1; Frames Per Second, More Is Better; b: 85.13, a: 85.24; SE +/- 0.14, N = 2 and SE +/- 0.04, N = 2; MIN: 83.65 / MAX: 90.45, MIN: 83.75 / MAX: 89.99

Stress-NG

Test: Fused Multiply-Add

Stress-NG 0.15.10; Bogo Ops/s, More Is Better; b: 181314757.42, a: 181083180.47; SE +/- 92010.25, N = 2 and SE +/- 118686.48, N = 2; (CXX) g++ options: -O2 -std=gnu99 -lc

dav1d

Video Input: Summer Nature 1080p

dav1d 1.2.1; FPS, More Is Better; b: 699.09, a: 699.97; SE +/- 0.50, N = 2; (CC) gcc options: -pthread -lm

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3; GFLOP/s, More Is Better; b: 199.33, a: 199.10; SE +/- 0.39, N = 2; (CXX) g++ options: -O3

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

VVenC 1.9; Frames Per Second, More Is Better; b: 15.72, a: 15.71; SE +/- 0.04, N = 2 and SE +/- 0.06, N = 2; (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6; samples/s, More Is Better; b: 993445000, a: 992540000; SE +/- 2085000.00, N = 2; (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OSPRay 2.12; Items Per Second, More Is Better; b: 151.27, a: 151.14; SE +/- 0.63, N = 2

OSPRay

Benchmark: particle_volume/ao/real_time

OSPRay 2.12; Items Per Second, More Is Better; b: 24.62, a: 24.64; SE +/- 0.09, N = 2

dav1d

Video Input: Chimera 1080p

dav1d 1.2.1; FPS, More Is Better; b: 516.50, a: 516.17; SE +/- 0.06, N = 2; (CC) gcc options: -pthread -lm

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

srsRAN Project 23.5; Mbps, More Is Better; b: 164.7, a: 164.8; SE +/- 0.90, N = 2 and SE +/- 1.70, N = 2; (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: Downlink Processor Benchmark

srsRAN Project 23.5; Mbps, More Is Better; b: 556.8, a: 556.5; SE +/- 1.25, N = 2 and SE +/- 0.70, N = 2; (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest

Stress-NG

Test: Vector Shuffle

Stress-NG 0.15.10; Bogo Ops/s, More Is Better; b: 48076.78, a: 48054.48; SE +/- 42.22, N = 2 and SE +/- 1.26, N = 2; (CXX) g++ options: -O2 -std=gnu99 -lc

dav1d

Video Input: Summer Nature 4K

dav1d 1.2.1; FPS, More Is Better; b: 282.65, a: 282.53; SE +/- 0.08, N = 2; (CC) gcc options: -pthread -lm

Stress-NG

Test: Wide Vector Math

Stress-NG 0.15.10; Bogo Ops/s, More Is Better; b: 2196242.21, a: 2195391.41; SE +/- 497.69, N = 2 and SE +/- 1200.16, N = 2; (CXX) g++ options: -O2 -std=gnu99 -lc

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.4; Seconds, Fewer Is Better; b: 36.73, a: 36.74; SE +/- 0.01, N = 2; (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

Stress-NG

Test: AVL Tree

Stress-NG 0.15.10; Bogo Ops/s, More Is Better; b: 610.83, a: 610.69; SE +/- 0.44, N = 2 and SE +/- 0.08, N = 2; (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6; samples/s, More Is Better; b: 53926500, a: 53918500; SE +/- 1500.00, N = 2 and SE +/- 500.00, N = 2; (CC) gcc options: -O3 -pthread -lm -lc -lliquid

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 1.2.1; FPS, More Is Better; b: 476.77, a: 476.82; SE +/- 0.41, N = 2; (CC) gcc options: -pthread -lm

Stress-NG

Test: Matrix 3D Math

Stress-NG 0.15.10; Bogo Ops/s, More Is Better; b: 12742.70, a: 12743.81; SE +/- 5.06, N = 2 and SE +/- 9.80, N = 2; (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Floating Point

Stress-NG 0.15.10; Bogo Ops/s, More Is Better; b: 21133.02, a: 21134.81; SE +/- 9.14, N = 2 and SE +/- 6.86, N = 2; (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Zlib

Stress-NG 0.15.10; Bogo Ops/s, More Is Better; b: 6880.22, a: 6879.86; SE +/- 4.39, N = 2 and SE +/- 8.83, N = 2; (CXX) g++ options: -O2 -std=gnu99 -lc

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 4.1; Frames Per Second, More Is Better; b: 87.93, a: 87.93; SE +/- 0.10, N = 2; MIN: 84.73 / MAX: 92.37, MIN: 85.27 / MAX: 92.58

Apache CouchDB

Bulk Size: 500 - Inserts: 1000 - Rounds: 30

Apache CouchDB 3.3.2; Seconds, Fewer Is Better; a: 1090.42; (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

Apache CouchDB

Bulk Size: 300 - Inserts: 1000 - Rounds: 30

Apache CouchDB 3.3.2; Seconds, Fewer Is Better; a: 152.46; (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD

Apache CouchDB

Bulk Size: 100 - Inserts: 1000 - Rounds: 30

Apache CouchDB 3.3.2; Seconds, Fewer Is Better; a: 94.83; (CXX) g++ options: -std=c++17 -lmozjs-78 -lm -lei -fPIC -MMD


Phoronix Test Suite v10.8.4