AMD EPYC 9755 DDR5 Turin Memory Performance

AMD EPYC 9755 memory performance at the default DDR5-6000 versus DDR5-4800. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2410130-NE-TURINDDR565&grs&sor.

Configurations: DDR5-4800, DDR5-6000 (only the Memory entry differs between them)

Processor: AMD EPYC 9755 128-Core @ 2.70GHz (128 Cores / 256 Threads)
Motherboard: AMD VOLCANO (RVOT1000D BIOS)
Chipset: AMD Device 153a
Memory (DDR5-4800): 12 x 64GB DDR5-4800MT/s Samsung M321R8GA0PB1-CCPKC
Memory (DDR5-6000): 12 x 64GB DDR5-6000MT/s Samsung M321R8GA0PB1-CCPKC
Disk: 2 x 1920GB KIOXIA KCD8XPUG1T92
Graphics: ASPEED
Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 24.04
Kernel: 6.10.0-phx (x86_64)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1200

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002110

Java Details: OpenJDK Runtime Environment (build 21.0.3-ea+7-Ubuntu-1build1)

Python Details: Python 3.12.2

Security Details: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
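As a rough point of reference for the two memory configurations, the theoretical peak bandwidth can be estimated from the transfer rate. The sketch below assumes one DIMM per channel across 12 channels and an 8-byte (64-bit) data path per channel; both are assumptions rather than figures stated in the result file, and `peak_bandwidth_gbs` is a hypothetical helper name.

```python
def peak_bandwidth_gbs(channels: int, mega_transfers: int, bytes_per_transfer: int = 8) -> float:
    """Theoretical peak bandwidth in GB/s (10^9 bytes per second).

    Assumes every channel moves `bytes_per_transfer` bytes on each transfer.
    """
    return channels * mega_transfers * bytes_per_transfer / 1000.0


# Assumption: 12 populated channels, matching the 12 x 64GB DIMMs in each configuration.
ddr5_4800 = peak_bandwidth_gbs(12, 4800)
ddr5_6000 = peak_bandwidth_gbs(12, 6000)

print(f"DDR5-4800: {ddr5_4800:.1f} GB/s")
print(f"DDR5-6000: {ddr5_6000:.1f} GB/s")
print(f"Theoretical uplift: {(ddr5_6000 / ddr5_4800 - 1) * 100:.0f}%")
```

Under these assumptions the faster memory offers at most 25% more bandwidth, which is a useful ceiling to keep in mind when reading the measured differences.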

Result overview table: DDR5-4800 versus DDR5-6000 for every test in this result file. The individual results are listed per test below.

ASKAP

Test: tConvolve MT - Degridding

ASKAP 1.0 - Million Grid Points Per Second, More Is Better
DDR5-6000: 27297.4 (SE +/- 28.93, N = 3)
DDR5-4800: 22665.1 (SE +/- 10.95, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

High Performance Conjugate Gradient 3.1 - GFLOP/s, More Is Better
DDR5-6000: 63.67 (SE +/- 0.01, N = 3)
DDR5-4800: 53.74 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
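To compare results that use different units and directions, each pair can be turned into a percentage uplift for DDR5-6000. The sketch below is one possible convention, not something defined by the result file: for "More Is Better" results it divides the DDR5-6000 value by the DDR5-4800 value, and for "Fewer Is Better" results it inverts the ratio so that a positive number always favors DDR5-6000. `uplift_percent` is a hypothetical helper name; the inputs are the HPCG and CloverLeaf clover_bm16 results from this file.

```python
def uplift_percent(ddr5_6000: float, ddr5_4800: float, more_is_better: bool = True) -> float:
    """Percent advantage of DDR5-6000 over DDR5-4800; positive favors DDR5-6000."""
    if more_is_better:
        ratio = ddr5_6000 / ddr5_4800
    else:
        # For times and latencies, a smaller DDR5-6000 value is the better one.
        ratio = ddr5_4800 / ddr5_6000
    return (ratio - 1.0) * 100.0


# HPCG, GFLOP/s (More Is Better)
hpcg = uplift_percent(63.67, 53.74)
# CloverLeaf clover_bm16, seconds (Fewer Is Better)
cloverleaf = uplift_percent(178.93, 209.96, more_is_better=False)

print(f"HPCG uplift: {hpcg:.1f}%")
print(f"CloverLeaf clover_bm16 uplift: {cloverleaf:.1f}%")
```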

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 - Figure Of Merit, More Is Better
DDR5-6000: 3177082000 (SE +/- 6342005.91, N = 3)
DDR5-4800: 2698476000 (SE +/- 2882733.14, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

CloverLeaf

Input: clover_bm16

CloverLeaf 1.3 - Seconds, Fewer Is Better
DDR5-6000: 178.93 (SE +/- 0.24, N = 3)
DDR5-4800: 209.96 (SE +/- 0.08, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

miniFE

Problem Size: Small

miniFE 2.2 - CG Mflops, More Is Better
DDR5-6000: 69057.1 (SE +/- 78.85, N = 6)
DDR5-4800: 59008.4 (SE +/- 20.01, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

CloverLeaf

Input: clover_bm64_short

CloverLeaf 1.3 - Seconds, Fewer Is Better
DDR5-6000: 22.09 (SE +/- 0.07, N = 3)
DDR5-4800: 25.79 (SE +/- 0.11, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

ASKAP

Test: tConvolve MT - Gridding

ASKAP 1.0 - Million Grid Points Per Second, More Is Better
DDR5-6000: 16464.1 (SE +/- 0.00, N = 3)
DDR5-4800: 14143.3 (SE +/- 9.34, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 - Total Mop/s, More Is Better
DDR5-6000: 194853.91 (SE +/- 878.07, N = 5)
DDR5-4800: 172595.77 (SE +/- 1358.70, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 - Total Mop/s, More Is Better
DDR5-6000: 167026.93 (SE +/- 1302.88, N = 15)
DDR5-4800: 149191.03 (SE +/- 239.36, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
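Several results can be condensed into one figure by taking the geometric mean of the per-test ratios, which keeps tests with large absolute numbers from dominating. The sketch below does this for five of the "More Is Better" results listed so far; the selection of tests and the `results` dictionary are illustrative choices, not a summary published with the result file.

```python
from statistics import geometric_mean

# (DDR5-6000, DDR5-4800) pairs copied from the results above; all are More Is Better.
results = {
    "ASKAP tConvolve MT - Degridding": (27297.4, 22665.1),
    "HPCG 144 144 144 - 60": (63.67, 53.74),
    "miniFE Small": (69057.1, 59008.4),
    "NPB SP.C": (194853.91, 172595.77),
    "NPB MG.C": (167026.93, 149191.03),
}

# Ratio above 1.0 means DDR5-6000 is faster on that test.
ratios = [fast / slow for fast, slow in results.values()]
overall = geometric_mean(ratios)

for name, ratio in zip(results, ratios):
    print(f"{name}: {ratio:.3f}x")
print(f"Geometric mean of ratios: {overall:.3f}x")
```

"Fewer Is Better" results would need their ratios inverted before being mixed into the same mean.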

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

Xcompact3d Incompact3d 2021-03-11 - Seconds, Fewer Is Better
DDR5-6000: 199.84 (SE +/- 0.78, N = 3)
DDR5-4800: 222.62 (SE +/- 0.04, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

WRF

Input: conus 2.5km

WRF 4.2.2 - Seconds, Fewer Is Better
DDR5-6000: 5574.36
DDR5-4800: 6206.69
1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenRadioss

Model: Chrysler Neon 1M

OpenRadioss 2023.09.15 - Seconds, Fewer Is Better
DDR5-6000: 77.26 (SE +/- 0.11, N = 3)
DDR5-4800: 85.36 (SE +/- 0.19, N = 3)

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 - Seconds, Fewer Is Better
DDR5-6000: 160.37
DDR5-4800: 177.11
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

Llamafile

Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU

Llamafile 0.7 - Tokens Per Second, More Is Better
DDR5-6000: 10.64 (SE +/- 0.06, N = 3)
DDR5-4800: 9.71 (SE +/- 0.01, N = 3)

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-6000: 733.80 (SE +/- 4.77, N = 15)
DDR5-4800: 674.46 (SE +/- 1.38, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-6000: 87.14 (SE +/- 0.54, N = 15; MIN: 35.35 / MAX: 200.15)
DDR5-4800: 94.77 (SE +/- 0.19, N = 3; MIN: 40.52 / MAX: 153.33)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

ASKAP

Test: tConvolve MPI - Gridding

ASKAP 1.0 - Mpix/sec, More Is Better
DDR5-6000: 74093.3 (SE +/- 438.43, N = 3)
DDR5-4800: 68467.4 (SE +/- 752.40, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 - Total Mop/s, More Is Better
DDR5-6000: 8647.62 (SE +/- 55.66, N = 6)
DDR5-4800: 7995.28 (SE +/- 34.68, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-6000: 10557.16 (SE +/- 33.49, N = 3)
DDR5-4800: 9833.47 (SE +/- 39.39, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10 - Seconds, Fewer Is Better
DDR5-6000: 21.44
DDR5-4800: 22.90
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

LULESH

LULESH 2.0.3 - z/s, More Is Better
DDR5-6000: 37645.97 (SE +/- 450.61, N = 15)
DDR5-4800: 35386.94 (SE +/- 386.37, N = 15)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-6000: 57.57 (SE +/- 0.16, N = 3; MIN: 34.99 / MAX: 106.02)
DDR5-4800: 61.24 (SE +/- 0.40, N = 3; MIN: 27.85 / MAX: 106.26)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-6000: 1109.75 (SE +/- 2.99, N = 3)
DDR5-4800: 1043.31 (SE +/- 6.77, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 246.24 (SE +/- 0.52, N = 3)
DDR5-4800: 231.71 (SE +/- 0.47, N = 3)

ASKAP

Test: tConvolve MPI - Degridding

ASKAP 1.0 - Mpix/sec, More Is Better
DDR5-6000: 70762.8 (SE +/- 395.30, N = 3)
DDR5-4800: 66651.4 (SE +/- 610.89, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5 - Queries Per Minute, Geo Mean, More Is Better
DDR5-6000: 729.56 (SE +/- 5.27, N = 3; MIN: 81.63 / MAX: 7500)
DDR5-4800: 693.77 (SE +/- 2.33, N = 3; MIN: 84.27 / MAX: 6666.67)

TensorFlow

Device: CPU - Batch Size: 512 - Model: GoogLeNet

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 819.05 (SE +/- 3.19, N = 3)
DDR5-4800: 780.73 (SE +/- 3.88, N = 3)

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 - Seconds, Fewer Is Better
DDR5-6000: 6.47804538 (SE +/- 0.01274608, N = 6)
DDR5-4800: 6.79525073 (SE +/- 0.01024398, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

TensorFlow

Device: CPU - Batch Size: 256 - Model: ResNet-50

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 203.33 (SE +/- 0.65, N = 3)
DDR5-4800: 195.89 (SE +/- 0.77, N = 3)

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

VVenC 1.11 - Frames Per Second, More Is Better
DDR5-6000: 23.74 (SE +/- 0.02, N = 3)
DDR5-4800: 22.88 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-6000: 10.73 (SE +/- 0.05, N = 3; MIN: 6.4 / MAX: 32.23)
DDR5-4800: 11.09 (SE +/- 0.01, N = 3; MIN: 6.01 / MAX: 31.79)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only

Intel Open Image Denoise 2.2 - Images / Sec, More Is Better
DDR5-6000: 2.85 (SE +/- 0.00, N = 5)
DDR5-4800: 2.77 (SE +/- 0.00, N = 3)

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5 - Queries Per Minute, Geo Mean, More Is Better
DDR5-6000: 724.64 (SE +/- 4.70, N = 3; MIN: 85.11 / MAX: 6666.67)
DDR5-4800: 704.36 (SE +/- 2.37, N = 3; MIN: 83.45 / MAX: 6666.67)

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01 - MIPS, More Is Better
DDR5-6000: 908159 (SE +/- 3530.77, N = 3)
DDR5-4800: 882936 (SE +/- 2814.63, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

TensorFlow

Device: CPU - Batch Size: 256 - Model: GoogLeNet

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 659.11 (SE +/- 7.40, N = 3)
DDR5-4800: 641.12 (SE +/- 6.10, N = 3)

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1 2.0 - Frames Per Second, More Is Better
DDR5-6000: 281.59 (SE +/- 2.17, N = 9)
DDR5-4800: 274.07 (SE +/- 2.96, N = 5)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5 - Queries Per Minute, Geo Mean, More Is Better
DDR5-6000: 698.12 (SE +/- 6.12, N = 3; MIN: 80.86 / MAX: 6666.67)
DDR5-4800: 680.81 (SE +/- 4.99, N = 3; MIN: 82.42 / MAX: 6666.67)

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

uvg266 0.4.1 - Frames Per Second, More Is Better
DDR5-6000: 78.23 (SE +/- 0.14, N = 6)
DDR5-4800: 76.35 (SE +/- 0.27, N = 3)

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 137.62 (SE +/- 0.26, N = 3)
DDR5-4800: 134.50 (SE +/- 1.56, N = 4)

PyTorch

Device: CPU - Batch Size: 512 - Model: ResNet-50

PyTorch 2.2.1 - batches/sec, More Is Better
DDR5-6000: 43.83 (SE +/- 0.13, N = 3; MIN: 41.6 / MAX: 45.11)
DDR5-4800: 42.84 (SE +/- 0.03, N = 3; MIN: 41.63 / MAX: 44.31)

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.2 - Frames Per Second, More Is Better
DDR5-6000: 96.81 (SE +/- 0.49, N = 7)
DDR5-4800: 94.71 (SE +/- 1.24, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

TensorFlow

Device: CPU - Batch Size: 256 - Model: AlexNet

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 2271.95 (SE +/- 2.78, N = 4)
DDR5-4800: 2223.51 (SE +/- 16.20, N = 3)

TensorFlow

Device: CPU - Batch Size: 1 - Model: ResNet-50

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 7.11 (SE +/- 0.01, N = 3)
DDR5-4800: 6.96 (SE +/- 0.10, N = 15)

Blender

Blend File: Junkshop - Compute: CPU-Only

Blender 4.1 - Seconds, Fewer Is Better
DDR5-6000: 12.71 (SE +/- 0.03, N = 4)
DDR5-4800: 12.98 (SE +/- 0.08, N = 3)

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only

Intel Open Image Denoise 2.2 - Images / Sec, More Is Better
DDR5-6000: 5.84 (SE +/- 0.01, N = 7)
DDR5-4800: 5.72 (SE +/- 0.01, N = 3)

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only

Intel Open Image Denoise 2.2 - Images / Sec, More Is Better
DDR5-6000: 5.85 (SE +/- 0.00, N = 7)
DDR5-4800: 5.73 (SE +/- 0.00, N = 3)

GPAW

Input: Carbon Nanotube

GPAW 23.6 - Seconds, Fewer Is Better
DDR5-6000: 23.82 (SE +/- 0.10, N = 3)
DDR5-4800: 24.30 (SE +/- 0.13, N = 3)
1. (CC) gcc options: -shared -lxc -lblas -lmpi

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-6000: 3287.43 (SE +/- 2.87, N = 3)
DDR5-4800: 3224.72 (SE +/- 2.32, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-6000: 19.36 (SE +/- 0.01, N = 3; MIN: 9.99 / MAX: 44.41)
DDR5-4800: 19.73 (SE +/- 0.01, N = 3; MIN: 9.5 / MAX: 45.5)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 23.0.1 - Seconds, Fewer Is Better
DDR5-6000: 128.13 (SE +/- 1.52, N = 3)
DDR5-4800: 130.54 (SE +/- 1.70, N = 3)

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.4 - MB/s, More Is Better
DDR5-6000: 21.8 (SE +/- 0.12, N = 3)
DDR5-4800: 21.4 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 4.1 - Seconds, Fewer Is Better
DDR5-6000: 9.46 (SE +/- 0.01, N = 5)
DDR5-4800: 9.63 (SE +/- 0.08, N = 3)

PyTorch

Device: CPU - Batch Size: 64 - Model: ResNet-152

PyTorch 2.2.1 - batches/sec, More Is Better
DDR5-4800: 17.77 (SE +/- 0.16, N = 7; MIN: 16.6 / MAX: 18.47)
DDR5-6000: 17.46 (SE +/- 0.12, N = 3; MIN: 16.79 / MAX: 17.99)

TensorFlow

Device: CPU - Batch Size: 64 - Model: GoogLeNet

TensorFlow 2.16.1 - images/sec, More Is Better
DDR5-6000: 361.38 (SE +/- 2.87, N = 3)
DDR5-4800: 355.14 (SE +/- 3.12, N = 8)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0 - Seconds, Fewer Is Better
DDR5-6000: 83.29 (SE +/- 0.10, N = 3)
DDR5-4800: 84.72 (SE +/- 0.27, N = 3)

NAMD

Input: STMV with 1,066,628 Atoms

NAMD 3.0b6 - ns/day, More Is Better
DDR5-6000: 4.62277 (SE +/- 0.00697, N = 3)
DDR5-4800: 4.54707 (SE +/- 0.01366, N = 3)

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.43 - Iterations Per Minute, More Is Better
DDR5-4800: 865 (SE +/- 1.53, N = 3)
DDR5-6000: 851 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

Llamafile

Test: llava-v1.5-7b-q4 - Acceleration: CPU

Llamafile 0.7 - Tokens Per Second, More Is Better
DDR5-6000: 30.01 (SE +/- 0.32, N = 5)
DDR5-4800: 29.54 (SE +/- 0.23, N = 3)

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

WebP Image Encode 1.2.4 - MP/s, More Is Better
DDR5-6000: 0.68 (SE +/- 0.00, N = 3)
DDR5-4800: 0.67 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm

NAMD

Input: ATPase with 327,506 Atoms

NAMD 3.0b6 - ns/day, More Is Better
DDR5-6000: 14.24 (SE +/- 0.03, N = 7)
DDR5-4800: 14.04 (SE +/- 0.02, N = 3)

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

Embree 4.3 - Frames Per Second, More Is Better
DDR5-6000: 191.78 (SE +/- 0.14, N = 5; MIN: 188.36 / MAX: 196.2)
DDR5-4800: 189.00 (SE +/- 0.10, N = 3; MIN: 186.02 / MAX: 192.57)

RocksDB

Test: Read While Writing

RocksDB 9.0 (Op/s, More Is Better)
DDR5-6000: 17063021 (SE +/- 241581.30, N = 15)
DDR5-4800: 16817111 (SE +/- 233412.71, N = 15)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0 (Seconds, Fewer Is Better)
DDR5-6000: 9.868 (SE +/- 0.013, N = 5)
DDR5-4800: 10.012 (SE +/- 0.051, N = 3)

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 4.3 (Frames Per Second, More Is Better)
DDR5-6000: 223.21 (SE +/- 0.06, N = 8; MIN: 219.93 / MAX: 227.73)
DDR5-4800: 220.00 (SE +/- 0.10, N = 3; MIN: 216.78 / MAX: 224.82)

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
DDR5-4800: 198.90 (SE +/- 1.43, N = 3)
DDR5-6000: 201.76 (SE +/- 0.88, N = 3)

PyTorch

Device: CPU - Batch Size: 64 - Model: ResNet-50

PyTorch 2.2.1 (batches/sec, More Is Better)
DDR5-6000: 43.68 (SE +/- 0.27, N = 3; MIN: 41.7 / MAX: 44.98)
DDR5-4800: 43.07 (SE +/- 0.05, N = 3; MIN: 41.33 / MAX: 43.96)

TensorFlow

Device: CPU - Batch Size: 64 - Model: AlexNet

TensorFlow 2.16.1 (images/sec, More Is Better)
DDR5-6000: 1086.48 (SE +/- 3.83, N = 6)
DDR5-4800: 1071.51 (SE +/- 11.67, N = 3)

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime 1.17 (Inferences Per Second, More Is Better)
DDR5-6000: 44.20 (SE +/- 0.48, N = 15)
DDR5-4800: 43.63 (SE +/- 0.36, N = 15)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
DDR5-4800: 8056.3 (SE +/- 89.89, N = 15; MIN: 5096.4 / MAX: 8588.6)
DDR5-6000: 7954.2 (SE +/- 0.32, N = 3; MIN: 5432.6 / MAX: 7954.6)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 (Inferences Per Second, More Is Better)
DDR5-6000: 969.28 (SE +/- 1.72, N = 3)
DDR5-4800: 957.06 (SE +/- 7.84, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 23Jun2022 (ns/day, More Is Better)
DDR5-6000: 55.79 (SE +/- 0.31, N = 10)
DDR5-4800: 55.15 (SE +/- 0.61, N = 3)
1. (CXX) g++ options: -O3 -lm -ldl

GNU Octave Benchmark

GNU Octave Benchmark 8.4.0 (Seconds, Fewer Is Better)
DDR5-6000: 4.610 (SE +/- 0.010, N = 8)
DDR5-4800: 4.663 (SE +/- 0.010, N = 5)

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-152

PyTorch 2.2.1 (batches/sec, More Is Better)
DDR5-4800: 20.47 (SE +/- 0.12, N = 3; MIN: 19.75 / MAX: 20.95)
DDR5-6000: 20.24 (SE +/- 0.07, N = 3; MIN: 19.69 / MAX: 20.76)

OpenVKL

Benchmark: vklBenchmarkCPU ISPC

OpenVKL 2.0.0 (Items / Sec, More Is Better)
DDR5-6000: 3654 (SE +/- 0.88, N = 3; MIN: 293 / MAX: 42376)
DDR5-4800: 3614 (SE +/- 1.45, N = 3; MIN: 293 / MAX: 42496)

TensorFlow

Device: CPU - Batch Size: 1 - Model: GoogLeNet

TensorFlow 2.16.1 (images/sec, More Is Better)
DDR5-4800: 22.94 (SE +/- 0.14, N = 15)
DDR5-6000: 22.69 (SE +/- 0.16, N = 15)

oneDNN

Harness: Deconvolution Batch shapes_3d - Engine: CPU

oneDNN 3.4 (ms, Fewer Is Better)
DDR5-4800: 0.507850 (SE +/- 0.000838, N = 3; MIN: 0.48)
DDR5-6000: 0.513417 (SE +/- 0.004859, N = 15; MIN: 0.48)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
DDR5-6000: 0.185 (SE +/- 0.001, N = 3)
DDR5-4800: 0.187 (SE +/- 0.001, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.19 (Ops/sec, More Is Better)
DDR5-4800: 13455536.72 (SE +/- 14934.26, N = 3)
DDR5-6000: 13316660.10 (SE +/- 136953.60, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 23Jun2022 (ns/day, More Is Better)
DDR5-4800: 68.48 (SE +/- 0.58, N = 3)
DDR5-6000: 67.78 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3 -lm -ldl

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 16 (TPS, More Is Better)
DDR5-6000: 5418672 (SE +/- 39392.13, N = 3)
DDR5-4800: 5364577 (SE +/- 27253.44, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.17 (Inferences Per Second, More Is Better)
DDR5-6000: 194.27 (SE +/- 0.38, N = 3)
DDR5-4800: 192.35 (SE +/- 0.96, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime 1.17 (Inferences Per Second, More Is Better)
DDR5-6000: 236.47 (SE +/- 1.36, N = 3)
DDR5-4800: 234.14 (SE +/- 1.66, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024 (Ns Per Day, More Is Better)
DDR5-6000: 22.73 (SE +/- 0.04, N = 3)
DDR5-4800: 22.51 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -lm

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
DDR5-6000: 56.41 (SE +/- 0.18, N = 8)
DDR5-4800: 55.87 (SE +/- 0.31, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

VVenC 1.11 (Frames Per Second, More Is Better)
DDR5-6000: 9.810 (SE +/- 0.028, N = 3)
DDR5-4800: 9.719 (SE +/- 0.040, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 (Inferences Per Second, More Is Better)
DDR5-6000: 14.55 (SE +/- 0.02, N = 3)
DDR5-4800: 14.42 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-152

PyTorch 2.2.1 (batches/sec, More Is Better)
DDR5-4800: 17.58 (SE +/- 0.14, N = 3; MIN: 16.88 / MAX: 18.01)
DDR5-6000: 17.42 (SE +/- 0.19, N = 4; MIN: 16.53 / MAX: 18.16)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 21.7.2 (Seconds, Fewer Is Better)
DDR5-6000: 109.56 (SE +/- 0.21, N = 3)
DDR5-4800: 110.55 (SE +/- 0.13, N = 3)

Helsing

Digit Range: 14 digit

Helsing 1.0-beta (Seconds, Fewer Is Better)
DDR5-6000: 43.06 (SE +/- 0.13, N = 3)
DDR5-4800: 43.44 (SE +/- 0.11, N = 3)
1. (CC) gcc options: -O2 -pthread

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
DDR5-6000: 24.00 (SE +/- 0.05, N = 3)
DDR5-4800: 24.21 (SE +/- 0.04, N = 3)

BRL-CAD

VGR Performance Metric

BRL-CAD 7.38.2 (VGR Performance Metric, More Is Better)
DDR5-6000: 5899055
DDR5-4800: 5848380
1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6

PyTorch

Device: CPU - Batch Size: 256 - Model: ResNet-50

PyTorch 2.2.1 (batches/sec, More Is Better)
DDR5-6000: 43.61 (SE +/- 0.29, N = 3; MIN: 41.9 / MAX: 45.08)
DDR5-4800: 43.24 (SE +/- 0.36, N = 3; MIN: 41.12 / MAX: 44.61)

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0 (Seconds, Fewer Is Better)
DDR5-6000: 88.89 (SE +/- 0.32, N = 3)
DDR5-4800: 89.65 (SE +/- 0.30, N = 3)

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 1.0 (Seconds, Fewer Is Better)
DDR5-6000: 4.221 (SE +/- 0.006, N = 8)
DDR5-4800: 4.256 (SE +/- 0.010, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

RocksDB

Test: Random Read

RocksDB 9.0 (Op/s, More Is Better)
DDR5-4800: 793390385 (SE +/- 225348.54, N = 3)
DDR5-6000: 786896580 (SE +/- 5260460.78, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100

Apache IoTDB 1.2 (point/sec, More Is Better)
DDR5-6000: 134210671 (SE +/- 338573.54, N = 3)
DDR5-4800: 133128010 (SE +/- 33011.72, N = 3)

FFmpeg

Encoder: libx265 - Scenario: Video On Demand

FFmpeg 7.0 (FPS, More Is Better)
DDR5-4800: 67.05 (SE +/- 0.05, N = 3)
DDR5-6000: 66.51 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Platform

FFmpeg 7.0 (FPS, More Is Better)
DDR5-4800: 67.14 (SE +/- 0.02, N = 3)
DDR5-6000: 66.60 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

TensorFlow

Device: CPU - Batch Size: 512 - Model: AlexNet

TensorFlow 2.16.1 (images/sec, More Is Better)
DDR5-6000: 2665.92 (SE +/- 2.64, N = 3)
DDR5-4800: 2644.59 (SE +/- 3.62, N = 3)

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

PyTorch 2.2.1 (batches/sec, More Is Better)
DDR5-4800: 51.61 (SE +/- 0.59, N = 3; MIN: 48.6 / MAX: 53.42)
DDR5-6000: 51.20 (SE +/- 0.42, N = 3; MIN: 48.71 / MAX: 53.18)

OpenSSL

Algorithm: SHA512

OpenSSL 3.3 (byte/s, More Is Better)
DDR5-4800: 72936235753 (SE +/- 495890924.48, N = 3)
DDR5-6000: 72359767240 (SE +/- 163780192.60, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 16 (ms, Fewer Is Better)
DDR5-4800: 7.855 (SE +/- 0.010, N = 3)
DDR5-6000: 7.916 (SE +/- 0.004, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

PostgreSQL 16 (TPS, More Is Better)
DDR5-4800: 127310 (SE +/- 154.18, N = 3)
DDR5-6000: 126334 (SE +/- 68.35, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
DDR5-6000: 86.62 (SE +/- 0.12, N = 3)
DDR5-4800: 87.28 (SE +/- 0.08, N = 3)

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
DDR5-4800: 406 (SE +/- 0.67, N = 3)
DDR5-6000: 403 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

Y-Cruncher

Pi Digits To Calculate: 1B

Y-Cruncher 0.8.3 (Seconds, Fewer Is Better)
DDR5-4800: 8.113 (SE +/- 0.013, N = 3)
DDR5-6000: 8.173 (SE +/- 0.015, N = 5)

OpenRadioss

Model: Cell Phone Drop Test

OpenRadioss 2023.09.15 (Seconds, Fewer Is Better)
DDR5-4800: 17.74 (SE +/- 0.12, N = 3)
DDR5-6000: 17.87 (SE +/- 0.16, N = 3)

TensorFlow

Device: CPU - Batch Size: 1 - Model: AlexNet

TensorFlow 2.16.1 (images/sec, More Is Better)
DDR5-4800: 26.12 (SE +/- 0.12, N = 3)
DDR5-6000: 25.93 (SE +/- 0.16, N = 7)

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
DDR5-6000: 5876566667 (SE +/- 20311928.62, N = 3)
DDR5-4800: 5834800000 (SE +/- 19352605.34, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
DDR5-4800: 19592.6 (SE +/- 180.03, N = 3)
DDR5-6000: 19454.1 (SE +/- 214.06, N = 3)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

FFmpeg

Encoder: libx265 - Scenario: Upload

FFmpeg 7.0 (FPS, More Is Better)
DDR5-4800: 32.57 (SE +/- 0.02, N = 3)
DDR5-6000: 32.34 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Y-Cruncher

Pi Digits To Calculate: 500M

Y-Cruncher 0.8.3 (Seconds, Fewer Is Better)
DDR5-4800: 4.843 (SE +/- 0.019, N = 3)
DDR5-6000: 4.877 (SE +/- 0.005, N = 6)

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
DDR5-4800: 192115.85 (SE +/- 417.74, N = 3)
DDR5-6000: 190777.83 (SE +/- 1255.01, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

m-queens

Time To Solve

m-queens 1.2 (Seconds, Fewer Is Better)
DDR5-4800: 5.454 (SE +/- 0.036, N = 3)
DDR5-6000: 5.492 (SE +/- 0.025, N = 7)
1. (CXX) g++ options: -fopenmp -O2 -march=native

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
DDR5-6000: 7.22 (SE +/- 0.01, N = 3; MIN: 4.48 / MAX: 26.34)
DDR5-4800: 7.27 (SE +/- 0.01, N = 3; MIN: 4.1 / MAX: 24.92)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

OSPRay Studio 1.0 (ms, Fewer Is Better)
DDR5-6000: 11982 (SE +/- 14.19, N = 3)
DDR5-4800: 12063 (SE +/- 11.86, N = 3)

Parallel BZIP2 Compression

FreeBSD-13.0-RELEASE-amd64-memstick.img Compression

Parallel BZIP2 Compression 1.1.13 (Seconds, Fewer Is Better)
DDR5-6000: 0.934896 (SE +/- 0.010948, N = 15)
DDR5-4800: 0.941118 (SE +/- 0.009629, N = 15)
1. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

OSPRay Studio 1.0 (ms, Fewer Is Better)
DDR5-6000: 753 (SE +/- 0.58, N = 3)
DDR5-4800: 758 (SE +/- 0.67, N = 3)

ASTC Encoder

Preset: Medium

ASTC Encoder 4.7 (MT/s, More Is Better)
DDR5-6000: 701.73 (SE +/- 3.94, N = 8)
DDR5-4800: 697.19 (SE +/- 5.72, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

Apache IoTDB 1.2 (point/sec, More Is Better)
DDR5-4800: 118270441 (SE +/- 266876.94, N = 3)
DDR5-6000: 117516797 (SE +/- 120972.23, N = 3)

Appleseed

Scene: Disney Material

Appleseed 2.0 Beta (Seconds, Fewer Is Better)
DDR5-4800: 30.50
DDR5-6000: 30.69

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

OSPRay Studio 1.0 (ms, Fewer Is Better)
DDR5-6000: 23981 (SE +/- 32.92, N = 3)
DDR5-4800: 24131 (SE +/- 51.91, N = 3)

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
DDR5-4800: 88719.53 (SE +/- 1060.91, N = 3)
DDR5-6000: 88208.36 (SE +/- 618.06, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

SecureMark

Benchmark: SecureMark-TLS

SecureMark 1.0.4 (marks, More Is Better)
DDR5-4800: 407759 (SE +/- 2301.56, N = 3)
DDR5-6000: 405454 (SE +/- 809.78, N = 3)
1. (CC) gcc options: -pedantic -O3

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
DDR5-6000: 8730.88 (SE +/- 8.58, N = 3)
DDR5-4800: 8681.62 (SE +/- 7.28, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
DDR5-4800: 8196166667 (SE +/- 3233333.33, N = 3)
DDR5-6000: 8151900000 (SE +/- 6331139.97, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
DDR5-6000: 1636.4 (SE +/- 1.50, N = 3)
DDR5-4800: 1627.7 (SE +/- 4.80, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.8 (Seconds, Fewer Is Better)
DDR5-6000: 23.34 (SE +/- 0.13, N = 3)
DDR5-4800: 23.47 (SE +/- 0.19, N = 3)

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 4.3 (Frames Per Second, More Is Better)
DDR5-6000: 179.45 (SE +/- 0.11, N = 8; MIN: 175.01 / MAX: 186.64)
DDR5-4800: 178.52 (SE +/- 0.40, N = 3; MIN: 174.12 / MAX: 184.84)

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901 (Billion Interactions/s, More Is Better)
DDR5-6000: 385.90 (SE +/- 2.67, N = 11)
DDR5-4800: 383.90 (SE +/- 3.39, N = 15)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

miniBUDE 20210901 (GFInst/s, More Is Better)
DDR5-6000: 9647.50 (SE +/- 66.65, N = 11)
DDR5-4800: 9597.53 (SE +/- 84.72, N = 15)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

WebP Image Encode 1.2.4 (MP/s, More Is Better)
DDR5-6000: 3.90 (SE +/- 0.01, N = 7)
DDR5-4800: 3.88 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -fvisibility=hidden -O2 -lm

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
DDR5-6000: 385558.53 (SE +/- 5529.34, N = 15)
DDR5-4800: 383603.91 (SE +/- 5209.32, N = 12)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 4.1 (Seconds, Fewer Is Better)
DDR5-6000: 29.75 (SE +/- 0.06, N = 3)
DDR5-4800: 29.90 (SE +/- 0.10, N = 3)

Apache IoTDB

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100

Apache IoTDB 1.2 (Average Latency, Fewer Is Better)
DDR5-6000: 57.30 (SE +/- 0.13, N = 3; MAX: 23846.01)
DDR5-4800: 57.58 (SE +/- 0.13, N = 3; MAX: 23818.56)

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU

OSPRay Studio 1.0 (ms, Fewer Is Better)
DDR5-6000: 639 (SE +/- 0.33, N = 3)
DDR5-4800: 642 (SE +/- 0.33, N = 3)

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 (Inferences Per Second, More Is Better)
DDR5-4800: 195.17 (SE +/- 0.63, N = 3)
DDR5-6000: 194.26 (SE +/- 0.21, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU

OSPRay Studio 1.0 (ms, Fewer Is Better)
DDR5-6000: 10154 (SE +/- 3.71, N = 3)
DDR5-4800: 10200 (SE +/- 7.81, N = 3)

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
DDR5-4800: 457 (SE +/- 0.58, N = 3)
DDR5-6000: 455 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

Stockfish

Chess Benchmark

Stockfish 16.1 (Nodes Per Second, More Is Better)
DDR5-4800: 308692581 (SE +/- 1906957.54, N = 3)
DDR5-6000: 307343923 (SE +/- 4338134.07, N = 15)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

Speedb

Test: Random Read

Speedb 2.7 (Op/s, More Is Better)
DDR5-6000: 822102828 (SE +/- 707549.54, N = 3)
DDR5-4800: 818556768 (SE +/- 4175835.13, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 7.0 (Seconds, Fewer Is Better)
DDR5-4800: 13.92 (SE +/- 0.05, N = 3)
DDR5-6000: 13.98 (SE +/- 0.07, N = 4)

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OSPRay 3.1 (Items Per Second, More Is Better)
DDR5-6000: 55.18 (SE +/- 0.01, N = 3)
DDR5-4800: 54.96 (SE +/- 0.02, N = 3)

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 (Inferences Per Second, More Is Better)
DDR5-4800: 41.37 (SE +/- 0.24, N = 3)
DDR5-6000: 41.21 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

Primesieve

Length: 1e12

Primesieve 12.1 (Seconds, Fewer Is Better)
DDR5-6000: 1.553 (SE +/- 0.004, N = 12)
DDR5-4800: 1.559 (SE +/- 0.013, N = 3)
1. (CXX) g++ options: -O3

x265

Video Input: Bosphorus 4K

x265 3.6 (Frames Per Second, More Is Better)
DDR5-6000: 34.21 (SE +/- 0.11, N = 3)
DDR5-4800: 34.08 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
DDR5-6000: 8494733333 (SE +/- 14339494.80, N = 3)
DDR5-4800: 8463100000 (SE +/- 24860209.17, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
DDR5-6000: 6042366.34 (SE +/- 10789.95, N = 3)
DDR5-4800: 6019991.22 (SE +/- 24560.32, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Thread

srsRAN Project 23.10.1-20240325 (Mbps, More Is Better)
DDR5-6000: 994.0 (SE +/- 2.19, N = 9)
DDR5-4800: 990.4 (SE +/- 10.21, N = 5)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

OSPRay

Benchmark: particle_volume/scivis/real_time

OSPRay 3.1 (Items Per Second, More Is Better)
DDR5-4800: 54.57 (SE +/- 0.02, N = 3)
DDR5-6000: 54.40 (SE +/- 0.01, N = 3)

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
DDR5-4800: 314 (SE +/- 0.33, N = 3)
DDR5-6000: 313 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 (FPS, More Is Better)
DDR5-6000: 11232.26 (SE +/- 2.97, N = 3)
DDR5-4800: 11196.67 (SE +/- 5.50, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
DDR5-4800: 23164333 (SE +/- 333.33, N = 3)
DDR5-6000: 23093667 (SE +/- 13860.42, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
DDR5-4800: 5424133333 (SE +/- 166666.67, N = 3)
DDR5-6000: 5407600000 (SE +/- 9856131.76, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

QuantLib

Configuration: Single-Threaded

QuantLib 1.32 (MFLOPS, More Is Better)
DDR5-6000: 4264.7 (SE +/- 40.37, N = 3)
DDR5-4800: 4252.2 (SE +/- 45.37, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
DDR5-6000: 1726.4 (SE +/- 8.42, N = 3)
DDR5-4800: 1721.5 (SE +/- 6.51, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

uvg266

Video Input: Bosphorus 4K - Video Preset: Slow

uvg266 0.4.1 (Frames Per Second, More Is Better)
DDR5-4800: 39.25 (SE +/- 0.08, N = 3)
DDR5-6000: 39.14 (SE +/- 0.09, N = 4)

QMCPACK

Input: Li2_STO_ae

QMCPACK 3.17.1 (Total Execution Time - Seconds, Fewer Is Better)
DDR5-4800: 78.80 (SE +/- 0.54, N = 3)
DDR5-6000: 79.02 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

QuantLib

Configuration: Multi-Threaded

QuantLib 1.32 (MFLOPS, More Is Better)
DDR5-6000: 465436.8 (SE +/- 401.79, N = 3)
DDR5-4800: 464193.2 (SE +/- 649.08, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

SVT-AV1 2.0 (Frames Per Second, More Is Better)
DDR5-6000: 118.16 (SE +/- 1.10, N = 7)
DDR5-4800: 117.85 (SE +/- 1.05, N = 15)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
DDR5-4800: 1446266667 (SE +/- 1386041.53, N = 3)
DDR5-6000: 1442500000 (SE +/- 1960442.13, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2024.0 (ms, Fewer Is Better)
DDR5-4800: 3.87 (SE +/- 0.01, N = 3; MIN: 1.54 / MAX: 25.35)
DDR5-6000: 3.88 (SE +/- 0.00, N = 3; MIN: 1.55 / MAX: 24.23)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

uvg266 0.4.1 (Frames Per Second, More Is Better)
DDR5-4800: 43.96 (SE +/- 0.05, N = 3)
DDR5-6000: 43.85 (SE +/- 0.04, N = 4)

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU

OSPRay Studio 1.0 (ms, Fewer Is Better)
DDR5-6000: 20379 (SE +/- 39.68, N = 3)
DDR5-4800: 20428 (SE +/- 13.58, N = 3)

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 4.1 - Seconds, Fewer Is Better
DDR5-6000: 12.50 (SE +/- 0.01, N = 4)
DDR5-4800: 12.53 (SE +/- 0.05, N = 3)

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10 - Seconds, Fewer Is Better
DDR5-4800: 20.55
DDR5-6000: 20.60
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

PyTorch

Device: CPU - Batch Size: 512 - Model: ResNet-152

PyTorch 2.2.1 - batches/sec, More Is Better
DDR5-4800: 17.53 (SE +/- 0.13, N = 3; MIN: 16.64 / MAX: 17.95)
DDR5-6000: 17.49 (SE +/- 0.05, N = 3; MIN: 16.94 / MAX: 18.01)

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-6000: 13949.89 (SE +/- 14.83, N = 3)
DDR5-4800: 13918.74 (SE +/- 2.75, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-6000: 4.48 (SE +/- 0.00, N = 3; MIN: 1.99 / MAX: 22.31)
DDR5-4800: 4.49 (SE +/- 0.00, N = 3; MIN: 1.96 / MAX: 20.97)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

LuxCoreRender 2.6 - M samples/sec, More Is Better
DDR5-6000: 13.96 (SE +/- 0.14, N = 15; MIN: 6.55 / MAX: 16.95)
DDR5-4800: 13.93 (SE +/- 0.15, N = 15; MIN: 6.46 / MAX: 16.83)

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 - samples/s, More Is Better
DDR5-4800: 68321333 (SE +/- 65397.08, N = 3)
DDR5-6000: 68181667 (SE +/- 98289.26, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

NWChem

Input: C240 Buckyball

NWChem 7.0.2 - Seconds, Fewer Is Better
DDR5-6000: 1326.2
DDR5-4800: 1328.9
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 1.0 - Seconds, Fewer Is Better
DDR5-4800: 3.452 (SE +/- 0.005, N = 3)
DDR5-6000: 3.459 (SE +/- 0.003, N = 9)
1. (CXX) g++ options: -O3 -fPIC -lm

OpenSSL

Algorithm: AES-128-GCM

OpenSSL 3.3 - byte/s, More Is Better
DDR5-6000: 2007365351470 (SE +/- 679943093.73, N = 3)
DDR5-4800: 2003313098753 (SE +/- 798384586.01, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inferences Per Second, More Is Better
DDR5-4800: 24.59 (SE +/- 0.04, N = 3)
DDR5-6000: 24.54 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

John The Ripper

Test: Blowfish

John The Ripper 2023.03.14 - Real C/S, More Is Better
DDR5-6000: 323010 (SE +/- 53.35, N = 3)
DDR5-4800: 322396 (SE +/- 53.65, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: bcrypt

John The Ripper 2023.03.14 - Real C/S, More Is Better
DDR5-6000: 323000 (SE +/- 45.32, N = 3)
DDR5-4800: 322406 (SE +/- 44.46, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

libavif avifenc

Encoder Speed: 0

libavif avifenc 1.0 - Seconds, Fewer Is Better
DDR5-6000: 47.47 (SE +/- 0.03, N = 3)
DDR5-4800: 47.55 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-4800: 328.94 (SE +/- 0.37, N = 3; MIN: 143.56 / MAX: 354.17)
DDR5-6000: 329.53 (SE +/- 0.54, N = 3; MIN: 165.13 / MAX: 354.63)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-6000: 5.62 (SE +/- 0.00, N = 3; MIN: 2.01 / MAX: 29.21)
DDR5-4800: 5.63 (SE +/- 0.00, N = 3; MIN: 2.39 / MAX: 30.13)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

SVT-AV1 2.0 - Frames Per Second, More Is Better
DDR5-6000: 11.28 (SE +/- 0.04, N = 4)
DDR5-4800: 11.26 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime 1.17 - Inferences Per Second, More Is Better
DDR5-4800: 12.09 (SE +/- 0.01, N = 3)
DDR5-6000: 12.07 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-4800: 194.14 (SE +/- 0.21, N = 3)
DDR5-6000: 193.80 (SE +/- 0.32, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

libavif avifenc

Encoder Speed: 6

libavif avifenc 1.0 - Seconds, Fewer Is Better
DDR5-6000: 2.003
DDR5-4800: 2.006
Only one standard error is reported for this chart: SE +/- 0.006, N = 2
1. (CXX) g++ options: -O3 -fPIC -lm

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01 - MIPS, More Is Better
DDR5-6000: 846736 (SE +/- 276.36, N = 3)
DDR5-4800: 845631 (SE +/- 719.43, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inferences Per Second, More Is Better
DDR5-4800: 3.23367 (SE +/- 0.04063, N = 3)
DDR5-6000: 3.22956 (SE +/- 0.04511, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

Numpy Benchmark

Numpy Benchmark - Score, More Is Better
DDR5-4800: 796.77 (SE +/- 1.53, N = 3)
DDR5-6000: 795.83 (SE +/- 1.03, N = 3)

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 - samples/s, More Is Better
DDR5-6000: 2487400000 (SE +/- 1305118.13, N = 3)
DDR5-4800: 2484533333 (SE +/- 1474599.76, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenSSL

Algorithm: AES-256-GCM

OpenSSL 3.3 - byte/s, More Is Better
DDR5-4800: 1839699436523 (SE +/- 686739707.21, N = 3)
DDR5-6000: 1837765850747 (SE +/- 2482404044.78, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

John The Ripper

Test: WPA PSK

John The Ripper 2023.03.14 - Real C/S, More Is Better
DDR5-6000: 1361333 (SE +/- 1201.85, N = 3)
DDR5-4800: 1360000 (SE +/- 1527.53, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenSSL

Algorithm: RSA4096

OpenSSL 3.3 - verify/s, More Is Better
DDR5-4800: 2757741.0 (SE +/- 282.94, N = 3)
DDR5-6000: 2755042.3 (SE +/- 331.00, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenSSL 3.3 - byte/s, More Is Better
DDR5-4800: 1187315147937 (SE +/- 227575784.55, N = 3)
DDR5-6000: 1186359294450 (SE +/- 507390352.95, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 24.0 - Seconds, Fewer Is Better
DDR5-4800: 13.66 (SE +/- 0.07, N = 3)
DDR5-6000: 13.67 (SE +/- 0.07, N = 4)

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-6000: 31655.57 (SE +/- 0.83, N = 3)
DDR5-4800: 31630.96 (SE +/- 17.88, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Apache IoTDB

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

Apache IoTDB 1.2 - Average Latency, Fewer Is Better
DDR5-4800: 225.70 (SE +/- 1.40, N = 3; MAX: 26697.41)
DDR5-6000: 225.87 (SE +/- 0.92, N = 3; MAX: 26595.42)

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-4800: 26.65 (SE +/- 0.00, N = 3; MIN: 15.32 / MAX: 48.24)
DDR5-6000: 26.67 (SE +/- 0.03, N = 3; MIN: 15.22 / MAX: 47.89)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Kvazaar 2.2 - Frames Per Second, More Is Better
DDR5-4800: 53.99 (SE +/- 0.01, N = 3)
DDR5-6000: 53.95 (SE +/- 0.01, N = 5)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Primesieve

Length: 1e13

Primesieve 12.1 - Seconds, Fewer Is Better
DDR5-4800: 17.96 (SE +/- 0.01, N = 3)
DDR5-6000: 17.97 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL 3.3 - byte/s, More Is Better
DDR5-4800: 807588425637 (SE +/- 318743927.24, N = 3)
DDR5-6000: 807009841427 (SE +/- 89801965.81, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA256

OpenSSL 3.3 - byte/s, More Is Better
DDR5-4800: 186789819953 (SE +/- 316551135.30, N = 3)
DDR5-6000: 186660393313 (SE +/- 76407988.26, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

libavif avifenc

Encoder Speed: 2

libavif avifenc 1.0 - Seconds, Fewer Is Better
DDR5-6000: 24.80 (SE +/- 0.03, N = 3)
DDR5-4800: 24.81 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 - samples/s, More Is Better
DDR5-6000: 2784566667 (SE +/- 233333.33, N = 3)
DDR5-4800: 2783133333 (SE +/- 352766.84, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-4800: 4776.66 (SE +/- 1.24, N = 3)
DDR5-6000: 4774.33 (SE +/- 5.09, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

ASTC Encoder

Preset: Very Thorough

ASTC Encoder 4.7 - MT/s, More Is Better
DDR5-6000: 15.72 (SE +/- 0.00, N = 3)
DDR5-4800: 15.71 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2023.09.15 - Seconds, Fewer Is Better
DDR5-6000: 73.67 (SE +/- 0.07, N = 3)
DDR5-4800: 73.70 (SE +/- 0.35, N = 3)

OSPRay

Benchmark: particle_volume/ao/real_time

OSPRay 3.1 - Items Per Second, More Is Better
DDR5-6000: 54.39 (SE +/- 0.02, N = 3)
DDR5-4800: 54.38 (SE +/- 0.01, N = 3)

OpenSSL

Algorithm: RSA4096

OpenSSL 3.3 - sign/s, More Is Better
DDR5-4800: 69052.5 (SE +/- 80.51, N = 3)
DDR5-6000: 69037.7 (SE +/- 15.58, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 - samples/s, More Is Better
DDR5-4800: 44063000 (SE +/- 52003.21, N = 3)
DDR5-6000: 44054000 (SE +/- 3511.88, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

Kvazaar 2.2 - Frames Per Second, More Is Better
DDR5-4800: 53.43 (SE +/- 0.09, N = 3)
DDR5-6000: 53.42 (SE +/- 0.06, N = 5)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 - samples/s, More Is Better
DDR5-6000: 3299400000 (SE +/- 5392896.56, N = 3)
DDR5-4800: 3298933333 (SE +/- 693621.73, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 - Billion Interactions/s, More Is Better
DDR5-4800: 386.20 (SE +/- 3.02, N = 3)
DDR5-6000: 386.18 (SE +/- 3.07, N = 4)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 - GFInst/s, More Is Better
DDR5-4800: 9655.08 (SE +/- 75.53, N = 3)
DDR5-6000: 9654.58 (SE +/- 76.86, N = 4)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 - samples/s, More Is Better
DDR5-4800: 2855600000 (SE +/- 1059874.21, N = 3)
DDR5-6000: 2855466667 (SE +/- 3925274.23, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ASTC Encoder

Preset: Thorough

ASTC Encoder 4.7 - MT/s, More Is Better
DDR5-6000: 110.04 (SE +/- 0.02, N = 6)
DDR5-4800: 110.03 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 - FPS, More Is Better
DDR5-4800: 18679.63 (SE +/- 7.25, N = 3)
DDR5-6000: 18679.36 (SE +/- 13.14, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-4800: 6.61 (SE +/- 0.01, N = 3; MIN: 2.24 / MAX: 27.98)
DDR5-6000: 6.61 (SE +/- 0.01, N = 3; MIN: 2.25 / MAX: 28.15)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2024.0 - ms, Fewer Is Better
DDR5-4800: 0.48 (SE +/- 0.00, N = 3; MIN: 0.13 / MAX: 26.21)
DDR5-6000: 0.48 (SE +/- 0.01, N = 3; MIN: 0.13 / MAX: 26.24)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

srsRAN Project 23.10.1-20240325 - Mbps, More Is Better
DDR5-6000: 183.3 (SE +/- 0.00, N = 4; MIN: 105.8)
DDR5-4800: 183.3 (SE +/- 0.00, N = 3; MIN: 105.8)
1. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 4.7 - MT/s, More Is Better
DDR5-6000: 9.6576 (SE +/- 0.0004, N = 3)
DDR5-4800: 9.6576 (SE +/- 0.0032, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

PyBench

Total For Average Test Times

PyBench 2018-02-16 - Milliseconds, Fewer Is Better
DDR5-4800: 582 (SE +/- 3.93, N = 3)
DDR5-6000: 582 (SE +/- 1.70, N = 4)

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.4 - MB/s, More Is Better
DDR5-6000: 10.2 (SE +/- 0.00, N = 3)
DDR5-4800: 10.2 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Xmrig

Variant: GhostRider - Hash Count: 1M

Xmrig 6.21 - H/s, More Is Better
DDR5-6000: 19873.0 (SE +/- 912.79, N = 15)
DDR5-4800: 18386.4 (SE +/- 997.93, N = 15)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
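
The Xmrig GhostRider result is one of the few in this set where the two memory speeds differ visibly (19873.0 H/s at DDR5-6000 versus 18386.4 H/s at DDR5-4800), but both runs also carry large standard errors. A minimal Python sketch for putting such a gap in context follows. The helper names are hypothetical, and comparing the gap against the root-sum-square of the two standard errors is an assumption of this sketch, not something the Phoronix Test Suite reports.

```python
import math

def relative_gain_pct(new, base):
    """Percentage by which `new` exceeds `base`."""
    return (new - base) / base * 100.0

def gap_in_combined_se(a, se_a, b, se_b):
    """Absolute gap between two means, expressed in units of the
    root-sum-square of their standard errors (an illustrative yardstick)."""
    return abs(a - b) / math.sqrt(se_a ** 2 + se_b ** 2)

# Values taken from the Xmrig GhostRider chart above (H/s, more is better).
ddr5_6000, se_6000 = 19873.0, 912.79
ddr5_4800, se_4800 = 18386.4, 997.93

gain = relative_gain_pct(ddr5_6000, ddr5_4800)
ratio = gap_in_combined_se(ddr5_6000, se_6000, ddr5_4800, se_4800)

print(f"DDR5-6000 vs DDR5-4800: {gain:.2f}% higher")
print(f"Gap is {ratio:.2f}x the combined standard error")
```

A ratio close to 1 suggests the difference is of the same order as the run-to-run noise, so the gain should be read with caution. For "Fewer Is Better" charts, swap the arguments to `relative_gain_pct` so the sign matches the direction of improvement.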

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-6000: 4.22823 (SE +/- 0.02413, N = 3)
DDR5-4800: 4.27007 (SE +/- 0.03015, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-6000: 1.02998 (SE +/- 0.00186, N = 3)
DDR5-4800: 1.04331 (SE +/- 0.00860, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-4800: 24.17 (SE +/- 0.14, N = 3)
DDR5-6000: 24.27 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-6000: 22.66 (SE +/- 0.24, N = 15)
DDR5-4800: 22.94 (SE +/- 0.18, N = 15)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-6000: 5.14539 (SE +/- 0.01008, N = 3)
DDR5-4800: 5.19672 (SE +/- 0.02595, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-4800: 40.67 (SE +/- 0.07, N = 3)
DDR5-6000: 40.75 (SE +/- 0.21, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-4800: 5.12255 (SE +/- 0.01638, N = 3)
DDR5-6000: 5.14629 (SE +/- 0.00564, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-4800: 309.34 (SE +/- 3.94, N = 3)
DDR5-6000: 309.76 (SE +/- 4.35, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Parallel

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-6000: 68.72 (SE +/- 0.11, N = 3)
DDR5-4800: 69.36 (SE +/- 0.42, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime 1.17 - Inference Time Cost (ms), Fewer Is Better
DDR5-4800: 82.72 (SE +/- 0.05, N = 3)
DDR5-6000: 82.87 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

SVT-AV1 2.0 - Frames Per Second, More Is Better
DDR5-6000: 271.92 (SE +/- 7.58, N = 15)
DDR5-4800: 265.85 (SE +/- 9.35, N = 12)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

LuxCoreRender 2.6 - M samples/sec, More Is Better
DDR5-6000: 32.02 (SE +/- 0.52, N = 15; MIN: 26.48 / MAX: 43.04)
DDR5-4800: 30.83 (SE +/- 0.41, N = 15; MIN: 26.09 / MAX: 42.8)

LuxCoreRender

Scene: DLSC - Acceleration: CPU

LuxCoreRender 2.6 - M samples/sec, More Is Better
DDR5-4800: 21.20 (SE +/- 0.38, N = 15; MIN: 19.48 / MAX: 27.07)
DDR5-6000: 20.94 (SE +/- 0.33, N = 15; MIN: 19.5 / MAX: 27)

Pennant

Test: leblancbig

Pennant 1.0.1 - Hydro Cycle Time - Seconds, Fewer Is Better
DDR5-6000: 2.761260 (SE +/- 0.045208, N = 15)
DDR5-4800: 2.856528 (SE +/- 0.001976, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Speedb

Test: Read While Writing

Speedb 2.7 - Op/s, More Is Better
DDR5-4800: 18826431 (SE +/- 78558.58, N = 3)
DDR5-6000: 18687897 (SE +/- 433421.14, N = 12)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread


Phoronix Test Suite v10.8.5