svc05_hpc_run_1_23-09-22

v1.59

HTML result view exported from: https://openbenchmarking.org/result/2309266-NE-SVC05HPCR92&grs.

svc05_hpc_run_1_23-09-22ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen ResolutionSamsung SSD 980 PRO 1TBAMD Ryzen Threadripper PRO 5995WX 64-Cores @ 2.70GHz (64 Cores)ASRock WRX80 Creator (5.01 BIOS)AMD Starship/Matisse256GB1000GB Samsung SSD 980 PRO 1TBllvmpipeAMD Starship/Matisse2 x Intel X710 for 10GBASE-T + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 22.046.2.0-33-generic (x86_64)GNOME Shell 42.9X Server 1.21.1.34.5 Mesa 23.0.4-0ubuntu1~22.04.1 (LLVM 15.0.7 256 bits)1.3.238GCC 11.4.0ext41024x768OpenBenchmarking.org- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - NONE / errors=remount-ro,relatime,rw / Block Size: 4096- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa008204 - Python 3.10.12- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

svc05_hpc_run_1_23-09-22opencv: DNN - Deep Neural Networkkripke: scikit-learn: Sparse Rand Projections / 100 Iterationsscikit-learn: 20 Newsgroups / Logistic Regressionscikit-learn: Covertype Dataset Benchmarkscikit-learn: Text Vectorizersscikit-learn: Sparsifypyhpc: CPU - Numpy - 4194304 - Isoneutral Mixingpyhpc: CPU - Numpy - 4194304 - Equation of Statepyhpc: CPU - Numpy - 1048576 - Isoneutral Mixingpyhpc: CPU - Numpy - 1048576 - Equation of Statepyhpc: CPU - Numpy - 262144 - Isoneutral Mixingpyhpc: CPU - Numpy - 262144 - Equation of Statepyhpc: CPU - Numpy - 65536 - Isoneutral Mixingpyhpc: CPU - Numpy - 65536 - Equation of Statepyhpc: CPU - Numpy - 16384 - Isoneutral Mixingpyhpc: CPU - Numpy - 16384 - Equation of Statemlpack: scikit_linearridgeregressionmlpack: scikit_svmmlpack: scikit_qdamlpack: scikit_icafaiss: bench_polysemous_sift1m - PQ baselineai-benchmark: Device AI Scoreai-benchmark: Device Training Scoreai-benchmark: Device Inference Scorenumenta-nab: Contextual Anomaly Detector OSEnumenta-nab: Bayesian Changepointnumenta-nab: Earthgecko Skylinenumenta-nab: Windowed Gaussiannumenta-nab: Relative Entropynumenta-nab: KNN CADpetsc: Streamsopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUtnn: CPU - SqueezeNet v1.1tnn: CPU - SqueezeNet v2tnn: CPU - MobileNet v2tnn: CPU - DenseNetncnn: Vulkan GPU - FastestDetncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetncnn: CPU - vision_transformerncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: squeezenetv1.1mnn: mobilenetV3mnn: nasnetgpaw: Carbon Nanotubewrf: conus 2.5kmcaffe: GoogleNet - CPU - 1000caffe: GoogleNet - CPU - 200caffe: GoogleNet - CPU - 100caffe: AlexNet - CPU - 1000caffe: AlexNet - CPU - 200caffe: AlexNet - CPU - 100spacy: en_core_web_trfspacy: en_core_web_lgdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Streamdeepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Synchronous Single-Streamdeepsparse: ResNet-50, Baseline - Synchronous Single-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamoctave-benchmark: tensorflow: CPU - 512 - ResNet-50tensorflow: CPU - 512 - GoogLeNettensorflow: CPU - 256 - ResNet-50tensorflow: CPU - 256 - GoogLeNettensorflow: CPU - 64 - ResNet-50tensorflow: CPU - 64 - GoogLeNettensorflow: CPU - 32 - ResNet-50tensorflow: CPU - 32 - GoogLeNettensorflow: CPU - 16 - ResNet-50tensorflow: CPU - 16 - GoogLeNettensorflow: CPU - 512 - AlexNettensorflow: CPU - 256 - AlexNettensorflow: CPU - 64 - AlexNettensorflow: CPU - 512 - VGG-16tensorflow: CPU - 32 - AlexNettensorflow: CPU - 256 - VGG-16tensorflow: CPU - 16 - AlexNettensorflow: CPU - 64 - VGG-16tensorflow: CPU - 32 - VGG-16tensorflow: CPU - 16 - VGG-16tensorflow-lite: Inception ResNet V2tensorflow-lite: Mobilenet Quanttensorflow-lite: Mobilenet Floattensorflow-lite: NASNet Mobiletensorflow-lite: Inception V4tensorflow-lite: SqueezeNetdaphne: OpenMP - Euclidean Clusterdaphne: OpenMP - Points2Imagedaphne: OpenMP - NDT Mappinggromacs: MPI CPU - water_GMX50_bareintel-mpi: IMB-MPI1 PingPongintel-mpi: IMB-P2P PingPonggraph500: 26graph500: 26graph500: 26graph500: 26askap: Hogbom Clean OpenMPaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingrnnoise: rbenchmark: deepspeech: CPUnumpy: onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - f32 - CPUhimeno: Poisson Pressure Solvermt-dgemm: Sustained Floating-Point Ratearrayfire: BLAS CPUlulesh: lammps: Rhodopsin Proteinlammps: 20k Atomsnekrs: TurboPipe Periodicnekrs: Kershawspecfem3d: Water-layered Halfspacespecfem3d: Homogeneous Halfspacespecfem3d: Tomographic Modelspecfem3d: Layered Halfspacespecfem3d: Mount St. Helensremhos: Sample Remap Exampleqe: AUSURF112openradioss: Rubber O-Ring Seal Installationopenradioss: Bird Strike on Windshieldopenradioss: Cell Phone Drop Testopenradioss: Chrysler Neon 1Mopenradioss: Bumper Beamopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Large Mesh Size - Execution Timeopenfoam: drivaerFastback, Large Mesh Size - Mesh Timeopenfoam: motorBike - Execution Timeopenfoam: motorBike - Mesh Timemocassin: Dust 2D tau100.0mafft: Multiple Sequence Alignment - LSU RNAincompact3d: input.i3d 193 Cells Per Directionincompact3d: input.i3d 129 Cells Per Directionincompact3d: X3D-benchmarking input.i3dhmmer: Pfam Database Searchqmcpack: FeCO6_b3lyp_gmsqmcpack: FeCO6_b3lyp_gmsqmcpack: simple-H2Oqmcpack: Li2_STO_aenwchem: C240 Buckyballmrbayes: Primate Phylogeny Analysispalabos: 1000palabos: 500palabos: 400palabos: 100pennant: leblancbigpennant: sedovbigheffte: r2c - Stock - double-long - 512heffte: r2c - Stock - double-long - 256heffte: r2c - Stock - double-long - 128heffte: c2c - Stock - double-long - 512heffte: c2c - Stock - double-long - 256heffte: c2c - Stock - double-long - 128heffte: r2c - Stock - float-long - 512heffte: r2c - Stock - float-long - 256heffte: r2c - Stock - float-long - 128heffte: r2c - FFTW - double-long - 512heffte: r2c - FFTW - double-long - 256heffte: r2c - FFTW - double-long - 128heffte: c2c - Stock - float-long - 512heffte: c2c - Stock - float-long - 256heffte: c2c - Stock - float-long - 128heffte: c2c - FFTW - double-long - 512heffte: c2c - FFTW - double-long - 256heffte: c2c - FFTW - double-long - 128heffte: r2c - FFTW - float-long - 512heffte: r2c - FFTW - float-long - 256heffte: r2c - FFTW - float-long - 128heffte: c2c - FFTW - float-long - 512heffte: c2c - FFTW - float-long - 256heffte: c2c - FFTW - float-long - 128heffte: r2c - Stock - double - 512heffte: r2c - Stock - double - 256heffte: r2c - Stock - double - 128heffte: c2c - Stock - double - 512heffte: c2c - Stock - double - 256heffte: c2c - Stock - double - 128heffte: r2c - Stock - float - 512heffte: r2c - Stock - float - 256heffte: r2c - Stock - float - 128heffte: r2c - FFTW - double - 512heffte: r2c - FFTW - double - 256heffte: r2c - FFTW - double - 128heffte: c2c - Stock - float - 512heffte: c2c - Stock - float - 256heffte: c2c - Stock - float - 128heffte: c2c - FFTW - double - 512heffte: c2c - FFTW - double - 256heffte: c2c - FFTW - double - 128heffte: r2c - FFTW - float - 512heffte: r2c - FFTW - float - 256heffte: r2c - FFTW - float - 128heffte: c2c - FFTW - float - 512heffte: c2c - FFTW - float - 256heffte: c2c - FFTW - float - 128fftw: Float + SSE - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 32fftw: Stock - 2D FFT Size 4096fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 32fftw: Stock - 1D FFT Size 32laghos: Sedov Blast Wave, ube_922_hex.meshlaghos: Triple Point Problemffte: N=256, 1D Complex FFT Routinelibxsmm: 64libxsmm: 32libxsmm: 256libxsmm: 128amg: neat: dolfyn: Computational Fluid Dynamicsnamd: ATPase Simulation - 327,506 Atomscp2k: Fayalite-FISTcp2k: H20-64rodinia: OpenMP Streamclusterrodinia: OpenMP CFD Solverrodinia: OpenMP Leukocyterodinia: OpenMP LavaMDcloverleaf: Lagrangian-Eulerian Hydrodynamicsminibude: OpenMP - BM2minibude: OpenMP - BM2minibude: OpenMP - BM1minibude: OpenMP - BM1minife: Smallparboil: OpenMP MRI Griddingparboil: OpenMP Stencilparboil: OpenMP CUTCPparboil: OpenMP LBMlczero: BLAShpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: G-Fftehpcc: G-HPLnpb: SP.Cnpb: SP.Bnpb: MG.Cnpb: LU.Cnpb: IS.Dnpb: FT.Cnpb: EP.Dnpb: CG.Cnpb: BT.Chpl: hpcg: 160 160 160 - 1800hpcg: 144 144 144 - 1800hpcg: 104 104 104 - 1800hpcg: 160 160 160 - 60hpcg: 144 144 144 - 60hpcg: 104 104 104 - 60ior: 1024MB - Default Test Directoryior: 512MB - Default Test Directoryior: 256MB - Default Test Directoryior: 64MB - Default Test Directoryior: 32MB - Default Test Directoryior: 16MB - Default Test Directoryior: 8MB - Default Test Directoryior: 2MB - Default Test Directorywhisper-cpp: ggml-medium.en - 2016 State of the Unionwhisper-cpp: ggml-small.en - 2016 State of the Unionwhisper-cpp: ggml-base.en - 2016 State of the Unionfaiss: bench_polysemous_sift1m - Polysemous 30faiss: bench_polysemous_sift1m - Polysemous 34faiss: bench_polysemous_sift1m - Polysemous 38faiss: bench_polysemous_sift1m - Polysemous 42faiss: bench_polysemous_sift1m - Polysemous 46faiss: bench_polysemous_sift1m - Polysemous 50faiss: bench_polysemous_sift1m - Polysemous 54faiss: bench_polysemous_sift1m - Polysemous 58faiss: bench_polysemous_sift1m - Polysemous 62faiss: bench_polysemous_sift1m - Polysemous 64ncnn: CPU - FastestDetncnn: CPU - regnety_400mncnn: CPU - blazefacencnn: CPU - shufflenet-v2intel-mpi: IMB-MPI1 Sendrecvintel-mpi: IMB-MPI1 Sendrecvintel-mpi: IMB-MPI1 Exchangeintel-mpi: IMB-MPI1 Exchangeonednn: IP Shapes 1D - u8s8f32 - CPUrelion: Basic - CPUhpcc: G-Rand Accesshpcc: EP-DGEMMnpb: EP.Cior: 4MB - Default Test DirectorySamsung SSD 980 PRO 1TB24131315881267478.56230.830347.99258.80495.4241.9750.9520.4730.1620.1160.0440.0320.0120.0090.0020.8819.5125.1532.963.74949672000296734.93728.31459.3543.84210.56274.304118164.41171.1754311.931.2550822.478.851806.6716.993764.59100.45159.1510.361542.396.332524.35432.4036.9611.161432.351572.8010.111564.6510.161033.1815.39250.49060.433246.0352499.4775.9037.1415.769.6015.6911.453.726.2419.6010.232.136.303.884.984.044.2510.2237.149.5115.6911.373.666.1619.3510.236.333.884.044.2910.2416.7892.0273.0244.75214.6132.8611.67610.52759.10215397.72776573715297276654278202558852804539811367340.373924.7653622.407850.715815.859763.0298142.3392223.18948.7222114.611342.6437746.714629.098534.3496485.267865.09808.8145113.375272.3802438.92906.2004161.2329105.8412299.76554.4754223.338951.3821617.888933.448129.8909511.291061.94636.2699159.4054108.7560291.77611.1157893.91966.41274954.46334.5011222.045851.1578620.278322.211745.0039201.2608157.91059.994699.981159.7086533.08634.5868217.928022.65291405.235740.422324.7352623.589050.59806.00029.5384.0828.6683.5027.3482.0127.3185.5925.2476.32201.68194.81175.7210.60157.3610.55129.3310.229.909.3427862.91519.591485.9419693.521802.62165.591381.6625162.1997104721401.197.0446060.6236538343367742000286299000748649000733923000603.6568186.039740.4127028.925499.38148.105441.3218.2350.121770.41540538.47419.3451253.69416.8061255.04424.1621237.800.5200110.8576561.700381.565623.285850.6528560.2998451.813440.9394194580.38170120.6771271761.2420397.48336.81035.6773558176667466971000022.95902610911.4451051589.02993891925.4221842099.10976764412.780265.4356.11105.0822.15278.6061.03611.52112114.8168141.03570920.35659914665.882752.3381959.01936.4189178.7568.68221.43938645.01886018590.179036100.337181.92179.1525.36993.5721753.6103.314320.655284.807270.058384.3774.0094067.25457754.374473.116693.732527.796528.947351.1766108.971180.952170.18050.468564.7097103.85454.761382.969591.061327.704827.914259.0725100.731173.835192.58953.903575.7030104.96054.344373.448694.851727.731028.791551.1027108.915183.582169.96050.356564.7549104.47854.656982.944892.221627.667327.970559.0575100.404173.615191.32553.876075.7418104.342282716106747598176017141.59825.01148211256484.30267.50307873.59869846628.0309.61438.61116.1100145066727.93015.4220.3405599.45424.9384.9515.88130.91736.6708.47111.0252775.617108.2042705.09321660.4197.2608853.3479070.72196916.989112169127044.4531.800480.904091.7936814.5345329.05440108.1036750933.6993331.7057204.74151979.362718.0262960.516568.4423866.92132969.60103.6918.900118.900918.887718.903318.916419.12931489.061059.04907.56974.401009.38933.08813.53665.75959.47285475.98140208.249010.6590.6650.6810.7280.8731.2281.9073.0294.8975.9565.9517.592.285.4175.265133.49135.457897.230.692944413.0060.3542730.004406255.95645.60OpenBenchmarking.org

OpenCV

Test: DNN - Deep Neural Network

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: DNN - Deep Neural NetworkSamsung SSD 980 PRO 1TB5K10K15K20K25KSE +/- 279.29, N = 15241311. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.6Samsung SSD 980 PRO 1TB70M140M210M280M350MSE +/- 387235.49, N = 33158812671. (CXX) g++ options: -O3 -fopenmp -ldl

Scikit-Learn

Benchmark: Sparse Random Projections / 100 Iterations

OpenBenchmarking.orgSeconds, Fewer Is BetterScikit-Learn 1.2.2Benchmark: Sparse Random Projections / 100 IterationsSamsung SSD 980 PRO 1TB100200300400500SE +/- 0.25, N = 3478.561. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: 20 Newsgroups / Logistic Regression

OpenBenchmarking.orgSeconds, Fewer Is BetterScikit-Learn 1.2.2Benchmark: 20 Newsgroups / Logistic RegressionSamsung SSD 980 PRO 1TB714212835SE +/- 0.38, N = 430.831. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Covertype Dataset Benchmark

OpenBenchmarking.orgSeconds, Fewer Is BetterScikit-Learn 1.2.2Benchmark: Covertype Dataset BenchmarkSamsung SSD 980 PRO 1TB80160240320400SE +/- 0.45, N = 3347.991. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Text Vectorizers

OpenBenchmarking.orgSeconds, Fewer Is BetterScikit-Learn 1.2.2Benchmark: Text VectorizersSamsung SSD 980 PRO 1TB1326395265SE +/- 0.08, N = 358.801. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Scikit-Learn

Benchmark: Sparsify

OpenBenchmarking.orgSeconds, Fewer Is BetterScikit-Learn 1.2.2Benchmark: SparsifySamsung SSD 980 PRO 1TB20406080100SE +/- 0.20, N = 395.421. (F9X) gfortran options: -O3 -fopenmp -fno-tree-vectorize -lm -lpthread -lgfortran -lc

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral MixingSamsung SSD 980 PRO 1TB0.44440.88881.33321.77762.222SE +/- 0.007, N = 31.975

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of StateSamsung SSD 980 PRO 1TB0.21420.42840.64260.85681.071SE +/- 0.001, N = 30.952

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Isoneutral Mixing

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Isoneutral MixingSamsung SSD 980 PRO 1TB0.10640.21280.31920.42560.532SE +/- 0.002, N = 30.473

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of State

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of StateSamsung SSD 980 PRO 1TB0.03650.0730.10950.1460.1825SE +/- 0.001, N = 30.162

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Isoneutral Mixing

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Isoneutral MixingSamsung SSD 980 PRO 1TB0.02610.05220.07830.10440.1305SE +/- 0.000, N = 30.116

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Equation of State

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Equation of StateSamsung SSD 980 PRO 1TB0.00990.01980.02970.03960.0495SE +/- 0.000, N = 30.044

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Isoneutral Mixing

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Isoneutral MixingSamsung SSD 980 PRO 1TB0.00720.01440.02160.02880.036SE +/- 0.000, N = 30.032

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Equation of State

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Equation of StateSamsung SSD 980 PRO 1TB0.00270.00540.00810.01080.0135SE +/- 0.000, N = 150.012

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Isoneutral Mixing

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Isoneutral MixingSamsung SSD 980 PRO 1TB0.0020.0040.0060.0080.01SE +/- 0.000, N = 30.009

PyHPC Benchmarks

Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Equation of State

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Equation of StateSamsung SSD 980 PRO 1TB0.00050.0010.00150.0020.0025SE +/- 0.000, N = 30.002

Mlpack Benchmark

Benchmark: scikit_linearridgeregression

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregressionSamsung SSD 980 PRO 1TB0.1980.3960.5940.7920.99SE +/- 0.01, N = 150.88

Mlpack Benchmark

Benchmark: scikit_svm

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svmSamsung SSD 980 PRO 1TB510152025SE +/- 0.03, N = 319.51

Mlpack Benchmark

Benchmark: scikit_qda

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_qdaSamsung SSD 980 PRO 1TB612182430SE +/- 0.01, N = 325.15

Mlpack Benchmark

Benchmark: scikit_ica

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_icaSamsung SSD 980 PRO 1TB816243240SE +/- 0.07, N = 332.96

Faiss

Test: bench_polysemous_sift1m - PQ baseline

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - PQ baselineSamsung SSD 980 PRO 1TB0.84351.6872.53053.3744.2175SE +/- 0.008, N = 33.7491. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

AI Benchmark Alpha

Device AI Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreSamsung SSD 980 PRO 1TB110022003300440055004967

AI Benchmark Alpha

Device Training Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreSamsung SSD 980 PRO 1TB4008001200160020002000

AI Benchmark Alpha

Device Inference Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreSamsung SSD 980 PRO 1TB60012001800240030002967

Numenta Anomaly Benchmark

Detector: Contextual Anomaly Detector OSE

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Contextual Anomaly Detector OSESamsung SSD 980 PRO 1TB816243240SE +/- 0.06, N = 334.94

Numenta Anomaly Benchmark

Detector: Bayesian Changepoint

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Bayesian ChangepointSamsung SSD 980 PRO 1TB714212835SE +/- 0.17, N = 328.31

Numenta Anomaly Benchmark

Detector: Earthgecko Skyline

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Earthgecko SkylineSamsung SSD 980 PRO 1TB1326395265SE +/- 0.13, N = 359.35

Numenta Anomaly Benchmark

Detector: Windowed Gaussian

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Windowed GaussianSamsung SSD 980 PRO 1TB0.86451.7292.59353.4584.3225SE +/- 0.004, N = 33.842

Numenta Anomaly Benchmark

Detector: Relative Entropy

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Relative EntropySamsung SSD 980 PRO 1TB3691215SE +/- 0.06, N = 310.56

Numenta Anomaly Benchmark

Detector: KNN CAD

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: KNN CADSamsung SSD 980 PRO 1TB1632486480SE +/- 0.23, N = 374.30

PETSc

Test: Streams

OpenBenchmarking.orgMB/s, More Is BetterPETSc 3.19Test: StreamsSamsung SSD 980 PRO 1TB30K60K90K120K150KSE +/- 21.10, N = 3118164.411. (CC) gcc options: -fPIC -O3 -O2 -lpthread -lm

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB0.26330.52660.78991.05321.3165SE +/- 0.00, N = 31.17MIN: 0.92 / MAX: 38.441. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB12K24K36K48K60KSE +/- 223.97, N = 354311.931. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUSamsung SSD 980 PRO 1TB0.28130.56260.84391.12521.4065SE +/- 0.01, N = 31.25MIN: 0.99 / MAX: 27.851. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUSamsung SSD 980 PRO 1TB11K22K33K44K55KSE +/- 332.81, N = 350822.471. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB246810SE +/- 0.02, N = 38.85MIN: 7.32 / MAX: 26.61. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB400800120016002000SE +/- 4.71, N = 31806.671. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB48121620SE +/- 0.03, N = 316.99MIN: 14.92 / MAX: 31.841. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB8001600240032004000SE +/- 6.50, N = 33764.591. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUSamsung SSD 980 PRO 1TB20406080100SE +/- 0.45, N = 3100.45MIN: 86.37 / MAX: 135.941. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.71, N = 3159.151. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB3691215SE +/- 0.01, N = 310.36MIN: 8.69 / MAX: 38.361. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB30060090012001500SE +/- 0.74, N = 31542.391. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB246810SE +/- 0.01, N = 36.33MIN: 5.87 / MAX: 25.461. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB5001000150020002500SE +/- 4.19, N = 32524.351. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB90180270360450SE +/- 2.59, N = 3432.40MIN: 402.42 / MAX: 466.311. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUSamsung SSD 980 PRO 1TB816243240SE +/- 0.21, N = 336.961. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB3691215SE +/- 0.04, N = 311.16MIN: 10.05 / MAX: 33.111. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB30060090012001500SE +/- 4.82, N = 31432.351. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUSamsung SSD 980 PRO 1TB30060090012001500SE +/- 5.09, N = 31572.80MIN: 1371.92 / MAX: 1836.381. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUSamsung SSD 980 PRO 1TB3691215SE +/- 0.03, N = 310.111. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB30060090012001500SE +/- 2.08, N = 31564.65MIN: 1396.52 / MAX: 1795.31. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB3691215SE +/- 0.01, N = 310.161. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB2004006008001000SE +/- 0.77, N = 31033.18MIN: 900.57 / MAX: 1094.111. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUSamsung SSD 980 PRO 1TB48121620SE +/- 0.00, N = 315.391. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1Samsung SSD 980 PRO 1TB50100150200250SE +/- 0.11, N = 3250.49MIN: 250.17 / MAX: 250.981. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2Samsung SSD 980 PRO 1TB1428425670SE +/- 0.29, N = 360.43MIN: 59.69 / MAX: 60.991. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2Samsung SSD 980 PRO 1TB50100150200250SE +/- 0.06, N = 3246.04MIN: 244.84 / MAX: 260.761. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetSamsung SSD 980 PRO 1TB5001000150020002500SE +/- 1.35, N = 32499.48MIN: 2428.85 / MAX: 2554.761. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

NCNN

Target: Vulkan GPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: FastestDetSamsung SSD 980 PRO 1TB1.32752.6553.98255.316.6375SE +/- 0.06, N = 35.90MIN: 5.66 / MAX: 16.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vision_transformerSamsung SSD 980 PRO 1TB918273645SE +/- 0.05, N = 337.14MIN: 36.16 / MAX: 57.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: regnety_400mSamsung SSD 980 PRO 1TB48121620SE +/- 0.48, N = 315.76MIN: 14.71 / MAX: 59.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: squeezenet_ssdSamsung SSD 980 PRO 1TB3691215SE +/- 0.05, N = 39.60MIN: 9.24 / MAX: 20.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: yolov4-tinySamsung SSD 980 PRO 1TB48121620SE +/- 0.04, N = 315.69MIN: 14.78 / MAX: 27.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet50Samsung SSD 980 PRO 1TB3691215SE +/- 0.06, N = 311.45MIN: 11.22 / MAX: 23.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: alexnetSamsung SSD 980 PRO 1TB0.8371.6742.5113.3484.185SE +/- 0.06, N = 33.72MIN: 3.52 / MAX: 7.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: resnet18Samsung SSD 980 PRO 1TB246810SE +/- 0.08, N = 36.24MIN: 6.04 / MAX: 20.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: vgg16Samsung SSD 980 PRO 1TB510152025SE +/- 0.15, N = 319.60MIN: 19.07 / MAX: 106.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: googlenetSamsung SSD 980 PRO 1TB3691215SE +/- 0.06, N = 310.23MIN: 9.96 / MAX: 21.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: blazefaceSamsung SSD 980 PRO 1TB0.47930.95861.43791.91722.3965SE +/- 0.06, N = 32.13MIN: 2 / MAX: 3.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: efficientnet-b0Samsung SSD 980 PRO 1TB246810SE +/- 0.05, N = 36.30MIN: 6.15 / MAX: 15.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mnasnetSamsung SSD 980 PRO 1TB0.8731.7462.6193.4924.365SE +/- 0.02, N = 33.88MIN: 3.78 / MAX: 11.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: shufflenet-v2Samsung SSD 980 PRO 1TB1.12052.2413.36154.4825.6025SE +/- 0.01, N = 34.98MIN: 4.83 / MAX: 14.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3Samsung SSD 980 PRO 1TB0.9091.8182.7273.6364.545SE +/- 0.05, N = 34.04MIN: 3.87 / MAX: 14.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2Samsung SSD 980 PRO 1TB0.95631.91262.86893.82524.7815SE +/- 0.06, N = 34.25MIN: 4.11 / MAX: 19.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: Vulkan GPU - Model: mobilenetSamsung SSD 980 PRO 1TB3691215SE +/- 0.03, N = 310.22MIN: 10 / MAX: 21.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformerSamsung SSD 980 PRO 1TB918273645SE +/- 0.30, N = 337.14MIN: 36.07 / MAX: 98.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssdSamsung SSD 980 PRO 1TB3691215SE +/- 0.06, N = 39.51MIN: 9.12 / MAX: 20.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinySamsung SSD 980 PRO 1TB48121620SE +/- 0.06, N = 315.69MIN: 14.82 / MAX: 63.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50Samsung SSD 980 PRO 1TB3691215SE +/- 0.05, N = 311.37MIN: 11.1 / MAX: 22.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetSamsung SSD 980 PRO 1TB0.82351.6472.47053.2944.1175SE +/- 0.02, N = 33.66MIN: 3.54 / MAX: 9.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18Samsung SSD 980 PRO 1TB246810SE +/- 0.02, N = 36.16MIN: 5.96 / MAX: 17.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16Samsung SSD 980 PRO 1TB510152025SE +/- 0.04, N = 319.35MIN: 19.07 / MAX: 29.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetSamsung SSD 980 PRO 1TB3691215SE +/- 0.07, N = 310.23MIN: 9.99 / MAX: 21.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0Samsung SSD 980 PRO 1TB246810SE +/- 0.10, N = 36.33MIN: 6.07 / MAX: 15.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetSamsung SSD 980 PRO 1TB0.8731.7462.6193.4924.365SE +/- 0.08, N = 33.88MIN: 3.7 / MAX: 6.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3Samsung SSD 980 PRO 1TB0.9091.8182.7273.6364.545SE +/- 0.08, N = 34.04MIN: 3.8 / MAX: 13.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2Samsung SSD 980 PRO 1TB0.96531.93062.89593.86124.8265SE +/- 0.13, N = 34.29MIN: 4.06 / MAX: 17.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetSamsung SSD 980 PRO 1TB3691215SE +/- 0.13, N = 310.24MIN: 9.92 / MAX: 84.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: inception-v3Samsung SSD 980 PRO 1TB48121620SE +/- 0.09, N = 316.79MIN: 16.52 / MAX: 24.511. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: mobilenet-v1-1.0Samsung SSD 980 PRO 1TB0.45610.91221.36831.82442.2805SE +/- 0.010, N = 32.027MIN: 1.98 / MAX: 3.141. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: MobileNetV2_224Samsung SSD 980 PRO 1TB0.68041.36082.04122.72163.402SE +/- 0.004, N = 33.024MIN: 2.98 / MAX: 4.171. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: SqueezeNetV1.0Samsung SSD 980 PRO 1TB1.06922.13843.20764.27685.346SE +/- 0.021, N = 34.752MIN: 4.66 / MAX: 5.671. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: resnet-v2-50Samsung SSD 980 PRO 1TB48121620SE +/- 0.06, N = 314.61MIN: 14.39 / MAX: 15.811. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: squeezenetv1.1

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: squeezenetv1.1Samsung SSD 980 PRO 1TB0.64371.28741.93112.57483.2185SE +/- 0.047, N = 32.861MIN: 2.74 / MAX: 35.681. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenetV3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: mobilenetV3Samsung SSD 980 PRO 1TB0.37710.75421.13131.50841.8855SE +/- 0.023, N = 31.676MIN: 1.61 / MAX: 2.841. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: nasnet

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: nasnetSamsung SSD 980 PRO 1TB3691215SE +/- 0.07, N = 310.53MIN: 10.29 / MAX: 11.771. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon NanotubeSamsung SSD 980 PRO 1TB1326395265SE +/- 0.16, N = 359.101. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5kmSamsung SSD 980 PRO 1TB3K6K9K12K15K15397.731. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 1000Samsung SSD 980 PRO 1TB160K320K480K640K800KSE +/- 849.11, N = 37657371. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 200Samsung SSD 980 PRO 1TB30K60K90K120K150KSE +/- 103.51, N = 31529721. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 100Samsung SSD 980 PRO 1TB16K32K48K64K80KSE +/- 62.63, N = 3766541. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 1000Samsung SSD 980 PRO 1TB60K120K180K240K300KSE +/- 95.71, N = 32782021. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 200Samsung SSD 980 PRO 1TB12K24K36K48K60KSE +/- 105.71, N = 3558851. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 100Samsung SSD 980 PRO 1TB6K12K18K24K30KSE +/- 66.71, N = 3280451. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

spaCy

Model: en_core_web_trf

OpenBenchmarking.orgtokens/sec, More Is BetterspaCy 3.4.1Model: en_core_web_trfSamsung SSD 980 PRO 1TB9001800270036004500SE +/- 9.84, N = 33981

spaCy

Model: en_core_web_lg

OpenBenchmarking.orgtokens/sec, More Is BetterspaCy 3.4.1Model: en_core_web_lgSamsung SSD 980 PRO 1TB3K6K9K12K15KSE +/- 35.10, N = 313673

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB918273645SE +/- 0.13, N = 340.37

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB612182430SE +/- 0.08, N = 324.77

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB130260390520650SE +/- 2.89, N = 3622.41

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1122334455SE +/- 0.24, N = 350.72

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB48121620SE +/- 0.05, N = 315.86

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB1428425670SE +/- 0.19, N = 363.03

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB306090120150SE +/- 0.52, N = 3142.34

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB50100150200250SE +/- 0.80, N = 3223.19

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB246810SE +/- 0.0297, N = 38.7222

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB306090120150SE +/- 0.39, N = 3114.61

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1020304050SE +/- 0.05, N = 342.64

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB160320480640800SE +/- 1.16, N = 3746.71

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB714212835SE +/- 0.12, N = 329.10

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB816243240SE +/- 0.14, N = 334.35

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB110220330440550SE +/- 3.29, N = 3485.27

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1530456075SE +/- 0.47, N = 365.10

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB246810SE +/- 0.0357, N = 38.8145

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB306090120150SE +/- 0.46, N = 3113.38

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1632486480SE +/- 0.25, N = 372.38

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB100200300400500SE +/- 1.34, N = 3438.93

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB246810SE +/- 0.0287, N = 36.2004

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.74, N = 3161.23

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB20406080100SE +/- 0.71, N = 3105.84

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB70140210280350SE +/- 1.99, N = 3299.77

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB1.0072.0143.0214.0285.035SE +/- 0.0289, N = 34.4754

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB50100150200250SE +/- 1.43, N = 3223.34

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1224364860SE +/- 0.18, N = 351.38

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB130260390520650SE +/- 1.98, N = 3617.89

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB816243240SE +/- 0.09, N = 333.45

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB714212835SE +/- 0.08, N = 329.89

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB110220330440550SE +/- 0.79, N = 3511.29

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1428425670SE +/- 0.09, N = 361.95

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB246810SE +/- 0.0201, N = 36.2699

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.51, N = 3159.41

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB20406080100SE +/- 1.44, N = 3108.76

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB60120180240300SE +/- 3.82, N = 3291.78

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB0.2510.5020.7531.0041.255SE +/- 0.0076, N = 31.1157

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB2004006008001000SE +/- 6.11, N = 3893.92

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB246810SE +/- 0.0106, N = 36.4127

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB11002200330044005500SE +/- 7.70, N = 34954.46

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB1.01272.02543.03814.05085.0635SE +/- 0.0054, N = 34.5011

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB50100150200250SE +/- 0.26, N = 3222.05

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1224364860SE +/- 0.04, N = 351.16

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB130260390520650SE +/- 0.42, N = 3620.28

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB510152025SE +/- 0.13, N = 322.21

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB1020304050SE +/- 0.25, N = 345.00

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB4080120160200SE +/- 1.81, N = 3201.26

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB306090120150SE +/- 1.43, N = 3157.91

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB3691215SE +/- 0.0208, N = 39.9946

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB20406080100SE +/- 0.21, N = 399.98

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1326395265SE +/- 0.54, N = 359.71

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB120240360480600SE +/- 4.89, N = 3533.09

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB1.0322.0643.0964.1285.16SE +/- 0.0128, N = 34.5868

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB50100150200250SE +/- 0.61, N = 3217.93

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB510152025SE +/- 0.14, N = 322.65

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB30060090012001500SE +/- 8.56, N = 31405.24

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB918273645SE +/- 0.07, N = 340.42

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-StreamSamsung SSD 980 PRO 1TB612182430SE +/- 0.05, N = 324.74

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB130260390520650SE +/- 3.00, N = 3623.59

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamSamsung SSD 980 PRO 1TB1122334455SE +/- 0.27, N = 350.60

GNU Octave Benchmark

OpenBenchmarking.orgSeconds, Fewer Is BetterGNU Octave Benchmark 6.4.0Samsung SSD 980 PRO 1TB246810SE +/- 0.025, N = 56.000

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: ResNet-50Samsung SSD 980 PRO 1TB714212835SE +/- 0.02, N = 329.53

TensorFlow

Device: CPU - Batch Size: 512 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: GoogLeNetSamsung SSD 980 PRO 1TB20406080100SE +/- 0.07, N = 384.08

TensorFlow

Device: CPU - Batch Size: 256 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: ResNet-50Samsung SSD 980 PRO 1TB714212835SE +/- 0.03, N = 328.66

TensorFlow

Device: CPU - Batch Size: 256 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: GoogLeNetSamsung SSD 980 PRO 1TB20406080100SE +/- 0.02, N = 383.50

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50Samsung SSD 980 PRO 1TB612182430SE +/- 0.00, N = 327.34

TensorFlow

Device: CPU - Batch Size: 64 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: GoogLeNetSamsung SSD 980 PRO 1TB20406080100SE +/- 0.34, N = 382.01

TensorFlow

Device: CPU - Batch Size: 32 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50Samsung SSD 980 PRO 1TB612182430SE +/- 0.12, N = 327.31

TensorFlow

Device: CPU - Batch Size: 32 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: GoogLeNetSamsung SSD 980 PRO 1TB20406080100SE +/- 0.38, N = 385.59

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50Samsung SSD 980 PRO 1TB612182430SE +/- 0.28, N = 325.24

TensorFlow

Device: CPU - Batch Size: 16 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: GoogLeNetSamsung SSD 980 PRO 1TB20406080100SE +/- 0.16, N = 376.32

TensorFlow

Device: CPU - Batch Size: 512 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: AlexNetSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.09, N = 3201.68

TensorFlow

Device: CPU - Batch Size: 256 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: AlexNetSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.88, N = 3194.81

TensorFlow

Device: CPU - Batch Size: 64 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: AlexNetSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.73, N = 3175.72

TensorFlow

Device: CPU - Batch Size: 512 - Model: VGG-16

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: VGG-16Samsung SSD 980 PRO 1TB3691215SE +/- 0.01, N = 310.60

TensorFlow

Device: CPU - Batch Size: 32 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: AlexNetSamsung SSD 980 PRO 1TB306090120150SE +/- 0.03, N = 3157.36

TensorFlow

Device: CPU - Batch Size: 256 - Model: VGG-16

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: VGG-16Samsung SSD 980 PRO 1TB3691215SE +/- 0.01, N = 310.55

TensorFlow

Device: CPU - Batch Size: 16 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: AlexNetSamsung SSD 980 PRO 1TB306090120150SE +/- 1.42, N = 3129.33

TensorFlow

Device: CPU - Batch Size: 64 - Model: VGG-16

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: VGG-16Samsung SSD 980 PRO 1TB3691215SE +/- 0.01, N = 310.22

TensorFlow

Device: CPU - Batch Size: 32 - Model: VGG-16

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: VGG-16Samsung SSD 980 PRO 1TB3691215SE +/- 0.01, N = 39.90

TensorFlow

Device: CPU - Batch Size: 16 - Model: VGG-16

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: VGG-16Samsung SSD 980 PRO 1TB3691215SE +/- 0.01, N = 39.34

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception ResNet V2Samsung SSD 980 PRO 1TB6K12K18K24K30KSE +/- 57.85, N = 327862.9

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet QuantSamsung SSD 980 PRO 1TB30060090012001500SE +/- 2.79, N = 31519.59

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet FloatSamsung SSD 980 PRO 1TB30060090012001500SE +/- 3.31, N = 31485.94

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: NASNet MobileSamsung SSD 980 PRO 1TB4K8K12K16K20KSE +/- 36.01, N = 319693.5

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception V4Samsung SSD 980 PRO 1TB5K10K15K20K25KSE +/- 39.22, N = 321802.6

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: SqueezeNetSamsung SSD 980 PRO 1TB5001000150020002500SE +/- 2.26, N = 32165.59

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Euclidean Cluster

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous Suite 2021.11.02Backend: OpenMP - Kernel: Euclidean ClusterSamsung SSD 980 PRO 1TB30060090012001500SE +/- 1.30, N = 31381.661. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: Points2Image

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous Suite 2021.11.02Backend: OpenMP - Kernel: Points2ImageSamsung SSD 980 PRO 1TB5K10K15K20K25KSE +/- 160.89, N = 325162.201. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Darmstadt Automotive Parallel Heterogeneous Suite

Backend: OpenMP - Kernel: NDT Mapping

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous Suite 2021.11.02Backend: OpenMP - Kernel: NDT MappingSamsung SSD 980 PRO 1TB30060090012001500SE +/- 6.20, N = 31401.191. (CXX) g++ options: -O3 -std=c++11 -fopenmp

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_bareSamsung SSD 980 PRO 1TB246810SE +/- 0.001, N = 37.0441. (CXX) g++ options: -O3

Intel MPI Benchmarks

Test: IMB-MPI1 PingPong

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 PingPongSamsung SSD 980 PRO 1TB13002600390052006500SE +/- 90.55, N = 156060.62MIN: 5.41 / MAX: 21642.931. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-P2P PingPong

OpenBenchmarking.orgAverage Msg/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-P2P PingPongSamsung SSD 980 PRO 1TB8M16M24M32M40MSE +/- 45475.10, N = 336538343MIN: 9387 / MAX: 905983591. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26Samsung SSD 980 PRO 1TB80M160M240M320M400M3677420001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26Samsung SSD 980 PRO 1TB60M120M180M240M300M2862990001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26Samsung SSD 980 PRO 1TB160M320M480M640M800M7486490001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26Samsung SSD 980 PRO 1TB160M320M480M640M800M7339230001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPSamsung SSD 980 PRO 1TB130260390520650SE +/- 3.22, N = 3603.661. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - DegriddingSamsung SSD 980 PRO 1TB2K4K6K8K10KSE +/- 33.62, N = 158186.031. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - GriddingSamsung SSD 980 PRO 1TB2K4K6K8K10KSE +/- 124.76, N = 159740.411. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingSamsung SSD 980 PRO 1TB6K12K18K24K30KSE +/- 116.50, N = 327028.91. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingSamsung SSD 980 PRO 1TB5K10K15K20K25KSE +/- 208.17, N = 325499.31. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - DegriddingSamsung SSD 980 PRO 1TB2K4K6K8K10KSE +/- 4.68, N = 38148.101. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - GriddingSamsung SSD 980 PRO 1TB12002400360048006000SE +/- 0.58, N = 35441.321. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

RNNoise

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28Samsung SSD 980 PRO 1TB48121620SE +/- 0.02, N = 318.241. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

R Benchmark

OpenBenchmarking.orgSeconds, Fewer Is BetterR BenchmarkSamsung SSD 980 PRO 1TB0.02740.05480.08220.10960.137SE +/- 0.0010, N = 30.12171. R scripting front-end version 4.1.2 (2021-11-01)

DeepSpeech

Acceleration: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterDeepSpeech 0.6Acceleration: CPUSamsung SSD 980 PRO 1TB1632486480SE +/- 0.79, N = 470.42

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkSamsung SSD 980 PRO 1TB120240360480600SE +/- 1.09, N = 3538.47

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUSamsung SSD 980 PRO 1TB90180270360450SE +/- 4.15, N = 15419.35MIN: 380.651. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUSamsung SSD 980 PRO 1TB30060090012001500SE +/- 9.95, N = 31253.69MIN: 1228.291. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUSamsung SSD 980 PRO 1TB90180270360450SE +/- 5.05, N = 15416.81MIN: 370.031. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUSamsung SSD 980 PRO 1TB30060090012001500SE +/- 9.42, N = 31255.04MIN: 1235.241. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUSamsung SSD 980 PRO 1TB90180270360450SE +/- 5.95, N = 15424.16MIN: 392.441. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUSamsung SSD 980 PRO 1TB30060090012001500SE +/- 12.15, N = 31237.80MIN: 1206.51. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.1170.2340.3510.4680.585SE +/- 0.005655, N = 40.520011MIN: 0.491. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.1930.3860.5790.7720.965SE +/- 0.009588, N = 40.857656MIN: 0.771. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.38260.76521.14781.53041.913SE +/- 0.02013, N = 31.70038MIN: 1.531. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.35230.70461.05691.40921.7615SE +/- 0.00946, N = 31.56562MIN: 1.41. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.73931.47862.21792.95723.6965SE +/- 0.00823, N = 33.28585MIN: 2.851. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.14690.29380.44070.58760.7345SE +/- 0.005226, N = 90.652856MIN: 0.621. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.06750.1350.20250.270.3375SE +/- 0.001871, N = 30.299845MIN: 0.271. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.4080.8161.2241.6322.04SE +/- 0.00446, N = 31.81344MIN: 1.681. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.21140.42280.63420.84561.057SE +/- 0.014147, N = 150.939419MIN: 0.781. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverSamsung SSD 980 PRO 1TB10002000300040005000SE +/- 17.66, N = 34580.381. (CC) gcc options: -O3 -mavx2

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateSamsung SSD 980 PRO 1TB510152025SE +/- 0.23, N = 320.681. (CC) gcc options: -O3 -march=native -fopenmp

ArrayFire

Test: BLAS CPU

OpenBenchmarking.orgGFLOPS, More Is BetterArrayFire 3.7Test: BLAS CPUSamsung SSD 980 PRO 1TB400800120016002000SE +/- 6.85, N = 31761.241. (CXX) g++ options: -rdynamic

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3Samsung SSD 980 PRO 1TB4K8K12K16K20KSE +/- 70.84, N = 320397.481. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinSamsung SSD 980 PRO 1TB816243240SE +/- 0.24, N = 336.811. (CXX) g++ options: -O3 -lm -ldl

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsSamsung SSD 980 PRO 1TB816243240SE +/- 0.25, N = 335.681. (CXX) g++ options: -O3 -lm -ldl

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe PeriodicSamsung SSD 980 PRO 1TB800M1600M2400M3200M4000MSE +/- 624615.44, N = 335581766671. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

nekRS

Input: Kershaw

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: KershawSamsung SSD 980 PRO 1TB1000M2000M3000M4000M5000MSE +/- 12527076.01, N = 346697100001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered HalfspaceSamsung SSD 980 PRO 1TB612182430SE +/- 0.11, N = 322.961. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous HalfspaceSamsung SSD 980 PRO 1TB3691215SE +/- 0.07, N = 311.451. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic ModelSamsung SSD 980 PRO 1TB3691215SE +/- 0.068910369, N = 39.0299389191. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered HalfspaceSamsung SSD 980 PRO 1TB612182430SE +/- 0.19, N = 325.421. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. HelensSamsung SSD 980 PRO 1TB3691215SE +/- 0.115311179, N = 39.1097676441. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap ExampleSamsung SSD 980 PRO 1TB3691215SE +/- 0.07, N = 312.781. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF112Samsung SSD 980 PRO 1TB60120180240300SE +/- 0.42, N = 3265.431. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal InstallationSamsung SSD 980 PRO 1TB1326395265SE +/- 0.25, N = 356.11

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on WindshieldSamsung SSD 980 PRO 1TB20406080100SE +/- 0.65, N = 3105.08

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop TestSamsung SSD 980 PRO 1TB510152025SE +/- 0.08, N = 322.15

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1MSamsung SSD 980 PRO 1TB60120180240300SE +/- 0.84, N = 3278.60

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper BeamSamsung SSD 980 PRO 1TB1428425670SE +/- 0.23, N = 361.03

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution TimeSamsung SSD 980 PRO 1TB130260390520650611.521. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh TimeSamsung SSD 980 PRO 1TB306090120150114.821. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeSamsung SSD 980 PRO 1TB91827364541.041. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeSamsung SSD 980 PRO 1TB51015202520.361. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Large Mesh Size - Execution TimeSamsung SSD 980 PRO 1TB3K6K9K12K15K14665.881. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Large Mesh Size - Mesh TimeSamsung SSD 980 PRO 1TB160320480640800752.341. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: motorBike - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution TimeSamsung SSD 980 PRO 1TB132639526559.021. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: motorBike - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Mesh TimeSamsung SSD 980 PRO 1TB81624324036.421. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2.02.73.3Input: Dust 2D tau100.0Samsung SSD 980 PRO 1TB4080120160200SE +/- 1.31, N = 3178.761. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNASamsung SSD 980 PRO 1TB246810SE +/- 0.058, N = 38.6821. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionSamsung SSD 980 PRO 1TB510152025SE +/- 0.03, N = 321.441. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per DirectionSamsung SSD 980 PRO 1TB1.12922.25843.38764.51685.646SE +/- 0.03302499, N = 35.018860181. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dSamsung SSD 980 PRO 1TB130260390520650SE +/- 0.04, N = 3590.181. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database SearchSamsung SSD 980 PRO 1TB20406080100SE +/- 0.06, N = 3100.341. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm -lmpi

QMCPACK

Input: FeCO6_b3lyp_gms

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: FeCO6_b3lyp_gmsSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.67, N = 3181.921. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

QMCPACK

Input: FeCO6_b3lyp_gms

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: FeCO6_b3lyp_gmsSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.41, N = 3179.151. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: simple-H2OSamsung SSD 980 PRO 1TB612182430SE +/- 0.05, N = 325.371. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

QMCPACK

Input: Li2_STO_ae

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: Li2_STO_aeSamsung SSD 980 PRO 1TB20406080100SE +/- 0.55, N = 393.571. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballSamsung SSD 980 PRO 1TB4008001200160020001753.61. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisSamsung SSD 980 PRO 1TB20406080100SE +/- 0.67, N = 3103.311. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm -lreadline

Palabos

Grid Size: 1000

OpenBenchmarking.orgMega Site Updates Per Second, More Is BetterPalabos 2.3Grid Size: 1000Samsung SSD 980 PRO 1TB70140210280350SE +/- 0.29, N = 3320.661. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm

Palabos

Grid Size: 500

OpenBenchmarking.orgMega Site Updates Per Second, More Is BetterPalabos 2.3Grid Size: 500Samsung SSD 980 PRO 1TB60120180240300SE +/- 0.07, N = 3284.811. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm

Palabos

Grid Size: 400

OpenBenchmarking.orgMega Site Updates Per Second, More Is BetterPalabos 2.3Grid Size: 400Samsung SSD 980 PRO 1TB60120180240300SE +/- 0.10, N = 3270.061. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm

Palabos

Grid Size: 100

OpenBenchmarking.orgMega Site Updates Per Second, More Is BetterPalabos 2.3Grid Size: 100Samsung SSD 980 PRO 1TB80160240320400SE +/- 0.82, N = 3384.381. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigSamsung SSD 980 PRO 1TB0.90211.80422.70633.60844.5105SE +/- 0.003859, N = 34.0094061. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigSamsung SSD 980 PRO 1TB246810SE +/- 0.045366, N = 37.2545771. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512Samsung SSD 980 PRO 1TB1224364860SE +/- 0.02, N = 354.371. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256Samsung SSD 980 PRO 1TB1632486480SE +/- 0.19, N = 373.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 0.69, N = 393.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512Samsung SSD 980 PRO 1TB714212835SE +/- 0.02, N = 327.801. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256Samsung SSD 980 PRO 1TB714212835SE +/- 0.03, N = 328.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128Samsung SSD 980 PRO 1TB1224364860SE +/- 0.34, N = 351.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512Samsung SSD 980 PRO 1TB20406080100SE +/- 0.05, N = 3108.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256Samsung SSD 980 PRO 1TB4080120160200SE +/- 0.37, N = 3180.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128Samsung SSD 980 PRO 1TB4080120160200SE +/- 0.20, N = 3170.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512Samsung SSD 980 PRO 1TB1122334455SE +/- 0.04, N = 350.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256Samsung SSD 980 PRO 1TB1428425670SE +/- 0.19, N = 364.711. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 1.24, N = 3103.851. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512Samsung SSD 980 PRO 1TB1224364860SE +/- 0.03, N = 354.761. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256Samsung SSD 980 PRO 1TB20406080100SE +/- 0.11, N = 382.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 0.51, N = 391.061. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512Samsung SSD 980 PRO 1TB714212835SE +/- 0.02, N = 327.701. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256Samsung SSD 980 PRO 1TB714212835SE +/- 0.02, N = 327.911. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128Samsung SSD 980 PRO 1TB1326395265SE +/- 0.41, N = 359.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512Samsung SSD 980 PRO 1TB20406080100SE +/- 0.06, N = 3100.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256Samsung SSD 980 PRO 1TB4080120160200SE +/- 0.68, N = 3173.841. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128Samsung SSD 980 PRO 1TB4080120160200SE +/- 1.46, N = 3192.591. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512Samsung SSD 980 PRO 1TB1224364860SE +/- 0.04, N = 353.901. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256Samsung SSD 980 PRO 1TB20406080100SE +/- 0.30, N = 375.701. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 0.20, N = 3104.961. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 512Samsung SSD 980 PRO 1TB1224364860SE +/- 0.01, N = 354.341. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 256Samsung SSD 980 PRO 1TB1632486480SE +/- 0.08, N = 373.451. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: double - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 0.68, N = 394.851. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 512Samsung SSD 980 PRO 1TB714212835SE +/- 0.01, N = 327.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 256Samsung SSD 980 PRO 1TB714212835SE +/- 0.05, N = 328.791. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: double - X Y Z: 128Samsung SSD 980 PRO 1TB1224364860SE +/- 0.55, N = 451.101. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 512Samsung SSD 980 PRO 1TB20406080100SE +/- 0.02, N = 3108.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 256Samsung SSD 980 PRO 1TB4080120160200SE +/- 1.53, N = 3183.581. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: Stock - Precision: float - X Y Z: 128Samsung SSD 980 PRO 1TB4080120160200SE +/- 0.74, N = 3169.961. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512Samsung SSD 980 PRO 1TB1122334455SE +/- 0.01, N = 350.361. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256Samsung SSD 980 PRO 1TB1428425670SE +/- 0.08, N = 364.751. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 0.59, N = 3104.481. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 512Samsung SSD 980 PRO 1TB1224364860SE +/- 0.02, N = 354.661. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 256Samsung SSD 980 PRO 1TB20406080100SE +/- 0.16, N = 382.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: Stock - Precision: float - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 0.82, N = 392.221. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512Samsung SSD 980 PRO 1TB714212835SE +/- 0.02, N = 327.671. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256Samsung SSD 980 PRO 1TB714212835SE +/- 0.03, N = 327.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128Samsung SSD 980 PRO 1TB1326395265SE +/- 0.57, N = 359.061. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512Samsung SSD 980 PRO 1TB20406080100SE +/- 0.20, N = 3100.401. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256Samsung SSD 980 PRO 1TB4080120160200SE +/- 2.24, N = 3173.621. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128Samsung SSD 980 PRO 1TB4080120160200SE +/- 0.13, N = 3191.331. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512Samsung SSD 980 PRO 1TB1224364860SE +/- 0.01, N = 353.881. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256Samsung SSD 980 PRO 1TB20406080100SE +/- 0.15, N = 375.741. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128Samsung SSD 980 PRO 1TB20406080100SE +/- 0.82, N = 3104.341. (CXX) g++ options: -O3

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Samsung SSD 980 PRO 1TB6K12K18K24K30KSE +/- 139.81, N = 3282711. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096Samsung SSD 980 PRO 1TB13K26K39K52K65KSE +/- 484.43, N = 9610671. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32Samsung SSD 980 PRO 1TB10K20K30K40K50KSE +/- 525.31, N = 4475981. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32Samsung SSD 980 PRO 1TB4K8K12K16K20KSE +/- 85.83, N = 3176011. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096Samsung SSD 980 PRO 1TB15003000450060007500SE +/- 31.92, N = 37141.51. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096Samsung SSD 980 PRO 1TB2K4K6K8K10KSE +/- 11.11, N = 39825.01. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32Samsung SSD 980 PRO 1TB2K4K6K8K10KSE +/- 115.39, N = 3114821. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32Samsung SSD 980 PRO 1TB2K4K6K8K10KSE +/- 50.60, N = 3112561. (CC) gcc options: -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshSamsung SSD 980 PRO 1TB100200300400500SE +/- 2.78, N = 3484.301. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Laghos

Test: Triple Point Problem

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point ProblemSamsung SSD 980 PRO 1TB60120180240300SE +/- 2.84, N = 3267.501. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

FFTE

Test: N=256, 1D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0Test: N=256, 1D Complex FFT RoutineSamsung SSD 980 PRO 1TB70K140K210K280K350KSE +/- 2521.62, N = 3307873.601. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64Samsung SSD 980 PRO 1TB140280420560700SE +/- 0.54, N = 3628.01. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32Samsung SSD 980 PRO 1TB70140210280350SE +/- 0.07, N = 3309.61. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

libxsmm

M N K: 256

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256Samsung SSD 980 PRO 1TB30060090012001500SE +/- 1.34, N = 31438.61. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

libxsmm

M N K: 128

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128Samsung SSD 980 PRO 1TB2004006008001000SE +/- 8.10, N = 31116.11. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2Samsung SSD 980 PRO 1TB200M400M600M800M1000MSE +/- 401333.33, N = 310014506671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Nebular Empirical Analysis Tool

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2.3Samsung SSD 980 PRO 1TB714212835SE +/- 0.04, N = 327.931. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio

Dolfyn

Computational Fluid Dynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsSamsung SSD 980 PRO 1TB48121620SE +/- 0.03, N = 315.42

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsSamsung SSD 980 PRO 1TB0.07660.15320.22980.30640.383SE +/- 0.00245, N = 30.34055

CP2K Molecular Dynamics

Input: Fayalite-FIST

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2023.1Input: Fayalite-FISTSamsung SSD 980 PRO 1TB2040608010099.451. (F9X) gfortran options: -fopenmp -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kdbm -lcp2kgrid -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -lhdf5 -lhdf5_hl -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

CP2K Molecular Dynamics

Input: H20-64

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2023.1Input: H20-64Samsung SSD 980 PRO 1TB61218243024.941. (F9X) gfortran options: -fopenmp -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kdbm -lcp2kgrid -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -lhdf5 -lhdf5_hl -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -lopenblas -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterSamsung SSD 980 PRO 1TB1.1142.2283.3424.4565.57SE +/- 0.003, N = 34.9511. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverSamsung SSD 980 PRO 1TB1.32322.64643.96965.29286.616SE +/- 0.026, N = 35.8811. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteSamsung SSD 980 PRO 1TB714212835SE +/- 0.16, N = 330.921. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDSamsung SSD 980 PRO 1TB816243240SE +/- 0.30, N = 336.671. (CXX) g++ options: -O2 -lOpenCL

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsSamsung SSD 980 PRO 1TB246810SE +/- 0.02, N = 38.471. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2Samsung SSD 980 PRO 1TB20406080100SE +/- 0.36, N = 3111.031. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2Samsung SSD 980 PRO 1TB6001200180024003000SE +/- 8.88, N = 32775.621. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1Samsung SSD 980 PRO 1TB20406080100SE +/- 0.06, N = 3108.201. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1Samsung SSD 980 PRO 1TB6001200180024003000SE +/- 1.53, N = 32705.091. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallSamsung SSD 980 PRO 1TB5K10K15K20K25KSE +/- 38.34, N = 321660.41. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingSamsung SSD 980 PRO 1TB4080120160200SE +/- 0.85, N = 3197.261. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilSamsung SSD 980 PRO 1TB0.75331.50662.25993.01323.7665SE +/- 0.022641, N = 133.3479071. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPSamsung SSD 980 PRO 1TB0.16240.32480.48720.64960.812SE +/- 0.008966, N = 40.7219691. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMSamsung SSD 980 PRO 1TB48121620SE +/- 0.16, N = 316.991. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASSamsung SSD 980 PRO 1TB400800120016002000SE +/- 14.81, N = 816911. (CXX) g++ options: -flto -pthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthSamsung SSD 980 PRO 1TB6K12K18K24K30KSE +/- 309.71, N = 327044.451. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthSamsung SSD 980 PRO 1TB0.40510.81021.21531.62042.0255SE +/- 0.00539, N = 31.800481. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencySamsung SSD 980 PRO 1TB0.20340.40680.61020.81361.017SE +/- 0.00270, N = 30.904091. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadSamsung SSD 980 PRO 1TB0.40360.80721.21081.61442.018SE +/- 0.00220, N = 31.793681. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransSamsung SSD 980 PRO 1TB48121620SE +/- 0.05, N = 314.531. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteSamsung SSD 980 PRO 1TB714212835SE +/- 0.21, N = 329.051. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLSamsung SSD 980 PRO 1TB20406080100SE +/- 2.45, N = 3108.101. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CSamsung SSD 980 PRO 1TB11K22K33K44K55KSE +/- 258.86, N = 350933.691. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BSamsung SSD 980 PRO 1TB20K40K60K80K100KSE +/- 236.71, N = 393331.701. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CSamsung SSD 980 PRO 1TB12K24K36K48K60KSE +/- 66.98, N = 357204.741. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CSamsung SSD 980 PRO 1TB30K60K90K120K150KSE +/- 379.20, N = 3151979.361. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DSamsung SSD 980 PRO 1TB6001200180024003000SE +/- 20.08, N = 32718.021. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CSamsung SSD 980 PRO 1TB13K26K39K52K65KSE +/- 108.01, N = 362960.511. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DSamsung SSD 980 PRO 1TB14002800420056007000SE +/- 94.18, N = 156568.441. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CSamsung SSD 980 PRO 1TB5K10K15K20K25KSE +/- 261.54, N = 323866.921. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CSamsung SSD 980 PRO 1TB30K60K90K120K150KSE +/- 357.94, N = 3132969.601. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

HPL Linpack

OpenBenchmarking.orgGFLOPS, More Is BetterHPL Linpack 2.3Samsung SSD 980 PRO 1TB20406080100SE +/- 0.04, N = 3103.691. (CC) gcc options: -O2 -lopenblas -lm -lmpi

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 1800

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 160 160 160 - RT: 1800Samsung SSD 980 PRO 1TB510152025SE +/- 0.00, N = 318.901. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 1800

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 1800Samsung SSD 980 PRO 1TB510152025SE +/- 0.01, N = 318.901. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 1800

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 1800Samsung SSD 980 PRO 1TB510152025SE +/- 0.22, N = 318.891. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 160 160 160 - RT: 60Samsung SSD 980 PRO 1TB510152025SE +/- 0.00, N = 318.901. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60Samsung SSD 980 PRO 1TB510152025SE +/- 0.01, N = 318.921. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60Samsung SSD 980 PRO 1TB510152025SE +/- 0.01, N = 319.131. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

IOR

Block Size: 1024MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 1024MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB30060090012001500SE +/- 10.44, N = 31489.06MIN: 1347.42 / MAX: 1565.791. (CC) gcc options: -O2 -lm -lmpi

IOR

Block Size: 512MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 512MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB2004006008001000SE +/- 1.91, N = 31059.04MIN: 985.02 / MAX: 1145.21. (CC) gcc options: -O2 -lm -lmpi

IOR

Block Size: 256MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 256MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB2004006008001000SE +/- 0.79, N = 3907.56MIN: 854.73 / MAX: 1182.561. (CC) gcc options: -O2 -lm -lmpi

IOR

Block Size: 64MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 64MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB2004006008001000SE +/- 0.79, N = 3974.40MIN: 769.69 / MAX: 1622.261. (CC) gcc options: -O2 -lm -lmpi

IOR

Block Size: 32MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 32MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB2004006008001000SE +/- 13.63, N = 121009.38MIN: 480 / MAX: 2073.61. (CC) gcc options: -O2 -lm -lmpi

IOR

Block Size: 16MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 16MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB2004006008001000SE +/- 6.79, N = 3933.08MIN: 504.16 / MAX: 2148.311. (CC) gcc options: -O2 -lm -lmpi

IOR

Block Size: 8MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB2004006008001000SE +/- 5.77, N = 12813.53MIN: 364.4 / MAX: 2104.521. (CC) gcc options: -O2 -lm -lmpi

IOR

Block Size: 2MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB140280420560700SE +/- 7.30, N = 15665.75MIN: 304.4 / MAX: 1664.381. (CC) gcc options: -O2 -lm -lmpi

Whisper.cpp

Model: ggml-medium.en - Input: 2016 State of the Union

OpenBenchmarking.orgSeconds, Fewer Is BetterWhisper.cpp 1.4Model: ggml-medium.en - Input: 2016 State of the UnionSamsung SSD 980 PRO 1TB2004006008001000SE +/- 27.09, N = 9959.471. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread

Whisper.cpp

Model: ggml-small.en - Input: 2016 State of the Union

OpenBenchmarking.orgSeconds, Fewer Is BetterWhisper.cpp 1.4Model: ggml-small.en - Input: 2016 State of the UnionSamsung SSD 980 PRO 1TB100200300400500SE +/- 15.93, N = 9475.981. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread

Whisper.cpp

Model: ggml-base.en - Input: 2016 State of the Union

OpenBenchmarking.orgSeconds, Fewer Is BetterWhisper.cpp 1.4Model: ggml-base.en - Input: 2016 State of the UnionSamsung SSD 980 PRO 1TB50100150200250SE +/- 6.03, N = 12208.251. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread

Faiss

Test: bench_polysemous_sift1m - Polysemous 30

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 30Samsung SSD 980 PRO 1TB0.14830.29660.44490.59320.7415SE +/- 0.001, N = 30.6591. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 34

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 34Samsung SSD 980 PRO 1TB0.14960.29920.44880.59840.748SE +/- 0.001, N = 30.6651. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 38

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 38Samsung SSD 980 PRO 1TB0.15320.30640.45960.61280.766SE +/- 0.001, N = 30.6811. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 42

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 42Samsung SSD 980 PRO 1TB0.16380.32760.49140.65520.819SE +/- 0.000, N = 30.7281. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 46

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 46Samsung SSD 980 PRO 1TB0.19640.39280.58920.78560.982SE +/- 0.000, N = 30.8731. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 50

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 50Samsung SSD 980 PRO 1TB0.27630.55260.82891.10521.3815SE +/- 0.000, N = 31.2281. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 54

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 54Samsung SSD 980 PRO 1TB0.42910.85821.28731.71642.1455SE +/- 0.002, N = 31.9071. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 58

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 58Samsung SSD 980 PRO 1TB0.68151.3632.04452.7263.4075SE +/- 0.003, N = 33.0291. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 62

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 62Samsung SSD 980 PRO 1TB1.10182.20363.30544.40725.509SE +/- 0.003, N = 34.8971. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

Faiss

Test: bench_polysemous_sift1m - Polysemous 64

OpenBenchmarking.orgms per query, Fewer Is BetterFaiss 1.7.4Test: bench_polysemous_sift1m - Polysemous 64Samsung SSD 980 PRO 1TB1.34012.68024.02035.36046.7005SE +/- 0.002, N = 35.9561. (F9X) gfortran options: -O2 -frecursive -m64 -fopenmp -msse3 -mssse3 -msse4.1 -mavx -mavx2 -fno-tree-vectorize -lm -lpthread -lgfortran -lc

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetSamsung SSD 980 PRO 1TB1.33882.67764.01645.35526.694SE +/- 0.25, N = 35.95MIN: 5.44 / MAX: 17.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mSamsung SSD 980 PRO 1TB48121620SE +/- 1.97, N = 317.59MIN: 14.46 / MAX: 161.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefaceSamsung SSD 980 PRO 1TB0.5131.0261.5392.0522.565SE +/- 0.25, N = 32.28MIN: 1.93 / MAX: 3.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2Samsung SSD 980 PRO 1TB1.21732.43463.65194.86926.0865SE +/- 0.55, N = 35.41MIN: 4.73 / MAX: 15.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

OpenBenchmarking.orgAverage usec, Fewer Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 SendrecvSamsung SSD 980 PRO 1TB20406080100SE +/- 1.38, N = 1575.26MIN: 0.36 / MAX: 2923.861. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-MPI1 Sendrecv

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 SendrecvSamsung SSD 980 PRO 1TB11002200330044005500SE +/- 93.14, N = 155133.49MAX: 24149.021. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

OpenBenchmarking.orgAverage usec, Fewer Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 ExchangeSamsung SSD 980 PRO 1TB306090120150SE +/- 3.98, N = 12135.45MIN: 0.72 / MAX: 5787.961. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

Intel MPI Benchmarks

Test: IMB-MPI1 Exchange

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 ExchangeSamsung SSD 980 PRO 1TB2K4K6K8K10KSE +/- 247.12, N = 127897.23MAX: 40197.481. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUSamsung SSD 980 PRO 1TB0.15590.31180.46770.62360.7795SE +/- 0.027689, N = 150.692944MIN: 0.451. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

RELION

Test: Basic - Device: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 4.0.1Test: Basic - Device: CPUSamsung SSD 980 PRO 1TB90180270360450SE +/- 9.25, N = 9413.011. (CXX) g++ options: -fopenmp -std=c++11 -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -ljpeg -lmpi_cxx -lmpi

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessSamsung SSD 980 PRO 1TB0.07970.15940.23910.31880.3985SE +/- 0.01763, N = 30.354271. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMSamsung SSD 980 PRO 1TB714212835SE +/- 1.20, N = 330.001. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.1.2

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CSamsung SSD 980 PRO 1TB13002600390052006500SE +/- 154.00, N = 156255.951. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

IOR

Block Size: 4MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 4MB - Disk Target: Default Test DirectorySamsung SSD 980 PRO 1TB140280420560700SE +/- 13.18, N = 15645.60MIN: 225.55 / MAX: 1839.221. (CC) gcc options: -O2 -lm -lmpi


Phoronix Test Suite v10.8.5