2 x AMD EPYC 9124 16-Core testing with a Supermicro H13DSH (1.5 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.
EPYC 9124 2P
Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e
Java Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)
Python Notes: Python 3.9.16
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
b c d Processor: 2 x AMD EPYC 9124 16-Core @ 3.00GHz (32 Cores / 64 Threads), Motherboard: Supermicro H13DSH (1.5 BIOS), Memory: 24 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKET, Disk: 2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07, Graphics: astdrmfb
OS: AlmaLinux 9.2, Kernel: 5.14.0-284.25.1.el9_2.x86_64 (x86_64), Compiler: GCC 11.3.1 20221121, File-System: ext4, Screen Resolution: 1024x768
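The kernel and processor notes above report "Transparent Huge Pages: always" and the "performance" scaling governor. A minimal sketch of how those two settings could be checked on a Linux host before comparing results follows; the helper names `active_choice` and `read_sysfs` are hypothetical, and the sysfs paths are the standard Linux locations, which may be absent on some systems.

```python
import re
from pathlib import Path
from typing import Optional


def active_choice(sysfs_text: str) -> str:
    """Return the bracketed (active) option, e.g. '[always] madvise never' -> 'always'."""
    match = re.search(r"\[([^\]]+)\]", sysfs_text)
    if match is None:
        raise ValueError(f"no active option found in {sysfs_text!r}")
    return match.group(1)


def read_sysfs(path: str) -> Optional[str]:
    """Read a sysfs file, returning None if it is missing or unreadable."""
    try:
        return Path(path).read_text().strip()
    except OSError:
        return None


if __name__ == "__main__":
    # Standard Linux sysfs locations; both are assumptions about the host being checked.
    thp = read_sysfs("/sys/kernel/mm/transparent_hugepage/enabled")
    governor = read_sysfs("/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor")
    print("Transparent Huge Pages:", active_choice(thp) if thp else "unavailable")
    print("Scaling Governor:", governor or "unavailable")
```

Running the same check on each system under test is one way to confirm that runs being compared share these settings.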
Result Overview (chart; runs: EPYC 9124 2P, b, c, d; relative-performance axis from 100% to 103%). Tests shown: NCNN, Stress-NG, Apache Cassandra, Dragonflydb, SVT-AV1, Remhos, Kripke, Timed Linux Kernel Compilation, SPECFEM3D, nekRS, BRL-CAD, Intel Open Image Denoise, VVenC, Laghos, Neural Magic DeepSparse, Liquid-DSP, Embree, OSPRay, Blender.
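The Result Overview expresses each run as a percentage relative to a baseline. The sketch below shows one plausible way such percentages could be derived: scale every test so the weakest run is 100%, then combine tests with a geometric mean. This is an assumption for illustration, not a description of how the Phoronix Test Suite computes its overview; the function names are hypothetical, and the sample values are the Laghos Triple Point and Remhos results from this document.

```python
import math


def relative_scores(results, higher_is_better):
    """Scale each run so that the weakest run scores 1.0 (100%)."""
    if higher_is_better:
        worst = min(results.values())
        return {run: value / worst for run, value in results.items()}
    # For "fewer is better" metrics, the largest value is the weakest run.
    worst = max(results.values())
    return {run: worst / value for run, value in results.items()}


def geometric_mean(values):
    """Geometric mean of positive numbers."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    tests = [
        # (results per run, higher_is_better)
        ({"EPYC 9124 2P": 196.44, "b": 196.03, "c": 196.34, "d": 195.40}, True),   # Laghos, Triple Point
        ({"EPYC 9124 2P": 20.36, "b": 20.31, "c": 20.77, "d": 20.90}, False),      # Remhos, seconds
    ]
    per_test = [relative_scores(results, hib) for results, hib in tests]
    for run in tests[0][0]:
        overall = geometric_mean(scores[run] for scores in per_test)
        print(f"{run}: {overall * 100:.1f}%")
```

A geometric mean is used here so that no single test with large absolute numbers dominates the combined figure.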
extra tests laghos: Triple Point Problem laghos: Sedov Blast Wave, ube_922_hex.mesh remhos: Sample Remap Example specfem3d: Mount St. Helens specfem3d: Layered Halfspace specfem3d: Tomographic Model specfem3d: Homogeneous Halfspace specfem3d: Water-layered Halfspace nekrs: Kershaw nekrs: TurboPipe Periodic embree: Pathtracer - Crown embree: Pathtracer ISPC - Crown embree: Pathtracer - Asian Dragon embree: Pathtracer - Asian Dragon Obj embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Asian Dragon Obj svt-av1: Preset 4 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K svt-av1: Preset 12 - Bosphorus 4K svt-av1: Preset 13 - Bosphorus 4K svt-av1: Preset 4 - Bosphorus 1080p svt-av1: Preset 8 - Bosphorus 1080p svt-av1: Preset 12 - Bosphorus 1080p svt-av1: Preset 13 - Bosphorus 1080p vvenc: Bosphorus 4K - Fast vvenc: Bosphorus 4K - Faster vvenc: Bosphorus 1080p - Fast vvenc: Bosphorus 1080p - Faster oidn: RT.hdr_alb_nrm.3840x2160 - CPU-Only oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only oidn: RTLightmap.hdr.4096x4096 - CPU-Only ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time ospray: particle_volume/pathtracer/real_time ospray: gravity_spheres_volume/dim_512/ao/real_time ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray: gravity_spheres_volume/dim_512/pathtracer/real_time build-linux-kernel: defconfig liquid-dsp: 1 - 256 - 32 liquid-dsp: 1 - 256 - 57 liquid-dsp: 2 - 256 - 32 liquid-dsp: 2 - 256 - 57 liquid-dsp: 4 - 256 - 32 liquid-dsp: 4 - 256 - 57 liquid-dsp: 8 - 256 - 32 liquid-dsp: 8 - 256 - 57 liquid-dsp: 1 - 256 - 512 liquid-dsp: 16 - 256 - 32 liquid-dsp: 16 - 256 - 57 liquid-dsp: 2 - 256 - 512 liquid-dsp: 32 - 256 - 32 liquid-dsp: 32 - 256 - 57 liquid-dsp: 4 - 256 - 512 liquid-dsp: 64 - 256 - 32 liquid-dsp: 64 - 256 - 57 liquid-dsp: 8 - 256 - 512 liquid-dsp: 16 - 256 - 512 liquid-dsp: 32 - 256 - 512 liquid-dsp: 64 - 256 - 512 dragonflydb: 10 - 1:10 dragonflydb: 20 - 1:10 dragonflydb: 50 - 1:10 dragonflydb: 10 - 1:100 
dragonflydb: 20 - 1:100 dragonflydb: 50 - 1:100 deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Stream deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Stream deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Stream deepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream deepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream deepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Stream deepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Synchronous Single-Stream deepsparse: ResNet-50, Baseline - Synchronous Single-Stream deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: ResNet-50, 
Sparse INT8 - Asynchronous Multi-Stream deepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Stream deepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Stream deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Stream deepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Stream deepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Stream deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream deepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream deepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - 
Synchronous Single-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Stream deepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream deepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream deepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Stream deepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Stream deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream deepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream deepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream stress-ng: Hash stress-ng: MMAP stress-ng: NUMA stress-ng: Pipe stress-ng: Poll stress-ng: Zlib stress-ng: Futex stress-ng: MEMFD stress-ng: Mutex stress-ng: Atomic stress-ng: Crypto stress-ng: Malloc stress-ng: Cloning stress-ng: Forking stress-ng: AVL Tree stress-ng: SENDFILE stress-ng: CPU Cache stress-ng: CPU Stress stress-ng: Semaphores stress-ng: Matrix Math stress-ng: Vector Math stress-ng: AVX-512 VNNI stress-ng: Function Call stress-ng: x86_64 RdRand stress-ng: Floating Point stress-ng: Matrix 3D Math stress-ng: Memory Copying stress-ng: Vector Shuffle stress-ng: Mixed Scheduler stress-ng: Socket Activity stress-ng: Wide Vector Math stress-ng: Context Switching stress-ng: Fused Multiply-Add stress-ng: Vector Floating Point stress-ng: Glibc C String Functions stress-ng: Glibc Qsort Data Sorting stress-ng: System V Message Passing ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - 
mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m ncnn: CPU - vision_transformer ncnn: CPU - FastestDet blender: BMW27 - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only cassandra: Writes kripke: brl-cad: VGR Performance Metric stress-ng: Pthread EPYC 9124 2P b c d 196.44 264.82 20.356 14.713591567 38.09607639 14.949147803 18.924698419 34.649147643 11469600000 7410850000 37.9663 39.1194 42.6362 38.3359 48.5104 40.8216 5.068 70.774 188.696 199.364 15.154 139.31 443.528 522.521 6.763 12.755 18.942 33.655 1.36 1.37 0.65 10.8317 10.8472 177.273 10.1003 9.87682 11.9287 35.17 35288000 52850000 68824000 105650000 136890000 178110000 275810000 338790000 12637000 550610000 608470000 24680000 1081000000 1205600000 49627000 2056200000 1809000000 98607000 197790000 388330000 501070000 12095393.84 13016172.38 16386566.17 11365230.14 15292696.2 15294187.64 19.7944 793.7775 14.0918 70.9558 754.6362 21.1701 204.8144 4.8772 356.3287 44.8589 138.6214 7.2056 106.7864 149.549 48.3168 20.6865 258.6675 61.7898 140.8153 7.0918 2673.221 5.9715 904.4208 1.1021 113.9236 140.1403 67.6903 14.7603 24.8554 640.625 17.0654 58.5883 257.7738 61.985 140.6834 7.0989 115.0024 138.8209 68.2224 14.65 168.7591 94.6839 98.9576 10.0975 36.2719 438.1881 24.5385 40.7322 374.0267 42.7185 91.5346 10.9112 85.6076 186.5994 51.7581 19.3126 19.9271 794.188 14.1161 70.8325 7217477.2 1142.01 8.89 20487917.54 4334100.75 2934.24 4322676.82 920.9 438082.3 235.35 79760.82 136350929.27 1173.14 1008.85 410.95 852899.85 774009.01 89602.51 81543324.92 173872.46 234249.81 3675272.51 27757.95 12012881.51 11911.03 11756.35 13430.33 24987.46 35532.56 9605.29 1539140.95 
18252120.52 32101622.54 107730.4 41243730.27 917.08 6868467.37 22.98 11.63 11.27 14.59 10.89 15.65 5.8 29.49 28.31 13.53 8.06 26.29 35.63 24.9 33.75 57.23 16.62 37.66 94.82 49.25 373.93 122.49 222075 376258700 545952 196.03 264.90 20.310 14.676186076 38.391611888 15.069091794 19.239359346 36.437858763 11495900000 7417740000 37.7133 39.3104 42.6999 38.2082 48.5427 40.8127 5.092 71.849 194.503 199.277 14.992 139.402 457.785 545.787 6.816 12.705 19.029 34.195 1.36 1.36 0.64 10.8613 10.8482 176.982 10.1463 9.82878 11.8874 34.369 35213667 52906667 68813667 102062867 137223333 174952667 273636667 331220000 12528333 549303333 638966667 25040333 1078866667 1203966667 49195333 2049533333 1807366667 98769000 198006667 388873333 503286667 11029528.70 14116849.90 16274465.59 11311943.10 14344471.84 15714147.26 19.7988 793.8161 13.9971 71.4366 757.1128 21.1000 202.8945 4.9229 355.5777 44.9310 139.7381 7.1484 106.6840 149.7098 48.5739 20.5774 257.8867 61.9786 141.0785 7.0789 2670.2118 5.9740 913.3288 1.0913 113.5859 140.5312 67.7272 14.7528 24.7941 641.2395 17.0275 58.7203 257.6865 62.0231 140.7348 7.0963 114.4758 139.4428 68.2516 14.6439 168.7549 94.6615 99.2795 10.0652 36.2924 437.9078 24.5286 40.7479 374.8172 42.6342 91.2602 10.9447 85.3011 186.9272 52.2749 19.1220 19.8545 793.8167 14.0037 71.4022 7218980.87 1143.51 17.65 21069388.89 4331008.99 2940.06 4311036.05 913.40 437829.05 234.81 79687.85 137065291.04 1143.64 1011.36 413.89 859483.20 763301.01 89306.28 78539814.90 174068.70 234493.07 3671132.65 27681.20 12013496.37 11937.72 11593.68 13421.50 25048.79 35867.33 3050.45 1542414.02 18094107.24 32117609.18 107171.34 41635360.27 917.31 6872106.48 23.67 11.38 11.90 15.32 10.89 15.65 5.82 29.63 28.48 13.66 8.22 26.63 35.64 25.13 34.75 58.52 17.64 37.85 95.10 49.21 373.06 122.37 226790 374033600 539832 70031.30 196.34 265.03 20.766 14.797486366 38.368702202 15.270741613 19.388546348 35.298387476 11206400000 7460290000 37.4057 39.4809 42.8557 38.2233 48.4663 40.744 5.071 71.426 
170.95 196.679 15.299 136.778 426.754 539.106 6.888 12.71 19.083 33.685 1.37 1.37 0.65 10.8635 10.8212 177.244 10.0585 9.85806 11.9023 35.282 35183000 52837000 68863000 105450000 137280000 178700000 274970000 334440000 12666000 547680000 634300000 25070000 1080000000 1180800000 49997000 2045000000 1832000000 99528000 197330000 386130000 501110000 12032750.26 14637785.39 16263996.16 11871319.97 13357787.11 14906385.78 19.8333 792.2274 14.0319 71.259 753.7418 21.191 204.5303 4.8831 356.9094 44.7637 140.8233 7.0929 106.9582 149.2727 48.4368 20.6351 257.7484 62.0239 140.3737 7.1143 2675.0085 5.9629 899.5957 1.1081 113.366 140.8798 67.7514 14.7471 24.7677 640.0875 17.0439 58.6622 258.2072 61.9005 140.12 7.1274 114.5294 139.3405 68.1941 14.6561 168.8102 94.632 99.3749 10.0557 36.3468 436.7701 24.6 40.6301 372.9577 42.8515 92.5062 10.7975 85.4452 186.8214 52.3239 19.1043 19.9271 794.0791 14.028 71.2779 7218080.7 1128.46 18.45 20689455.39 4335334.87 2940.27 3946891.51 920.29 436953.86 235.56 79781.66 137440336.06 1168.88 1008.05 407.35 857316.41 786282.11 88082.89 94057409.46 173848.98 234217.54 3674818.59 27539.19 12011361.11 11904.73 11673.11 13405.8 25080.33 35991.41 9611.34 1541832.29 18545533.36 32144757.8 106411.74 39897113.46 920.53 6858428.66 23.5 12.04 11.98 15.38 10.75 15.18 6.15 29.75 28.56 13.6 8.14 26.84 35.5 24.93 36.8 58.14 19.87 37.75 95.02 49.26 373.55 123 228846 365660900 542621 195.40 263.85 20.904 14.505645253 38.489779649 14.877710346 18.853828876 34.822227482 11520900000 7470490000 37.677 39.3841 42.7641 37.9876 48.4746 40.5726 5.053 72.353 199.523 195.829 14.974 136.071 419.853 545.195 6.768 12.678 19.166 33.572 1.36 1.37 0.65 10.8634 10.8453 176.614 10.0774 9.81318 11.9002 35.283 35195000 52765000 68834000 105630000 136500000 174280000 271730000 331200000 12652000 549390000 650410000 24900000 1073800000 1234500000 49884000 2045600000 1778600000 96449000 195510000 387460000 504660000 11686474.26 15179164.86 15253621.53 11860591.65 14250093.71 
17583489.38 19.81 795.852 13.9192 71.8362 757.477 21.0894 202.5019 4.9317 355.8877 44.9148 139.7934 7.1456 106.1467 150.276 48.2512 20.714 257.4237 62.1006 140.4568 7.1102 2661.2255 5.9947 890.9448 1.1189 113.6578 140.5767 67.7657 14.7454 24.7153 641.6051 16.9961 58.8279 258.5599 61.8053 140.2014 7.1228 114.282 139.6127 68.1621 14.6632 168.7247 94.6197 99.1222 10.0813 36.3849 436.9749 24.536 40.7345 373.0279 42.8337 90.0756 11.0868 85.4854 186.6482 50.9089 19.6356 19.8763 794.0387 14.0139 71.35 7218607.12 1131.55 18.35 20249488.61 4337787.72 2944.54 4022496.07 912.62 438280.16 236.99 107643.49 137476652.93 1170.31 1007.46 410.89 857324.89 776708.72 88466.54 92019395.13 173878.7 234239.6 3667964.15 27731.16 12013602.06 11939.41 10869.49 13428.64 25006.73 35640.36 9504.38 1543291.91 17931974.92 32151217.03 106715.85 40991307.95 916.74 6851573.2 23.55 11.29 11.64 13.96 10.7 14.76 5.63 29.03 28.69 13.65 8.13 26.53 35.75 24.22 33.5 55.05 16 37.93 94.88 48.95 372.72 122.7 230402 372072000 544648 70043.78 OpenBenchmarking.org
Laghos
Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.

Laghos 3.1 - Test: Triple Point Problem (Major Kernels Total Rate, More Is Better)
d: 195.40, c: 196.34, b: 196.03, EPYC 9124 2P: 196.44 (SE +/- 0.76, N = 3)

Laghos 3.1 - Test: Sedov Blast Wave, ube_922_hex.mesh (Major Kernels Total Rate, More Is Better)
d: 263.85, c: 265.03, b: 264.90, EPYC 9124 2P: 264.82 (SE +/- 0.70, N = 3)

1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Remhos
Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

Remhos 1.0 - Test: Sample Remap Example (Seconds, Fewer Is Better)
d: 20.90, c: 20.77, b: 20.31, EPYC 9124 2P: 20.36 (SE +/- 0.20, N = 3)

1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
SPECFEM3D
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic, or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0 - Model: Mount St. Helens (Seconds, Fewer Is Better)
d: 14.51, c: 14.80, b: 14.68, EPYC 9124 2P: 14.71 (SE +/- 0.06, N = 3)

SPECFEM3D 4.0 - Model: Layered Halfspace (Seconds, Fewer Is Better)
d: 38.49, c: 38.37, b: 38.39, EPYC 9124 2P: 38.10 (SE +/- 0.26, N = 3)

SPECFEM3D 4.0 - Model: Tomographic Model (Seconds, Fewer Is Better)
d: 14.88, c: 15.27, b: 15.07, EPYC 9124 2P: 14.95 (SE +/- 0.10, N = 3)

SPECFEM3D 4.0 - Model: Homogeneous Halfspace (Seconds, Fewer Is Better)
d: 18.85, c: 19.39, b: 19.24, EPYC 9124 2P: 18.92 (SE +/- 0.18, N = 7)

SPECFEM3D 4.0 - Model: Water-layered Halfspace (Seconds, Fewer Is Better)
d: 34.82, c: 35.30, b: 36.44, EPYC 9124 2P: 34.65 (SE +/- 0.39, N = 5)

1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi
nekRS
nekRS is an open-source Navier-Stokes solver based on the spectral element method. nekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. nekRS is part of Nek5000 from the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and may otherwise be very time-consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

nekRS 23.0 - Input: Kershaw (flops/rank, More Is Better)
d: 11520900000, c: 11206400000, b: 11495900000, EPYC 9124 2P: 11469600000 (SE +/- 49802643.84, N = 3)

nekRS 23.0 - Input: TurboPipe Periodic (flops/rank, More Is Better)
d: 7470490000, c: 7460290000, b: 7417740000, EPYC 9124 2P: 7410850000 (SE +/- 22116600.85, N = 3)

1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
Embree
Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 4.1 - Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
d: 37.68 (MIN: 37.26 / MAX: 39.44), c: 37.41 (MIN: 36.79 / MAX: 38.87), b: 37.71 (MIN: 37.2 / MAX: 38.96), EPYC 9124 2P: 37.97 (MIN: 37.52 / MAX: 39.47); SE +/- 0.01, N = 3

Embree 4.1 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
d: 39.38 (MIN: 38.86 / MAX: 40.97), c: 39.48 (MIN: 39 / MAX: 40.94), b: 39.31 (MIN: 38.66 / MAX: 40.8), EPYC 9124 2P: 39.12 (MIN: 38.49 / MAX: 40.34); SE +/- 0.04, N = 3

Embree 4.1 - Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
d: 42.76 (MIN: 42.48 / MAX: 43.18), c: 42.86 (MIN: 42.63 / MAX: 43.37), b: 42.70 (MIN: 42.4 / MAX: 43.23), EPYC 9124 2P: 42.64 (MIN: 42.35 / MAX: 43.05); SE +/- 0.04, N = 3

Embree 4.1 - Binary: Pathtracer - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
d: 37.99 (MIN: 37.71 / MAX: 38.4), c: 38.22 (MIN: 37.93 / MAX: 38.58), b: 38.21 (MIN: 37.93 / MAX: 38.75), EPYC 9124 2P: 38.34 (MIN: 37.99 / MAX: 38.77); SE +/- 0.02, N = 3

Embree 4.1 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
d: 48.47 (MIN: 48.1 / MAX: 49.28), c: 48.47 (MIN: 48.15 / MAX: 49.27), b: 48.54 (MIN: 48.1 / MAX: 49.38), EPYC 9124 2P: 48.51 (MIN: 48.2 / MAX: 49.13); SE +/- 0.05, N = 3

Embree 4.1 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
d: 40.57 (MIN: 40.28 / MAX: 41.19), c: 40.74 (MIN: 40.34 / MAX: 41.4), b: 40.81 (MIN: 40.43 / MAX: 41.85), EPYC 9124 2P: 40.82 (MIN: 40.51 / MAX: 41.28); SE +/- 0.04, N = 3
SVT-AV1
This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based, multi-threaded video encoder for the AV1 video format; this test profile encodes a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 1.6 - Encoder Mode: Preset 4 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
d: 5.053, c: 5.071, b: 5.092, EPYC 9124 2P: 5.068 (SE +/- 0.021, N = 3)

SVT-AV1 1.6 - Encoder Mode: Preset 8 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
d: 72.35, c: 71.43, b: 71.85, EPYC 9124 2P: 70.77 (SE +/- 0.21, N = 3)

SVT-AV1 1.6 - Encoder Mode: Preset 12 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
d: 199.52, c: 170.95, b: 194.50, EPYC 9124 2P: 188.70 (SE +/- 1.55, N = 3)

1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
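In the Preset 12 / Bosphorus 4K result, run c (170.95) sits well below the other three runs, while the reported standard error is only 1.55. A rough sketch of how such gaps could be screened follows. It measures each run's distance from the median in units of the reported SE; the function names and the threshold of 3 are assumptions for illustration, and since the source reports a single SE figure without stating which run it belongs to, the result is a yardstick rather than a significance test.

```python
from statistics import median


def deviations_in_se(results, standard_error):
    """Distance of each run from the median of all runs, in units of the reported SE."""
    center = median(results.values())
    return {run: (value - center) / standard_error for run, value in results.items()}


def flag_outliers(results, standard_error, threshold=3.0):
    """Runs whose distance from the median exceeds `threshold` standard errors."""
    deviations = deviations_in_se(results, standard_error)
    return [run for run, z in deviations.items() if abs(z) > threshold]


if __name__ == "__main__":
    # SVT-AV1 1.6, Preset 12, Bosphorus 4K (frames per second), values from this document.
    preset12_4k = {"d": 199.52, "c": 170.95, "b": 194.50, "EPYC 9124 2P": 188.70}
    for run, z in deviations_in_se(preset12_4k, 1.55).items():
        print(f"{run}: {z:+.1f} SE from the median")
    print("flagged:", flag_outliers(preset12_4k, 1.55))
```

Runs flagged this way are candidates for a rerun before drawing conclusions from the difference.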
SVT-AV1 1.6 - Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
d: 195.83, c: 196.68, b: 199.28, EPYC 9124 2P: 199.36 (SE +/- 1.95, N = 3)

SVT-AV1 1.6 - Encoder Mode: Preset 4 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
d: 14.97, c: 15.30, b: 14.99, EPYC 9124 2P: 15.15 (SE +/- 0.08, N = 3)

SVT-AV1 1.6 - Encoder Mode: Preset 8 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
d: 136.07, c: 136.78, b: 139.40, EPYC 9124 2P: 139.31 (SE +/- 0.50, N = 3)

SVT-AV1 1.6 - Encoder Mode: Preset 12 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
d: 419.85, c: 426.75, b: 457.79, EPYC 9124 2P: 443.53 (SE +/- 5.17, N = 3)

SVT-AV1 1.6 - Encoder Mode: Preset 13 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
d: 545.20, c: 539.11, b: 545.79, EPYC 9124 2P: 522.52 (SE +/- 4.09, N = 15)

1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
VVenC
VVenC is the Fraunhofer Versatile Video Encoder, a fast and efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

VVenC 1.9 - Video Input: Bosphorus 4K - Video Preset: Fast (Frames Per Second, More Is Better)
d: 6.768, c: 6.888, b: 6.816, EPYC 9124 2P: 6.763 (SE +/- 0.051, N = 3)

VVenC 1.9 - Video Input: Bosphorus 4K - Video Preset: Faster (Frames Per Second, More Is Better)
d: 12.68, c: 12.71, b: 12.71, EPYC 9124 2P: 12.76 (SE +/- 0.01, N = 3)

VVenC 1.9 - Video Input: Bosphorus 1080p - Video Preset: Fast (Frames Per Second, More Is Better)
d: 19.17, c: 19.08, b: 19.03, EPYC 9124 2P: 18.94 (SE +/- 0.03, N = 3)

VVenC 1.9 - Video Input: Bosphorus 1080p - Video Preset: Faster (Frames Per Second, More Is Better)
d: 33.57, c: 33.69, b: 34.20, EPYC 9124 2P: 33.66 (SE +/- 0.11, N = 3)

1. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto
OSPRay
Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay 2.12 - Benchmark: particle_volume/ao/real_time (Items Per Second, More Is Better)
d: 10.86, c: 10.86, b: 10.86, EPYC 9124 2P: 10.83 (SE +/- 0.00, N = 3)

OSPRay 2.12 - Benchmark: particle_volume/scivis/real_time (Items Per Second, More Is Better)
d: 10.85, c: 10.82, b: 10.85, EPYC 9124 2P: 10.85 (SE +/- 0.01, N = 3)

OSPRay 2.12 - Benchmark: particle_volume/pathtracer/real_time (Items Per Second, More Is Better)
d: 176.61, c: 177.24, b: 176.98, EPYC 9124 2P: 177.27 (SE +/- 0.36, N = 3)

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/ao/real_time (Items Per Second, More Is Better)
d: 10.08, c: 10.06, b: 10.15, EPYC 9124 2P: 10.10 (SE +/- 0.01, N = 3)

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/scivis/real_time (Items Per Second, More Is Better)
d: 9.81318, c: 9.85806, b: 9.82878, EPYC 9124 2P: 9.87682 (SE +/- 0.01904, N = 3)

OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time (Items Per Second, More Is Better)
d: 11.90, c: 11.90, b: 11.89, EPYC 9124 2P: 11.93 (SE +/- 0.00, N = 3)
Timed Linux Kernel Compilation - Build: allmodconfig
EPYC 9124 2P: The test quit with a non-zero exit status.
b: The test quit with a non-zero exit status.
c: The test quit with a non-zero exit status.
d: The test quit with a non-zero exit status.
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
Liquid-DSP 1.6 (samples/s, More Is Better). Values per line: d / c / b / EPYC 9124 2P.
Threads: 1 - Buffer Length: 256 - Filter Length: 32: 35195000 / 35183000 / 35213667 / 35288000 (SE +/- 26660.42, N = 3)
Threads: 1 - Buffer Length: 256 - Filter Length: 57: 52765000 / 52837000 / 52906667 / 52850000 (SE +/- 24037.01, N = 3)
Threads: 2 - Buffer Length: 256 - Filter Length: 32: 68834000 / 68863000 / 68813667 / 68824000 (SE +/- 21231.53, N = 3)
Threads: 2 - Buffer Length: 256 - Filter Length: 57: 105630000 / 105450000 / 102062867 / 105650000 (SE +/- 2143861.14, N = 15)
Threads: 4 - Buffer Length: 256 - Filter Length: 32: 136500000 / 137280000 / 137223333 / 136890000 (SE +/- 301182.85, N = 3)
Threads: 4 - Buffer Length: 256 - Filter Length: 57: 174280000 / 178700000 / 174952667 / 178110000 (SE +/- 2041803.93, N = 15)
Threads: 8 - Buffer Length: 256 - Filter Length: 32: 271730000 / 274970000 / 273636667 / 275810000 (SE +/- 1505958.54, N = 3)
Threads: 8 - Buffer Length: 256 - Filter Length: 57: 331200000 / 334440000 / 331220000 / 338790000 (SE +/- 2830995.11, N = 3)
Threads: 1 - Buffer Length: 256 - Filter Length: 512: 12652000 / 12666000 / 12528333 / 12637000 (SE +/- 143667.83, N = 3)
Threads: 16 - Buffer Length: 256 - Filter Length: 32: 549390000 / 547680000 / 549303333 / 550610000 (SE +/- 1392172.56, N = 3)
Threads: 16 - Buffer Length: 256 - Filter Length: 57: 650410000 / 634300000 / 638966667 / 608470000 (SE +/- 2539254.05, N = 3)
Threads: 2 - Buffer Length: 256 - Filter Length: 512: 24900000 / 25070000 / 25040333 / 24680000 (SE +/- 138974.02, N = 3)
Threads: 32 - Buffer Length: 256 - Filter Length: 32: 1073800000 / 1080000000 / 1078866667 / 1081000000 (SE +/- 1146492.23, N = 3)
Threads: 32 - Buffer Length: 256 - Filter Length: 57: 1234500000 / 1180800000 / 1203966667 / 1205600000 (SE +/- 10114401.17, N = 3)
Threads: 4 - Buffer Length: 256 - Filter Length: 512: 49884000 / 49997000 / 49195333 / 49627000 (SE +/- 220678.75, N = 3)
Threads: 64 - Buffer Length: 256 - Filter Length: 32: 2045600000 / 2045000000 / 2049533333 / 2056200000 (SE +/- 1319511.69, N = 3)
Threads: 64 - Buffer Length: 256 - Filter Length: 57: 1778600000 / 1832000000 / 1807366667 / 1809000000 (SE +/- 7859884.08, N = 3)
Threads: 8 - Buffer Length: 256 - Filter Length: 512: 96449000 / 99528000 / 98769000 / 98607000 (SE +/- 197811.36, N = 3)
Threads: 16 - Buffer Length: 256 - Filter Length: 512: 195510000 / 197330000 / 198006667 / 197790000 (SE +/- 401718.53, N = 3)
Threads: 32 - Buffer Length: 256 - Filter Length: 512: 387460000 / 386130000 / 388873333 / 388330000 (SE +/- 468946.81, N = 3)
Threads: 64 - Buffer Length: 256 - Filter Length: 512: 504660000 / 501110000 / 503286667 / 501070000 (SE +/- 219266.45, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
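The Liquid-DSP figures are easier to read as scaling efficiency than as raw samples/s. The following is a minimal Python sketch, not part of the Phoronix Test Suite output: it takes the first result listed for each thread count at Buffer Length 256 and Filter Length 32, and divides each by the thread count times the single-thread result. The helper name `scaling_efficiency` is hypothetical.

```python
# Thread-scaling efficiency for Liquid-DSP 1.6, Buffer Length 256, Filter Length 32.
# Values (samples/s) are the first result listed for each thread count in the results above.
throughput = {
    1: 35_195_000,
    2: 68_834_000,
    4: 136_500_000,
    8: 271_730_000,
    16: 549_390_000,
    32: 1_073_800_000,
    64: 2_045_600_000,
}


def scaling_efficiency(results):
    """Return {threads: throughput / (threads * single-thread throughput)}."""
    base = results[1]
    return {threads: value / (threads * base) for threads, value in results.items()}


efficiency = scaling_efficiency(throughput)
for threads, eff in efficiency.items():
    print(f"{threads:>2} threads: {eff:.3f}")
```

An efficiency of 1.0 would mean perfectly linear scaling. The system has 32 cores / 64 threads, so the 64-thread entry relies on SMT rather than on additional physical cores.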
Dragonflydb Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.
Dragonflydb 1.6.2 (Ops/sec, More Is Better). Values per line: d / c / b / EPYC 9124 2P.
Clients Per Thread: 10 - Set To Get Ratio: 1:10: 11686474.26 / 12032750.26 / 11029528.70 / 12095393.84 (SE +/- 28474.68, N = 3)
Clients Per Thread: 20 - Set To Get Ratio: 1:10: 15179164.86 / 14637785.39 / 14116849.90 / 13016172.38 (SE +/- 184904.56, N = 15)
Clients Per Thread: 50 - Set To Get Ratio: 1:10: 15253621.53 / 16263996.16 / 16274465.59 / 16386566.17 (SE +/- 323344.22, N = 15)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Clients Per Thread: 60 - Set To Get Ratio: 1:10
EPYC 9124 2P: The test run did not produce a result. E: Connection error: Connection reset by peer
b: The test run did not produce a result. E: Connection error: Connection reset by peer
c: The test run did not produce a result. E: Connection error: Connection reset by peer
d: The test run did not produce a result. E: Connection error: Connection reset by peer
Dragonflydb 1.6.2 (Ops/sec, More Is Better). Values per line: d / c / b / EPYC 9124 2P.
Clients Per Thread: 10 - Set To Get Ratio: 1:100: 11860591.65 / 11871319.97 / 11311943.10 / 11365230.14 (SE +/- 109712.47, N = 3)
Clients Per Thread: 20 - Set To Get Ratio: 1:100: 14250093.71 / 13357787.11 / 14344471.84 / 15292696.20 (SE +/- 164469.07, N = 15)
Clients Per Thread: 50 - Set To Get Ratio: 1:100: 17583489.38 / 14906385.78 / 15714147.26 / 15294187.64 (SE +/- 219976.25, N = 15)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Clients Per Thread: 60 - Set To Get Ratio: 1:100
EPYC 9124 2P: The test run did not produce a result. E: Connection error: Connection reset by peer
b: The test run did not produce a result. E: Connection error: Connection reset by peer
c: The test run did not produce a result. E: Connection error: Connection reset by peer
d: The test run did not produce a result. E: Connection error: Connection reset by peer
Neural Magic DeepSparse
Neural Magic DeepSparse 1.5. Values per line: d / c / b / EPYC 9124 2P.
Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better): 19.81 / 19.83 / 19.80 / 19.79 (SE +/- 0.06, N = 3)
Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better): 795.85 / 792.23 / 793.82 / 793.78 (SE +/- 0.18, N = 3)
Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better): 71.84 / 71.26 / 71.44 / 70.96 (SE +/- 0.25, N = 3)
Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better): 757.48 / 753.74 / 757.11 / 754.64 (SE +/- 1.99, N = 3)
Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better): 21.09 / 21.19 / 21.10 / 21.17 (SE +/- 0.06, N = 3)
Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream (items/sec, More Is Better): 202.50 / 204.53 / 202.89 / 204.81 (SE +/- 0.98, N = 3)
Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better): 4.9317 / 4.8831 / 4.9229 / 4.8772 (SE +/- 0.0237, N = 3)
Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better): 355.89 / 356.91 / 355.58 / 356.33 (SE +/- 0.15, N = 3)
Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better): 44.91 / 44.76 / 44.93 / 44.86 (SE +/- 0.01, N = 3)
Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Stream (items/sec, More Is Better): 139.79 / 140.82 / 139.74 / 138.62 (SE +/- 0.26, N = 3)
Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better): 7.1456 / 7.0929 / 7.1484 / 7.2056 (SE +/- 0.0129, N = 3)
Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better): 106.15 / 106.96 / 106.68 / 106.79 (SE +/- 0.09, N = 3)
Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better): 150.28 / 149.27 / 149.71 / 149.55 (SE +/- 0.18, N = 3)
Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream (items/sec, More Is Better): 48.25 / 48.44 / 48.57 / 48.32 (SE +/- 0.15, N = 3)
Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better): 20.71 / 20.64 / 20.58 / 20.69 (SE +/- 0.06, N = 3)
Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better): 373.03 / 372.96 / 374.82 / 374.03 (SE +/- 0.27, N = 3)
Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better): 794.04 / 794.08 / 793.82 / 794.19 (SE +/- 0.67, N = 3)
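For the Synchronous Single-Stream scenario, the items/sec and ms/batch figures appear to be two views of the same measurement. The Python sketch below checks this on the first value listed for each model, assuming a batch size of 1 (an assumption; the results do not state the batch size). The function name `implied_items_per_sec` is hypothetical.

```python
# (items/sec, ms/batch) pairs for the Synchronous Single-Stream scenario,
# taking the first value listed on each DeepSparse result line above.
sync_results = {
    "BERT base uncased SST2, Sparse INT8": (202.50, 4.9317),
    "80% Pruned Quantized BERT Base Uncased": (139.79, 7.1456),
    "BERT base uncased SQuaD 12layer Pruned90": (48.25, 20.71),
}


def implied_items_per_sec(ms_per_batch, batch_size=1):
    """Throughput implied by a per-batch latency; batch_size=1 is an assumption."""
    return batch_size * 1000.0 / ms_per_batch


relative_gaps = {}
for model, (items_per_sec, ms_per_batch) in sync_results.items():
    implied = implied_items_per_sec(ms_per_batch)
    relative_gaps[model] = abs(implied - items_per_sec) / items_per_sec
    print(f"{model}: reported {items_per_sec:.2f}, implied {implied:.2f}")
```

The same check does not apply to the Asynchronous Multi-Stream results, where several requests are in flight at once and throughput is no longer the reciprocal of a single request's latency.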
Stress-NG 0.16.04 (Bogo Ops/s, More Is Better). Values per line: d / c / b / EPYC 9124 2P.
Test: MMAP: 1131.55 / 1128.46 / 1143.51 / 1142.01 (SE +/- 10.51, N = 3)
Test: Pipe: 20249488.61 / 20689455.39 / 21069388.89 / 20487917.54 (SE +/- 263402.50, N = 15)
Test: Poll: 4337787.72 / 4335334.87 / 4331008.99 / 4334100.75 (SE +/- 1071.05, N = 3)
Test: Zlib: 2944.54 / 2940.27 / 2940.06 / 2934.24 (SE +/- 1.86, N = 3)
Test: Futex: 4022496.07 / 3946891.51 / 4311036.05 / 4322676.82 (SE +/- 13871.52, N = 3)
Test: MEMFD: 912.62 / 920.29 / 913.40 / 920.90 (SE +/- 2.16, N = 3)
Test: Mutex: 438280.16 / 436953.86 / 437829.05 / 438082.30 (SE +/- 421.09, N = 3)
Test: Atomic: 236.99 / 235.56 / 234.81 / 235.35 (SE +/- 0.55, N = 3)
Test: Crypto: 107643.49 / 79781.66 / 79687.85 / 79760.82 (SE +/- 23.57, N = 3)
Test: Malloc: 137476652.93 / 137440336.06 / 137065291.04 / 136350929.27 (SE +/- 86125.58, N = 3)
Test: Cloning: 1170.31 / 1168.88 / 1143.64 / 1173.14 (SE +/- 10.16, N = 3)
Test: Forking: 1007.46 / 1008.05 / 1011.36 / 1008.85 (SE +/- 5.44, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc
Test: Pthread
EPYC 9124 2P: The test quit with a non-zero exit status.
c: The test quit with a non-zero exit status.
Stress-NG 0.16.04 (Bogo Ops/s, More Is Better). Values: d / c / b / EPYC 9124 2P.
Test: AVL Tree: 410.89 / 407.35 / 413.89 / 410.95 (SE +/- 5.37, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc
Test: IO_uring
EPYC 9124 2P: The test run did not produce a result.
b: The test run did not produce a result.
c: The test run did not produce a result.
d: The test run did not produce a result.
Stress-NG 0.16.04 (Bogo Ops/s, More Is Better). Values per line: d / c / b / EPYC 9124 2P.
Test: SENDFILE: 857324.89 / 857316.41 / 859483.20 / 852899.85 (SE +/- 459.45, N = 3)
Test: CPU Cache: 776708.72 / 786282.11 / 763301.01 / 774009.01 (SE +/- 13475.92, N = 15)
Test: CPU Stress: 88466.54 / 88082.89 / 89306.28 / 89602.51 (SE +/- 346.97, N = 3)
Test: Semaphores: 92019395.13 / 94057409.46 / 78539814.90 / 81543324.92 (SE +/- 972856.35, N = 15)
Test: Matrix Math: 173878.70 / 173848.98 / 174068.70 / 173872.46 (SE +/- 1.07, N = 3)
Test: Vector Math: 234239.60 / 234217.54 / 234493.07 / 234249.81 (SE +/- 5.36, N = 3)
Test: AVX-512 VNNI: 3667964.15 / 3674818.59 / 3671132.65 / 3675272.51 (SE +/- 4464.99, N = 3)
Test: Function Call: 27731.16 / 27539.19 / 27681.20 / 27757.95 (SE +/- 27.79, N = 3)
Test: x86_64 RdRand: 12013602.06 / 12011361.11 / 12013496.37 / 12012881.51 (SE +/- 82.64, N = 3)
Test: Floating Point: 11939.41 / 11904.73 / 11937.72 / 11911.03 (SE +/- 1.29, N = 3)
Test: Matrix 3D Math: 10869.49 / 11673.11 / 11593.68 / 11756.35 (SE +/- 31.82, N = 3)
Test: Memory Copying: 13428.64 / 13405.80 / 13421.50 / 13430.33 (SE +/- 2.68, N = 3)
Test: Vector Shuffle: 25006.73 / 25080.33 / 25048.79 / 24987.46 (SE +/- 29.34, N = 3)
Test: Mixed Scheduler: 35640.36 / 35991.41 / 35867.33 / 35532.56 (SE +/- 132.19, N = 3)
Test: Socket Activity: 9504.38 / 9611.34 / 3050.45 / 9605.29 (SE +/- 941.54, N = 15)
Test: Wide Vector Math: 1543291.91 / 1541832.29 / 1542414.02 / 1539140.95 (SE +/- 1152.35, N = 3)
Test: Context Switching: 17931974.92 / 18545533.36 / 18094107.24 / 18252120.52 (SE +/- 108301.82, N = 3)
Test: Fused Multiply-Add: 32151217.03 / 32144757.80 / 32117609.18 / 32101622.54 (SE +/- 5730.93, N = 3)
Test: Vector Floating Point: 106715.85 / 106411.74 / 107171.34 / 107730.40 (SE +/- 414.14, N = 3)
Test: Glibc C String Functions: 40991307.95 / 39897113.46 / 41635360.27 / 41243730.27 (SE +/- 540521.68, N = 3)
Test: Glibc Qsort Data Sorting: 916.74 / 920.53 / 917.31 / 917.08 (SE +/- 0.71, N = 3)
Test: System V Message Passing: 6851573.20 / 6858428.66 / 6872106.48 / 6868467.37 (SE +/- 5858.93, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc
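Most Stress-NG tests agree closely across the four runs, but a few do not. One way to pick out the outliers is to compare the range of the four results with their median. The Python sketch below does this for three of the tests above; the 5% cut-off and the name `relative_spread` are illustrative choices, not something the Phoronix Test Suite reports.

```python
from statistics import median

# Four results per test, copied from the Stress-NG lines above (Bogo Ops/s).
results = {
    "Crypto": [107643.49, 79781.66, 79687.85, 79760.82],
    "Socket Activity": [9504.38, 9611.34, 3050.45, 9605.29],
    "Matrix Math": [173878.70, 173848.98, 174068.70, 173872.46],
}


def relative_spread(values):
    """Range of the results divided by their median."""
    return (max(values) - min(values)) / median(values)


THRESHOLD = 0.05  # hypothetical cut-off for "runs disagree"

flagged = [name for name, values in results.items()
           if relative_spread(values) > THRESHOLD]
for name, values in results.items():
    print(f"{name}: spread {relative_spread(values):.1%}")
print("flagged:", flagged)
```

A large spread only says that the runs disagree. It does not say which run is the unusual one or why, so flagged tests are candidates for a re-run rather than conclusions.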
NCNN NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
NCNN 20230517 (ms, Fewer Is Better). Values and MIN/MAX pairs per line: d / c / b / EPYC 9124 2P.
Target: CPU - Model: mobilenet: 23.97 / 23.50 / 23.67 / 22.98 (SE +/- 0.17, N = 3); MIN/MAX: 23.73/24.33, 23.22/27.65, 22.96/27.9, 22.68/27.26
Target: CPU-v2-v2 - Model: mobilenet-v2: 11.83 / 12.04 / 11.38 / 11.63 (SE +/- 0.10, N = 3); MIN/MAX: 11.32/16.7, 10.97/124.42, 10.59/46.25, 11.11/16.06
Target: CPU-v3-v3 - Model: mobilenet-v3: 12.01 / 11.98 / 11.90 / 11.27 (SE +/- 0.03, N = 3); MIN/MAX: 11.78/15.81, 11.57/16.2, 11.07/16.04, 10.84/15.54
Target: CPU - Model: shufflenet-v2: 15.46 / 15.38 / 15.32 / 14.59 (SE +/- 0.07, N = 3); MIN/MAX: 15.18/25.14, 14.6/86.15, 14.99/19.69, 14.26/19.1
Target: CPU - Model: mnasnet: 10.83 / 10.75 / 10.89 / 10.89 (SE +/- 0.11, N = 3); MIN/MAX: 10.72/12.4, 10.55/20.58, 10.59/11.25, 10.64/15.33
Target: CPU - Model: efficientnet-b0: 15.43 / 15.18 / 15.65 / 15.65 (SE +/- 0.09, N = 3); MIN/MAX: 15.17/17.31, 14.78/24.12, 15.22/20.88, 15.38/19.76
Target: CPU - Model: blazeface: 5.84 / 6.15 / 5.82 / 5.80 (SE +/- 0.12, N = 3); MIN/MAX: 5.66/10.11, 5.85/10.7, 5.44/10.1, 5.61/6.11
Target: CPU - Model: googlenet: 30.11 / 29.75 / 29.63 / 29.49 (SE +/- 0.14, N = 3); MIN/MAX: 29.42/32.28, 29.08/39.28, 28.91/37.12, 28.8/33.78
Target: CPU - Model: vgg16: 28.69 / 28.56 / 28.48 / 28.31 (SE +/- 0.61, N = 3); MIN/MAX: 28.07/37.47, 25.87/151.12, 26.51/33.89, 27.15/33.46
Target: CPU - Model: resnet18: 13.75 / 13.60 / 13.66 / 13.53 (SE +/- 0.18, N = 3); MIN/MAX: 13.34/17.81, 13.23/17.34, 13.02/18.5, 13.23/13.98
Target: CPU - Model: alexnet: 8.14 / 8.14 / 8.22 / 8.06 (SE +/- 0.23, N = 3); MIN/MAX: 7.84/8.57, 7.75/12.48, 7.66/12.54, 7.76/12.3
Target: CPU - Model: resnet50: 26.68 / 26.84 / 26.63 / 26.29 (SE +/- 0.32, N = 3); MIN/MAX: 26.37/27.5, 26.51/31.18, 25.98/30.72, 25.96/30.61
Target: CPU - Model: yolov4-tiny: 40.34 / 35.50 / 35.64 / 35.63 (SE +/- 0.25, N = 3); MIN/MAX: 39.03/43.73, 34.27/39.94, 34.1/39.46, 34.31/39.09
Target: CPU - Model: squeezenet_ssd: 25.61 / 24.93 / 25.13 / 24.90 (SE +/- 0.09, N = 3); MIN/MAX: 24.93/30.16, 24.54/29.12, 24.57/30.59, 24.53/28.94
Target: CPU - Model: regnety_400m: 34.57 / 36.80 / 34.75 / 33.75 (SE +/- 0.30, N = 3); MIN/MAX: 34.34/37.1, 33.81/451.66, 32.99/39.74, 33.36/38.06
Target: CPU - Model: vision_transformer: 55.81 / 58.14 / 58.52 / 57.23 (SE +/- 1.11, N = 3); MIN/MAX: 55.15/61.12, 56.8/80.65, 56.14/303.74, 56.51/61.47
Target: CPU - Model: FastestDet: 16.00 / 19.87 / 17.64 / 16.62 (SE +/- 1.24, N = 3); MIN/MAX: 15.86/20.49, 19.45/29.36, 15.65/24.31, 16.27/18.12
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Blender
Blender 3.6 (Seconds, Fewer Is Better). Values per line: d / c / b / EPYC 9124 2P.
Blend File: BMW27 - Compute: CPU-Only: 37.93 / 37.75 / 37.85 / 37.66 (SE +/- 0.02, N = 3)
Blend File: Classroom - Compute: CPU-Only: 94.88 / 95.02 / 95.10 / 94.82 (SE +/- 0.08, N = 3)
Blend File: Fishy Cat - Compute: CPU-Only: 48.95 / 49.26 / 49.21 / 49.25 (SE +/- 0.13, N = 3)
Blend File: Barbershop - Compute: CPU-Only: 372.72 / 373.55 / 373.06 / 373.93 (SE +/- 0.81, N = 3)
Blend File: Pabellon Barcelona - Compute: CPU-Only: 122.70 / 123.00 / 122.37 / 122.49 (SE +/- 0.41, N = 3)
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
Kripke 1.2.6 (Throughput FoM, More Is Better). Values: d / c / b / EPYC 9124 2P.
372072000 / 365660900 / 374033600 / 376258700 (SE +/- 2419934.15, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -ldl
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
BRL-CAD 7.36 (VGR Performance Metric, More Is Better). Values: d / c / b / EPYC 9124 2P.
544648 / 542621 / 539832 / 545952
1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
EPYC 9124 2P
Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e
Java Notes: OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)
Python Notes: Python 3.9.16
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 25 August 2023 02:02 by user .
b
Testing initiated at 25 August 2023 04:34 by user .
c
Testing initiated at 25 August 2023 12:55 by user .
d
Testing initiated at 25 August 2023 16:37 by user .