3600x 2024 AMD Ryzen 9 3900XT 12-Core testing with a MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) and AMD Radeon RX 56/64 8GB on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2410197-NE-3600X202416&sor&grt .
3600x 2024 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution a b c AMD Ryzen 9 3900XT 12-Core @ 3.80GHz (12 Cores / 24 Threads) MSI MEG X570 GODLIKE (MS-7C34) v1.0 (1.B3 BIOS) AMD Starship/Matisse 2 x 8GB DDR4-4133MT/s F4-4133C19-8GTZKW 500GB Seagate FireCuda 520 SSD ZP500GM30002 AMD Radeon RX 56/64 8GB AMD Vega 10 HDMI Audio DELL U2723QE Realtek Device 2600 + Realtek Killer E3000 2.5GbE + Intel Wi-Fi 6 AX200 Ubuntu 22.04 6.8.0-47-generic (x86_64) GNOME Shell 42.9 X Server + Wayland 4.6 Mesa 22.0.1 (LLVM 13.0.1 DRM 3.57) 1.3.204 GCC 11.4.0 ext4 3840x2160 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8701021 Graphics Details - a, b: BAR1 / Visible vRAM Size: 256 MB Java Details - a: OpenJDK Runtime Environment (build 11.0.24+8-post-Ubuntu-1ubuntu322.04) Python Details - a: Python 3.10.12 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
3600x 2024 blosc: blosclz shuffle - 8MB blosc: blosclz shuffle - 16MB blosc: blosclz shuffle - 32MB blosc: blosclz shuffle - 64MB blosc: blosclz noshuffle - 8MB blosc: blosclz shuffle - 128MB blosc: blosclz bitshuffle - 8MB blosc: blosclz noshuffle - 16MB blosc: blosclz noshuffle - 32MB blosc: blosclz noshuffle - 64MB blosc: blosclz bitshuffle - 16MB blosc: blosclz bitshuffle - 32MB blosc: blosclz bitshuffle - 64MB blosc: blosclz noshuffle - 128MB blosc: blosclz bitshuffle - 128MB furmark: 1920 x 1080 - FurMark OpenGL - Off furmark: 1920 x 1080 - FurMark Vulkan - Off furmark: 1920 x 1080 - FurMark Knot OpenGL - Off furmark: 1920 x 1080 - Furmark Knot Vulkan - Off glibc-bench: cos glibc-bench: exp glibc-bench: ffs glibc-bench: pow glibc-bench: sin glibc-bench: log2 glibc-bench: modf glibc-bench: sinh glibc-bench: sqrt glibc-bench: tanh glibc-bench: asinh glibc-bench: atanh glibc-bench: ffsll glibc-bench: sincos glibc-bench: pthread_once lczero: BLAS litert: DeepLab V3 litert: SqueezeNet litert: Inception V4 litert: NASNet Mobile litert: Mobilenet Float litert: Mobilenet Quant litert: Inception ResNet V2 litert: Quantized COCO SSD MobileNet v1 namd: ATPase with 327,506 Atoms onednn: IP Shapes 1D - CPU onednn: IP Shapes 3D - CPU onednn: Convolution Batch Shapes Auto - CPU onednn: Deconvolution Batch shapes_1d - CPU onednn: Deconvolution Batch shapes_3d - CPU onednn: Recurrent Neural Network Training - CPU onednn: Recurrent Neural Network Inference - CPU xnnpack: FP32MobileNetV1 xnnpack: FP32MobileNetV2 xnnpack: FP32MobileNetV3Large xnnpack: FP32MobileNetV3Small xnnpack: FP16MobileNetV1 xnnpack: FP16MobileNetV2 xnnpack: FP16MobileNetV3Large xnnpack: FP16MobileNetV3Small xnnpack: QS8MobileNetV2 a b c 11575 11523.1 10369.6 8550.6 9500 6896.5 10444.6 8745 6738.3 5462.1 10404.9 9784 8054.3 4695.4 6703.8 42.121109 38.168106 11.254252 15.955235 81.3098 16.7658 6.03612 42.704 71.885 13.463 6.69978 29.16 10.3879 39.6167 31.7853 36.9881 6.06866 41.1823 6.03187 63 3688.37 2548.21 38445.9 12847.8 2282.72 1698.8 32634.7 2646.99 0.77506 2.55382 15.9354 18.9965 7.03818 5.33741 2654.7 1370.26 2454 2263 2505 1183 2493 1915 2178 1084 1377 11536.1 11253.3 10252.8 8530.4 9622.5 6893 10497.1 8834.7 6814.6 5424.4 10549.8 9540.4 8517.2 4696.9 6587.4 41.91703 78.4383 16.9909 6.08496 44.6435 71.035 13.5004 6.55046 28.1594 7.84162 40.0893 31.5253 36.9931 6.03505 41.9762 6.03169 3653.79 2553.5 38646.1 2239.6 1689.17 32941 2652.69 0.78593 2.66658 12.0669 18.9056 5.96129 5.36237 2666.16 1364.38 2502 2220 2512 1194 2485 1911 2174 1082 1371 11542.8 11379.4 10512.1 8536.4 9750.8 6900.4 10361.2 8993.3 6865.3 5439.3 10414.3 9643.8 8352.2 4715 6615.2 81.3725 19.2329 6.07068 44.6405 71.0232 12.9174 6.89904 29.4491 10.3278 39.6121 31.7904 35.7851 6.0688 41.1967 5.78067 3629.77 2546.22 39144.4 12877.1 2259.66 1686.52 32820.2 2648.53 0.78383 2.62929 12.2266 18.8464 6.03743 5.35165 2651.48 1364.68 2430 2206 2479 1189 2491 1912 2170 1082 1371 OpenBenchmarking.org
C-Blosc Test: blosclz shuffle - Buffer Size: 8MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 8MB a c b 2K 4K 6K 8K 10K 11575.0 11542.8 11536.1 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 16MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 16MB a c b 2K 4K 6K 8K 10K 11523.1 11379.4 11253.3 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 32MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 32MB c a b 2K 4K 6K 8K 10K 10512.1 10369.6 10252.8 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 64MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 64MB a c b 2K 4K 6K 8K 10K 8550.6 8536.4 8530.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 8MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 8MB c b a 2K 4K 6K 8K 10K 9750.8 9622.5 9500.0 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 128MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 128MB c a b 1500 3000 4500 6000 7500 6900.4 6896.5 6893.0 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 8MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 8MB b a c 2K 4K 6K 8K 10K 10497.1 10444.6 10361.2 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 16MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 16MB c b a 2K 4K 6K 8K 10K 8993.3 8834.7 8745.0 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 32MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 32MB c b a 1500 3000 4500 6000 7500 6865.3 6814.6 6738.3 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 64MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 64MB a c b 1200 2400 3600 4800 6000 5462.1 5439.3 5424.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 16MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 16MB b c a 2K 4K 6K 8K 10K 10549.8 10414.3 10404.9 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 32MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 32MB a c b 2K 4K 6K 8K 10K 9784.0 9643.8 9540.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 64MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 64MB b c a 2K 4K 6K 8K 10K 8517.2 8352.2 8054.3 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 128MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 128MB c b a 1000 2000 3000 4000 5000 4715.0 4696.9 4695.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 128MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 128MB a c b 1400 2800 4200 5600 7000 6703.8 6615.2 6587.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
FurMark Resolution: 1920 x 1080 - Demo: FurMark OpenGL - MSAA: Off OpenBenchmarking.org FPS, More Is Better FurMark 2.1 Resolution: 1920 x 1080 - Demo: FurMark OpenGL - MSAA: Off a b 10 20 30 40 50 42.12 41.92 MIN: 41 / MAX: 44 MIN: 41 / MAX: 44
FurMark Resolution: 1920 x 1080 - Demo: FurMark Vulkan - MSAA: Off OpenBenchmarking.org FPS, More Is Better FurMark 2.1 Resolution: 1920 x 1080 - Demo: FurMark Vulkan - MSAA: Off a 9 18 27 36 45 38.17 MIN: 38 / MAX: 40
FurMark Resolution: 1920 x 1080 - Demo: FurMark Knot OpenGL - MSAA: Off OpenBenchmarking.org FPS, More Is Better FurMark 2.1 Resolution: 1920 x 1080 - Demo: FurMark Knot OpenGL - MSAA: Off a 3 6 9 12 15 11.25 MAX: 13
FurMark Resolution: 1920 x 1080 - Demo: Furmark Knot Vulkan - MSAA: Off OpenBenchmarking.org FPS, More Is Better FurMark 2.1 Resolution: 1920 x 1080 - Demo: Furmark Knot Vulkan - MSAA: Off a 4 8 12 16 20 15.96 MAX: 19
Glibc Benchmarks Benchmark: cos OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: cos b a c 20 40 60 80 100 78.44 81.31 81.37 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: exp OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: exp a b c 5 10 15 20 25 16.77 16.99 19.23 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: ffs OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: ffs a c b 2 4 6 8 10 6.03612 6.07068 6.08496 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: pow OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: pow a c b 10 20 30 40 50 42.70 44.64 44.64 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: sin OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: sin c b a 16 32 48 64 80 71.02 71.04 71.89 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: log2 OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: log2 c a b 3 6 9 12 15 12.92 13.46 13.50 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: modf OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: modf b a c 2 4 6 8 10 6.55046 6.69978 6.89904 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: sinh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: sinh b a c 7 14 21 28 35 28.16 29.16 29.45 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: sqrt OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: sqrt b c a 3 6 9 12 15 7.84162 10.32780 10.38790 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: tanh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: tanh c a b 9 18 27 36 45 39.61 39.62 40.09 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: asinh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: asinh b a c 7 14 21 28 35 31.53 31.79 31.79 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: atanh OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: atanh c a b 9 18 27 36 45 35.79 36.99 36.99 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: ffsll OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: ffsll b a c 2 4 6 8 10 6.03505 6.06866 6.06880 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: sincos OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: sincos a c b 10 20 30 40 50 41.18 41.20 41.98 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
Glibc Benchmarks Benchmark: pthread_once OpenBenchmarking.org ns, Fewer Is Better Glibc Benchmarks 2.39 Benchmark: pthread_once c b a 2 4 6 8 10 5.78067 6.03169 6.03187 1. (CC) gcc options: -pie -nostdlib -nostartfiles -lgcc -lgcc_s
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.31.1 Backend: BLAS a 14 28 42 56 70 63 1. (CXX) g++ options: -flto -pthread
LiteRT Model: DeepLab V3 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: DeepLab V3 c b a 800 1600 2400 3200 4000 3629.77 3653.79 3688.37
LiteRT Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: SqueezeNet c a b 500 1000 1500 2000 2500 2546.22 2548.21 2553.50
LiteRT Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Inception V4 a b c 8K 16K 24K 32K 40K 38445.9 38646.1 39144.4
LiteRT Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: NASNet Mobile a c 3K 6K 9K 12K 15K 12847.8 12877.1
LiteRT Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Mobilenet Float b c a 500 1000 1500 2000 2500 2239.60 2259.66 2282.72
LiteRT Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Mobilenet Quant c b a 400 800 1200 1600 2000 1686.52 1689.17 1698.80
LiteRT Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Inception ResNet V2 a c b 7K 14K 21K 28K 35K 32634.7 32820.2 32941.0
LiteRT Model: Quantized COCO SSD MobileNet v1 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Quantized COCO SSD MobileNet v1 a c b 600 1200 1800 2400 3000 2646.99 2648.53 2652.69
NAMD Input: ATPase with 327,506 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0 Input: ATPase with 327,506 Atoms b c a 0.1768 0.3536 0.5304 0.7072 0.884 0.78593 0.78383 0.77506
oneDNN Harness: IP Shapes 1D - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.6 Harness: IP Shapes 1D - Engine: CPU a c b 0.6 1.2 1.8 2.4 3 2.55382 2.62929 2.66658 MIN: 2.5 MIN: 2.57 MIN: 2.59 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl
oneDNN Harness: IP Shapes 3D - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.6 Harness: IP Shapes 3D - Engine: CPU b c a 4 8 12 16 20 12.07 12.23 15.94 MIN: 11.98 MIN: 12.12 MIN: 12.64 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.6 Harness: Convolution Batch Shapes Auto - Engine: CPU c b a 5 10 15 20 25 18.85 18.91 19.00 MIN: 18.7 MIN: 18.8 MIN: 18.73 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.6 Harness: Deconvolution Batch shapes_1d - Engine: CPU b c a 2 4 6 8 10 5.96129 6.03743 7.03818 MIN: 5.08 MIN: 5.15 MIN: 5.34 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.6 Harness: Deconvolution Batch shapes_3d - Engine: CPU a c b 1.2065 2.413 3.6195 4.826 6.0325 5.33741 5.35165 5.36237 MIN: 5.23 MIN: 5.26 MIN: 5.26 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.6 Harness: Recurrent Neural Network Training - Engine: CPU c a b 600 1200 1800 2400 3000 2651.48 2654.70 2666.16 MIN: 2643.75 MIN: 2646.21 MIN: 2649.73 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.6 Harness: Recurrent Neural Network Inference - Engine: CPU b c a 300 600 900 1200 1500 1364.38 1364.68 1370.26 MIN: 1359.36 MIN: 1358.89 MIN: 1357.53 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl
XNNPACK Model: FP32MobileNetV1 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV1 c a b 500 1000 1500 2000 2500 2430 2454 2502 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV2 c b a 500 1000 1500 2000 2500 2206 2220 2263 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV3Large c a b 500 1000 1500 2000 2500 2479 2505 2512 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV3Small a c b 300 600 900 1200 1500 1183 1189 1194 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV1 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV1 b c a 500 1000 1500 2000 2500 2485 2491 2493 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV2 b c a 400 800 1200 1600 2000 1911 1912 1915 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV3Large c b a 500 1000 1500 2000 2500 2170 2174 2178 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV3Small b c a 200 400 600 800 1000 1082 1082 1084 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QS8MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: QS8MobileNetV2 b c a 300 600 900 1200 1500 1371 1371 1377 1. (CXX) g++ options: -O3 -lrt -lm
Phoronix Test Suite v10.8.5