3200u april

AMD Ryzen 3 3200U testing with a MOTILE PF4PU1F (N.1.03 BIOS) and AMD Radeon Vega 3 512MB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2204013-NE-3200UAPRI16
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
A
April 01
  40 Minutes
B
April 01
  2 Hours, 38 Minutes
C
April 01
  2 Hours, 48 Minutes
Invert Hiding All Results Option
  2 Hours, 2 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):


3200u aprilOpenBenchmarking.orgPhoronix Test Suite 10.8.3AMD Ryzen 3 3200U @ 2.60GHz (2 Cores / 4 Threads)MOTILE PF4PU1F (N.1.03 BIOS)AMD Raven/Raven23584MB128GB BIWIN SSDAMD Radeon Vega 3 512MB (1200/1200MHz)AMD Raven/Raven2/FenghuangRealtek RTL8111/8168/8411 + Intel Dual Band-AC 3168NGWUbuntu 20.045.15.0-051500-generic (x86_64)GNOME Shell 3.36.9X Server 1.20.134.6 Mesa 22.0.0-devel (git-9cb9101 2022-01-08 focal-oibaf-ppa) (LLVM 13.0.0 DRM 3.42)GCC 9.4.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen Resolution3200u April BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-yTrUTS/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8108102- OpenJDK Runtime Environment (build 11.0.14+9-Ubuntu-0ubuntu2.20.04)- Python 3.8.10- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

ABCResult OverviewPhoronix Test Suite 10.8.3100%110%121%131%fast-clispeedtest-cliJava JMHoneDNNperf-bench

3200u aprilfast-cli: Internet Download Speedfast-cli: Internet Upload Speedfast-cli: Internet Latencyfast-cli: Internet Loaded Latency (Bufferbloat)speedtest-cli: Internet Download Speedspeedtest-cli: Internet Upload Speedspeedtest-cli: Internet Latencyperf-bench: Epoll Waitperf-bench: Futex Hashperf-bench: Memcpy 1MBperf-bench: Memset 1MBperf-bench: Sched Pipeperf-bench: Futex Lock-Piperf-bench: Syscall Basiconednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUjava-jmh: ThroughputABC804.9258970.155.8942.541495289447994814.93741345.84081721898737251245725936.865716.845128.54596.6897848.299155.87364.396568.548837.059549.060238761.820075.538532.42022516.448738516.620185.113.34322843596070.78704.82518871.725.9229.932493971437385314.63208444.37497021396437471317106638.173216.834328.77456.7514749.076556.817065.647469.817339.054249.557739175.720542.539394.220561.216.647839570.920641.713.33212953181899.576534.82523449.965.6844.090493722440581314.81864244.09918121136637551325629238.386516.944428.62956.7392049.497456.885765.957669.755738.742750.033739745.020661.939940.820752.716.683140176.820988.013.37682961014078.74OpenBenchmarking.org

fast-cli

OpenBenchmarking.orgMbit/s, More Is Betterfast-cliInternet Download SpeedABC20406080100SE +/- 1.56, N = 15SE +/- 1.87, N = 15807053
OpenBenchmarking.orgMbit/s, More Is Betterfast-cliInternet Download SpeedABC1530456075Min: 58 / Avg: 69.93 / Max: 78Min: 41 / Avg: 53.13 / Max: 63

OpenBenchmarking.orgMbit/s, More Is Betterfast-cliInternet Upload SpeedABC1.10252.2053.30754.415.5125SE +/- 0.13, N = 15SE +/- 0.14, N = 154.94.84.8
OpenBenchmarking.orgMbit/s, More Is Betterfast-cliInternet Upload SpeedABC246810Min: 3.7 / Avg: 4.83 / Max: 5.7Min: 3.5 / Avg: 4.84 / Max: 5.5

OpenBenchmarking.orgms, Fewer Is Betterfast-cliInternet LatencyABC612182430SE +/- 0.54, N = 15SE +/- 0.63, N = 15252525
OpenBenchmarking.orgms, Fewer Is Betterfast-cliInternet LatencyABC612182430Min: 21 / Avg: 24.6 / Max: 29Min: 21 / Avg: 25.47 / Max: 31

OpenBenchmarking.orgms, Fewer Is Betterfast-cliInternet Loaded Latency (Bufferbloat)ABC50100150200250SE +/- 5.46, N = 15SE +/- 20.74, N = 1589188234
OpenBenchmarking.orgms, Fewer Is Betterfast-cliInternet Loaded Latency (Bufferbloat)ABC4080120160200Min: 151 / Avg: 188.4 / Max: 227Min: 103 / Avg: 233.73 / Max: 358

speedtest-cli

OpenBenchmarking.orgMbit/s, More Is Betterspeedtest-cli 2.1.3Internet Download SpeedABC1632486480SE +/- 1.89, N = 15SE +/- 1.56, N = 1470.1571.7249.96
OpenBenchmarking.orgMbit/s, More Is Betterspeedtest-cli 2.1.3Internet Download SpeedABC1428425670Min: 58.45 / Avg: 71.72 / Max: 81.64Min: 43.25 / Avg: 49.96 / Max: 65.41

OpenBenchmarking.orgMbit/s, More Is Betterspeedtest-cli 2.1.3Internet Upload SpeedABC1.3322.6643.9965.3286.66SE +/- 0.08, N = 15SE +/- 0.09, N = 145.895.925.68
OpenBenchmarking.orgMbit/s, More Is Betterspeedtest-cli 2.1.3Internet Upload SpeedABC246810Min: 5.54 / Avg: 5.92 / Max: 6.61Min: 4.92 / Avg: 5.68 / Max: 6.26

OpenBenchmarking.orgms, Fewer Is Betterspeedtest-cli 2.1.3Internet LatencyABC1020304050SE +/- 0.79, N = 15SE +/- 2.30, N = 1442.5429.9344.09
OpenBenchmarking.orgms, Fewer Is Betterspeedtest-cli 2.1.3Internet LatencyABC918273645Min: 26.89 / Avg: 29.93 / Max: 37.03Min: 30.84 / Avg: 44.09 / Max: 64.48

perf-bench

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Epoll WaitABC110K220K330K440K550KSE +/- 1986.39, N = 3SE +/- 1579.52, N = 34952894939714937221. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Epoll WaitABC90K180K270K360K450KMin: 491224 / Avg: 493971 / Max: 497830Min: 491085 / Avg: 493722.33 / Max: 4965471. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex HashABC1000K2000K3000K4000K5000KSE +/- 18272.82, N = 3SE +/- 25564.63, N = 34479948437385344058131. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex HashABC800K1600K2400K3200K4000KMin: 4350941 / Avg: 4373853 / Max: 4409966Min: 4377078 / Avg: 4405812.67 / Max: 44568051. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memcpy 1MBABC48121620SE +/- 0.12, N = 3SE +/- 0.05, N = 314.9414.6314.821. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memcpy 1MBABC48121620Min: 14.4 / Avg: 14.63 / Max: 14.81Min: 14.76 / Avg: 14.82 / Max: 14.911. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma

OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memset 1MBABC1020304050SE +/- 0.41, N = 10SE +/- 0.38, N = 1245.8444.3744.101. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.orgGB/sec, More Is Betterperf-benchBenchmark: Memset 1MBABC918273645Min: 41.31 / Avg: 44.37 / Max: 45.73Min: 41.82 / Avg: 44.1 / Max: 45.211. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched PipeABC50K100K150K200K250KSE +/- 3038.65, N = 3SE +/- 3146.65, N = 42189872139642113661. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Sched PipeABC40K80K120K160K200KMin: 210281 / Avg: 213964 / Max: 219992Min: 206308 / Avg: 211366 / Max: 2203271. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Lock-PiABC8001600240032004000SE +/- 11.02, N = 3SE +/- 4.04, N = 33725374737551. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Futex Lock-PiABC7001400210028003500Min: 3725 / Avg: 3747 / Max: 3759Min: 3747 / Avg: 3755 / Max: 37601. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma

OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Syscall BasicABC3M6M9M12M15MSE +/- 84928.86, N = 3SE +/- 95015.37, N = 31245725913171066132562921. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
OpenBenchmarking.orgops/sec, More Is Betterperf-benchBenchmark: Syscall BasicABC2M4M6M8M10MMin: 13032900 / Avg: 13171066.33 / Max: 13325714Min: 13148037 / Avg: 13256291.67 / Max: 134456761. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUABC918273645SE +/- 0.61, N = 3SE +/- 0.58, N = 336.8738.1738.39MIN: 34.13MIN: 33.92MIN: 34.091. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUABC816243240Min: 36.95 / Avg: 38.17 / Max: 38.91Min: 37.25 / Avg: 38.39 / Max: 39.181. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUABC48121620SE +/- 0.03, N = 3SE +/- 0.05, N = 316.8516.8316.94MIN: 16.75MIN: 16.67MIN: 16.691. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUABC48121620Min: 16.79 / Avg: 16.83 / Max: 16.89Min: 16.86 / Avg: 16.94 / Max: 17.021. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUABC714212835SE +/- 0.06, N = 3SE +/- 0.08, N = 328.5528.7728.63MIN: 26.62MIN: 26.44MIN: 26.651. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUABC612182430Min: 28.69 / Avg: 28.77 / Max: 28.88Min: 28.51 / Avg: 28.63 / Max: 28.781. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUABC246810SE +/- 0.02701, N = 3SE +/- 0.03112, N = 36.689786.751476.73920MIN: 6.21MIN: 6.08MIN: 6.121. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUABC3691215Min: 6.71 / Avg: 6.75 / Max: 6.8Min: 6.7 / Avg: 6.74 / Max: 6.81. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUABC1122334455SE +/- 0.27, N = 3SE +/- 0.30, N = 348.3049.0849.50MIN: 46.76MIN: 47.39MIN: 47.051. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUABC1020304050Min: 48.56 / Avg: 49.08 / Max: 49.46Min: 48.91 / Avg: 49.5 / Max: 49.821. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUABC1326395265SE +/- 0.20, N = 3SE +/- 0.36, N = 355.8756.8256.89MIN: 51.91MIN: 51.77MIN: 52.241. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUABC1122334455Min: 56.44 / Avg: 56.82 / Max: 57.12Min: 56.24 / Avg: 56.89 / Max: 57.471. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUABC1530456075SE +/- 0.05, N = 3SE +/- 0.07, N = 364.4065.6565.96MIN: 61.73MIN: 63.07MIN: 63.151. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUABC1326395265Min: 65.55 / Avg: 65.65 / Max: 65.72Min: 65.85 / Avg: 65.96 / Max: 66.081. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUABC1632486480SE +/- 0.27, N = 3SE +/- 0.32, N = 368.5569.8269.76MIN: 68.19MIN: 68.71MIN: 68.521. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUABC1428425670Min: 69.33 / Avg: 69.82 / Max: 70.26Min: 69.12 / Avg: 69.76 / Max: 70.151. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUABC918273645SE +/- 0.39, N = 3SE +/- 0.47, N = 337.0639.0538.74MIN: 32.44MIN: 32.94MIN: 32.141. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUABC816243240Min: 38.32 / Avg: 39.05 / Max: 39.63Min: 37.82 / Avg: 38.74 / Max: 39.331. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUABC1122334455SE +/- 0.05, N = 3SE +/- 0.25, N = 349.0649.5650.03MIN: 47.61MIN: 46.99MIN: 45.61. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUABC1020304050Min: 49.5 / Avg: 49.56 / Max: 49.66Min: 49.77 / Avg: 50.03 / Max: 50.531. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUABC9K18K27K36K45KSE +/- 210.62, N = 3SE +/- 132.17, N = 338761.839175.739745.0MIN: 38608.8MIN: 38587.5MIN: 39411.11. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUABC7K14K21K28K35KMin: 38831.2 / Avg: 39175.73 / Max: 39557.9Min: 39580.4 / Avg: 39744.97 / Max: 40006.41. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUABC4K8K12K16K20KSE +/- 9.59, N = 3SE +/- 32.94, N = 320075.520542.520661.9MIN: 19976.5MIN: 20405.1MIN: 20509.81. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUABC4K8K12K16K20KMin: 20532.4 / Avg: 20542.53 / Max: 20561.7Min: 20614.7 / Avg: 20661.9 / Max: 20725.31. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUABC9K18K27K36K45KSE +/- 199.84, N = 3SE +/- 265.92, N = 338532.439394.239940.8MIN: 38387MIN: 39008.4MIN: 39404.31. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUABC7K14K21K28K35KMin: 39168.4 / Avg: 39394.2 / Max: 39792.7Min: 39563 / Avg: 39940.83 / Max: 40453.91. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUABC4K8K12K16K20KSE +/- 77.64, N = 3SE +/- 76.03, N = 320225.020561.220752.7MIN: 20127.6MIN: 20284.8MIN: 204851. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUABC4K8K12K16K20KMin: 20407.5 / Avg: 20561.23 / Max: 20657Min: 20642.2 / Avg: 20752.67 / Max: 20898.41. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUABC48121620SE +/- 0.02, N = 3SE +/- 0.01, N = 316.4516.6516.68MIN: 15.49MIN: 15.56MIN: 15.421. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUABC48121620Min: 16.63 / Avg: 16.65 / Max: 16.68Min: 16.67 / Avg: 16.68 / Max: 16.71. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUABC9K18K27K36K45KSE +/- 251.58, N = 3SE +/- 53.42, N = 338516.639570.940176.8MIN: 38386.3MIN: 38855.2MIN: 39914.61. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUABC7K14K21K28K35KMin: 39068.6 / Avg: 39570.9 / Max: 39847.4Min: 40085.7 / Avg: 40176.83 / Max: 40270.71. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUABC4K8K12K16K20KSE +/- 45.51, N = 3SE +/- 25.94, N = 320185.120641.720988.0MIN: 20078.8MIN: 20454.6MIN: 20793.71. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUABC4K8K12K16K20KMin: 20561.7 / Avg: 20641.7 / Max: 20719.3Min: 20948.7 / Avg: 20988.03 / Max: 210371. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUABC3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 313.3413.3313.38MIN: 11.94MIN: 11.95MIN: 12.061. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUABC48121620Min: 13.31 / Avg: 13.33 / Max: 13.35Min: 13.36 / Avg: 13.38 / Max: 13.41. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Java JMH

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughputABC600M1200M1800M2400M3000M2843596070.782953181899.582961014078.74