Tests for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1801075-AL-RETPOLINE03 retpoline-testing - Phoronix Test Suite retpoline-testing Tests for a future article.
HTML result view exported from: https://openbenchmarking.org/result/1801075-AL-RETPOLINE03&grr&sro&rro .
Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Driver OpenCL Compiler File-System Screen Resolution EPYC 7601 2 x Xeon Gold 6138 noretpoline Retpoline Retpoline + GCC no retpoline Retpoline Retpoline + GCC AMD EPYC 7601 32-Core @ 2.20GHz (32 Cores / 64 Threads) TYAN B8026T70AE24HR AMD Device 1450 129024MB 280GB INTEL SSDPE21D280GA ASPEED ASPEED Family VE228 Broadcom Limited NetXtreme BCM5720 Gigabit PCIe Ubuntu 17.10 4.14.0-phx-retpoline (x86_64) GNOME Shell 3.26.1 modesetting 1.19.5 OpenCL 1.2 pocl 1.0 LLVM 5.0.0 GCC 7.2.0 + Clang 5.0.0-3 + LLVM 5.0.0 ext4 1920x1080 4.14.0-phx-retpoline-gcc-retpo (x86_64) 2 x Intel Xeon Gold 6138 @ 3.70GHz (40 Cores / 80 Threads) TYAN S7106 Intel Device 2020 96256MB 256GB Samsung SSD 850 + 2000GB Seagate ST2000DM006-2DM1 + 2 x 120GB TOSHIBA-TR150 Intel I210 Gigabit Connection 4.14.0-phx-retpoline (x86_64) GCC 7.2.0 4.14.0-phx-retpoline-gcc-retpo (x86_64) OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Disk Details - EPYC 7601: noretpoline: NONE / data=ordered,errors=remount-ro,relatime,rw - EPYC 7601: Retpoline: NONE / data=ordered,errors=remount-ro,relatime,rw - EPYC 7601: Retpoline + GCC: NONE / data=ordered,errors=remount-ro,relatime,rw - 2 x Xeon Gold 6138: no retpoline: CFQ / data=ordered,errors=remount-ro,relatime,rw - 2 x Xeon Gold 6138: Retpoline: CFQ / data=ordered,errors=remount-ro,relatime,rw - 2 x Xeon Gold 6138: Retpoline + GCC: CFQ / data=ordered,errors=remount-ro,relatime,rw Processor Details - EPYC 7601: noretpoline: Scaling Governor: acpi-cpufreq ondemand - EPYC 7601: Retpoline: Scaling Governor: acpi-cpufreq ondemand - EPYC 7601: Retpoline + GCC: Scaling Governor: acpi-cpufreq ondemand - 2 x Xeon Gold 6138: no retpoline: Scaling Governor: intel_pstate powersave - 2 x Xeon Gold 6138: Retpoline: Scaling Governor: intel_pstate powersave - 2 x Xeon Gold 6138: Retpoline + GCC: Scaling Governor: intel_pstate powersave System Details - Python 2.7.14.
pgbench: Buffer Test - Normal Load - Read Write pgbench: Buffer Test - Normal Load - Read Only scikit-learn: apache: Static Web Page Serving pybench: Total For Average Test Times redis: SET redis: GET redis: LPUSH redis: SADD redis: LPOP tjbench: Decompression Throughput glibc-bench: pthread_once glibc-bench: sqrt glibc-bench: ffs compress-lzma: 256MB File Compression stockfish: Total Time c-ray: Total Time build-linux-kernel: Time To Compile build-apache: Time To Compile ebizzy: john-the-ripper: Blowfish cachebench: Read / Modify / Write cachebench: Write cachebench: Read lzbench: Zstd 1 - Compression lzbench: XZ 0 - Compression rodinia: OpenMP CFD Solver rodinia: OpenMP LavaMD parboil: OpenMP MRI Gridding parboil: OpenMP CUTCP t-test1: 2 t-test1: 1 compilebench: Initial Create compilebench: Compile fs-mark: 4000 Files, 32 Sub Dirs, 1MB Size fs-mark: 1000 Files, 1MB Size fio: Seq Write - Libaio - No - Yes - 4KB - Default Test Directory fio: Seq Write - Libaio - No - Yes - 2MB - Default Test Directory fio: Seq Read - Libaio - No - Yes - 4KB - Default Test Directory fio: Seq Read - Libaio - No - Yes - 2MB - Default Test Directory fio: Rand Write - Libaio - No - Yes - 4KB - Default Test Directory fio: Rand Write - Libaio - No - Yes - 2MB - Default Test Directory fio: Rand Read - Libaio - No - Yes - 4KB - Default Test Directory fio: Rand Read - Libaio - No - Yes - 2MB - Default Test Directory EPYC 7601 2 x Xeon Gold 6138 noretpoline Retpoline Retpoline + GCC no retpoline Retpoline Retpoline + GCC 34.00 16587.34 1794 1131889.69 1394712.79 1117859.50 1233207.56 1520530.25 140.83 4.83 4.70 4.83 329.57 4507 3.50 38.30 31.71 1093230 35458 22924.24 21622.77 2205.73 335 24 10.97 31.65 283.16 2.71 14.38 37.76 410.85 1696.87 581.87 597.93 1081.23 1971.10 1183.53 2511.77 1059.73 1972.70 1176.20 2509.83 33.91 16838.76 1801 1161527.67 1362255.92 1147686.87 1293172.50 1394465.79 140.99 4.83 4.70 4.83 328.95 4507 3.45 38.44 32.21 1073587 35714 22918.96 21556.98 2215.24 336 24 10.76 31.86 283.25 2.73 14.46 36.68 406.41 1691.37 629.27 639.77 1091.50 2158.03 1160.83 2618.67 1046.93 2157.60 1165.97 2615.60 34.47 16716.48 1795 1162399.91 1352327.02 1123252.42 1249051.08 1230732.38 140.87 4.83 4.70 4.83 329.00 4501 3.48 38.52 32.12 952691 35715 22900.81 21835.76 2214.25 335 24 11.08 31.87 288.36 2.70 14.66 38.18 406.91 1693.10 631.30 619.67 1072.57 2156.87 1193.80 2618.20 1048.77 2158.27 1128.57 2637.27 2136.64 599258.14 184.75 22258.72 1307 1564339.04 1627717.71 1357638.81 1589703.69 1423431.25 145.81 3.87 12.30 3.26 281.84 3437 3.16 30.25 26.24 946321 49652 25777.79 24954.96 2973.08 361 32 9.73 28.50 407.91 2.35 23.12 65.91 533.70 1496.78 111.02 128.67 415.49 514.75 457.26 519.38 338.58 523.89 383.14 543.05 2467.38 597483.03 185.68 21347.64 1306 1525833.21 1738340.06 1382896.04 1573307.83 1486377.46 154.02 3.80 9.95 3.26 281.25 3654 3.15 30.46 26.31 954773 48399 25061.33 23882.88 2872.09 369 30 9.93 28.70 413.85 2.38 23.29 66.56 508.92 1694.04 136.67 133.73 413.25 526.12 454.44 518.49 341.01 514.67 384.79 543.18 3976.32 577641.99 186.08 18757.65 1315 1399642.27 1574731.42 1454926.31 1604656.60 1347143.44 146.51 4.49 9.96 3.26 282.04 3613 3.18 30.63 26.32 950671 51089 24928.44 23753.79 2839.79 370 31 10.54 28.43 405.25 2.38 23.04 64.45 478.76 1637.02 111.17 129.63 416.94 527.34 456.92 518.48 341.98 512.18 388.19 543.37 OpenBenchmarking.org
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Write no retpoline Retpoline + GCC Retpoline 900 1800 2700 3600 4500 SE +/- 40.05, N = 3 SE +/- 15.09, N = 3 SE +/- 49.01, N = 6 2136.64 3976.32 2467.38 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -fPIC -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
PostgreSQL pgbench Scaling: Buffer Test - Test: Normal Load - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL pgbench 10.0 Scaling: Buffer Test - Test: Normal Load - Mode: Read Only no retpoline Retpoline + GCC Retpoline 130K 260K 390K 520K 650K SE +/- 2613.28, N = 3 SE +/- 4346.70, N = 3 SE +/- 1869.94, N = 3 599258.14 577641.99 597483.03 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -fPIC -lpgcommon -lpgport -lpthread -lrt -lcrypt -ldl -lm
Scikit-Learn EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better Scikit-Learn 0.17.1 noretpoline Retpoline + GCC Retpoline no retpoline 40 80 120 160 200 SE +/- 0.06, N = 3 SE +/- 0.48, N = 3 SE +/- 0.02, N = 3 SE +/- 0.99, N = 3 SE +/- 2.58, N = 3 SE +/- 0.29, N = 3 34.00 34.47 33.91 184.75 186.08 185.68
Apache Benchmark Static Web Page Serving EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.7 Static Web Page Serving noretpoline Retpoline + GCC Retpoline no retpoline 5K 10K 15K 20K 25K SE +/- 41.45, N = 3 SE +/- 296.19, N = 3 SE +/- 97.15, N = 3 SE +/- 139.49, N = 3 SE +/- 51.11, N = 3 SE +/- 63.48, N = 3 16587.34 16716.48 16838.76 22258.72 18757.65 21347.64 1. (CC) gcc options: -shared -fPIC -O2 -pthread
PyBench Total For Average Test Times EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2008-08-14 Total For Average Test Times noretpoline Retpoline + GCC Retpoline no retpoline 400 800 1200 1600 2000 SE +/- 7.97, N = 3 SE +/- 11.33, N = 3 SE +/- 1.76, N = 3 SE +/- 1.53, N = 3 1794 1795 1801 1307 1315 1306
Redis Test: SET EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Requests Per Second, More Is Better Redis 3.0.1 Test: SET noretpoline Retpoline + GCC Retpoline no retpoline 300K 600K 900K 1200K 1500K SE +/- 26854.76, N = 6 SE +/- 5879.00, N = 3 SE +/- 7115.17, N = 3 SE +/- 28059.92, N = 3 SE +/- 55833.41, N = 6 SE +/- 90449.67, N = 6 1131889.69 1162399.91 1161527.67 1564339.04 1399642.27 1525833.21 1. (CC) gcc options: -ggdb -rdynamic -lm -pthread
Redis Test: GET EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Requests Per Second, More Is Better Redis 3.0.1 Test: GET noretpoline Retpoline + GCC Retpoline no retpoline 400K 800K 1200K 1600K 2000K SE +/- 2966.06, N = 3 SE +/- 28137.42, N = 6 SE +/- 27266.47, N = 3 SE +/- 50934.82, N = 6 SE +/- 80851.67, N = 6 SE +/- 96355.24, N = 6 1394712.79 1352327.02 1362255.92 1627717.71 1574731.42 1738340.06 1. (CC) gcc options: -ggdb -rdynamic -lm -pthread
Redis Test: LPUSH EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Requests Per Second, More Is Better Redis 3.0.1 Test: LPUSH noretpoline Retpoline + GCC Retpoline no retpoline 300K 600K 900K 1200K 1500K SE +/- 8338.77, N = 3 SE +/- 6595.02, N = 3 SE +/- 3427.20, N = 3 SE +/- 57810.96, N = 6 SE +/- 49927.23, N = 6 SE +/- 51060.27, N = 6 1117859.50 1123252.42 1147686.87 1357638.81 1454926.31 1382896.04 1. (CC) gcc options: -ggdb -rdynamic -lm -pthread
Redis Test: SADD EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Requests Per Second, More Is Better Redis 3.0.1 Test: SADD noretpoline Retpoline + GCC Retpoline no retpoline 300K 600K 900K 1200K 1500K SE +/- 19184.10, N = 4 SE +/- 7598.74, N = 3 SE +/- 51667.16, N = 6 SE +/- 68212.97, N = 6 SE +/- 62798.70, N = 6 SE +/- 95230.31, N = 6 1233207.56 1249051.08 1293172.50 1589703.69 1604656.60 1573307.83 1. (CC) gcc options: -ggdb -rdynamic -lm -pthread
Redis Test: LPOP EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Requests Per Second, More Is Better Redis 3.0.1 Test: LPOP noretpoline Retpoline + GCC Retpoline no retpoline 300K 600K 900K 1200K 1500K SE +/- 1539.75, N = 3 SE +/- 11491.58, N = 3 SE +/- 61195.38, N = 6 SE +/- 36207.27, N = 6 SE +/- 25234.38, N = 6 SE +/- 77843.26, N = 6 1520530.25 1230732.38 1394465.79 1423431.25 1347143.44 1486377.46 1. (CC) gcc options: -ggdb -rdynamic -lm -pthread
libjpeg-turbo tjbench Test: Decompression Throughput EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 1.5.1 Test: Decompression Throughput noretpoline Retpoline + GCC Retpoline no retpoline 30 60 90 120 150 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.00, N = 3 SE +/- 5.29, N = 6 SE +/- 6.27, N = 6 SE +/- 1.27, N = 3 140.83 140.87 140.99 145.81 146.51 154.02 1. (CC) gcc options: -O3 -lm
glibc bench Benchmark: pthread_once EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org nanoseconds, Fewer Is Better glibc bench 1.0 Benchmark: pthread_once noretpoline Retpoline + GCC Retpoline no retpoline 1.0868 2.1736 3.2604 4.3472 5.434 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.06, N = 3 SE +/- 0.24, N = 6 SE +/- 0.00, N = 3 4.83 4.83 4.83 3.87 4.49 3.80
glibc bench Benchmark: sqrt EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org nanoseconds, Fewer Is Better glibc bench 1.0 Benchmark: sqrt noretpoline Retpoline + GCC Retpoline no retpoline 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.62, N = 6 SE +/- 0.62, N = 6 4.70 4.70 4.70 12.30 9.96 9.95
glibc bench Benchmark: ffs EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org nanoseconds, Fewer Is Better glibc bench 1.0 Benchmark: ffs noretpoline Retpoline + GCC Retpoline no retpoline 1.0868 2.1736 3.2604 4.3472 5.434 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.83 4.83 4.83 3.26 3.26 3.26
LZMA Compression 256MB File Compression EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better LZMA Compression 256MB File Compression noretpoline Retpoline + GCC Retpoline no retpoline 70 140 210 280 350 SE +/- 0.18, N = 3 SE +/- 0.42, N = 3 SE +/- 0.22, N = 3 SE +/- 0.34, N = 3 SE +/- 2.80, N = 3 SE +/- 0.60, N = 3 329.57 329.00 328.95 281.84 282.04 281.25 1. (CXX) g++ options: -O2
Stockfish Total Time EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org ms, Fewer Is Better Stockfish 2014-11-26 Total Time noretpoline Retpoline + GCC Retpoline no retpoline 1000 2000 3000 4000 5000 SE +/- 3.48, N = 3 SE +/- 2.65, N = 3 SE +/- 37.22, N = 3 SE +/- 210.25, N = 6 SE +/- 202.73, N = 6 4507 4501 4507 3437 3613 3654 1. (CXX) g++ options: -lpthread -fno-exceptions -fno-rtti -ansi -pedantic -O3 -msse -msse3 -mpopcnt -flto
C-Ray Total Time EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time noretpoline Retpoline + GCC Retpoline no retpoline 0.7875 1.575 2.3625 3.15 3.9375 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 3.50 3.48 3.45 3.16 3.18 3.15 1. (CC) gcc options: -lm -lpthread -O3
Timed Linux Kernel Compilation Time To Compile EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 4.13 Time To Compile noretpoline Retpoline + GCC Retpoline no retpoline 9 18 27 36 45 SE +/- 0.68, N = 6 SE +/- 0.63, N = 6 SE +/- 0.76, N = 6 SE +/- 0.75, N = 6 SE +/- 0.80, N = 6 SE +/- 0.77, N = 6 38.30 38.52 38.44 30.25 30.63 30.46
Timed Apache Compilation Time To Compile EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.7 Time To Compile noretpoline Retpoline + GCC Retpoline no retpoline 7 14 21 28 35 SE +/- 0.14, N = 3 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 0.14, N = 3 31.71 32.12 32.21 26.24 26.32 26.31
ebizzy EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 noretpoline Retpoline + GCC Retpoline no retpoline 200K 400K 600K 800K 1000K SE +/- 19950.45, N = 3 SE +/- 23463.01, N = 6 SE +/- 26430.87, N = 6 SE +/- 7835.23, N = 3 SE +/- 13692.48, N = 5 SE +/- 13516.78, N = 6 1093230 952691 1073587 946321 950671 954773 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
John The Ripper Test: Blowfish EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.8.0 Test: Blowfish noretpoline Retpoline + GCC Retpoline no retpoline 11K 22K 33K 44K 55K SE +/- 400.35, N = 3 SE +/- 232.00, N = 3 SE +/- 192.26, N = 3 SE +/- 562.61, N = 3 SE +/- 357.46, N = 3 SE +/- 1882.95, N = 6 35458 35715 35714 49652 51089 48399 1. (CC) gcc options: -fopenmp -lcrypt
CacheBench Test: Read / Modify / Write EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write noretpoline Retpoline + GCC Retpoline no retpoline 6K 12K 18K 24K 30K SE +/- 5.63, N = 3 SE +/- 12.91, N = 3 SE +/- 9.68, N = 3 SE +/- 112.30, N = 3 SE +/- 226.86, N = 3 SE +/- 271.49, N = 3 22924.24 22900.81 22918.96 25777.79 24928.44 25061.33 1. (CC) gcc options: -lrt
CacheBench Test: Write EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write noretpoline Retpoline + GCC Retpoline no retpoline 5K 10K 15K 20K 25K SE +/- 53.65, N = 3 SE +/- 25.70, N = 3 SE +/- 133.19, N = 3 SE +/- 34.79, N = 3 SE +/- 82.06, N = 3 SE +/- 197.71, N = 3 21622.77 21835.76 21556.98 24954.96 23753.79 23882.88 1. (CC) gcc options: -lrt
CacheBench Test: Read EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read noretpoline Retpoline + GCC Retpoline no retpoline 600 1200 1800 2400 3000 SE +/- 9.38, N = 3 SE +/- 0.98, N = 3 SE +/- 0.02, N = 3 SE +/- 7.30, N = 3 SE +/- 11.39, N = 3 SE +/- 12.31, N = 3 2205.73 2214.25 2215.24 2973.08 2839.79 2872.09 1. (CC) gcc options: -lrt
lzbench Test: Zstd 1 - Process: Decompression EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Decompression noretpoline Retpoline + GCC Retpoline no retpoline 200 400 600 800 1000 SE +/- 4.33, N = 3 SE +/- 3.93, N = 3 SE +/- 4.51, N = 3 SE +/- 4.58, N = 3 SE +/- 3.00, N = 3 SE +/- 4.18, N = 3 911 910 911 975 983 982 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: Zstd 1 - Process: Compression EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Compression noretpoline Retpoline + GCC Retpoline no retpoline 80 160 240 320 400 SE +/- 1.33, N = 3 SE +/- 1.20, N = 3 SE +/- 1.33, N = 3 SE +/- 0.88, N = 3 SE +/- 2.08, N = 3 SE +/- 2.52, N = 3 335 335 336 361 370 369 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Decompression EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Decompression noretpoline Retpoline + GCC Retpoline no retpoline 20 40 60 80 100 SE +/- 1.03, N = 6 SE +/- 1.77, N = 6 SE +/- 2.00, N = 6 77 77 76 89 86 87 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench Test: XZ 0 - Process: Compression EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Compression noretpoline Retpoline + GCC Retpoline no retpoline 7 14 21 28 35 SE +/- 0.72, N = 6 SE +/- 0.50, N = 6 SE +/- 0.56, N = 6 24 24 24 32 31 30 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Rodinia Test: OpenMP CFD Solver EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP CFD Solver noretpoline Retpoline + GCC Retpoline no retpoline 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 SE +/- 0.16, N = 3 SE +/- 0.30, N = 6 SE +/- 0.15, N = 5 10.97 11.08 10.76 9.73 10.54 9.93 1. (CXX) g++ options: -O2 -lOpenCL
Rodinia Test: OpenMP LavaMD EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 2.4 Test: OpenMP LavaMD noretpoline Retpoline + GCC Retpoline no retpoline 7 14 21 28 35 SE +/- 0.14, N = 3 SE +/- 0.14, N = 3 SE +/- 0.08, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 SE +/- 0.16, N = 3 31.65 31.87 31.86 28.50 28.43 28.70 1. (CXX) g++ options: -O2 -lOpenCL
Parboil Test: OpenMP MRI Gridding EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding noretpoline Retpoline + GCC Retpoline no retpoline 90 180 270 360 450 SE +/- 0.38, N = 3 SE +/- 1.39, N = 3 SE +/- 0.14, N = 3 SE +/- 6.83, N = 3 SE +/- 6.86, N = 4 SE +/- 6.92, N = 3 283.16 288.36 283.25 407.91 405.25 413.85 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil Test: OpenMP CUTCP EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP noretpoline Retpoline + GCC Retpoline no retpoline 0.6143 1.2286 1.8429 2.4572 3.0715 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 6 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 2.71 2.70 2.73 2.35 2.38 2.38 1. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
t-test1 Threads: 2 EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 noretpoline Retpoline + GCC Retpoline no retpoline 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.13, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 14.38 14.66 14.46 23.12 23.04 23.29 1. (CC) gcc options: -pthread
t-test1 Threads: 1 EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 1 noretpoline Retpoline + GCC Retpoline no retpoline 15 30 45 60 75 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 SE +/- 0.14, N = 3 SE +/- 0.09, N = 3 37.76 38.18 36.68 65.91 64.45 66.56 1. (CC) gcc options: -pthread
Compile Bench Test: Initial Create EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Initial Create noretpoline Retpoline + GCC Retpoline no retpoline 120 240 360 480 600 SE +/- 2.54, N = 3 SE +/- 4.20, N = 3 SE +/- 3.88, N = 3 SE +/- 1.88, N = 3 SE +/- 27.64, N = 3 SE +/- 25.41, N = 3 410.85 406.91 406.41 533.70 478.76 508.92
Compile Bench Test: Compile EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Compile Bench 0.6 Test: Compile noretpoline Retpoline + GCC Retpoline no retpoline 400 800 1200 1600 2000 SE +/- 9.70, N = 3 SE +/- 4.84, N = 3 SE +/- 3.64, N = 3 SE +/- 7.13, N = 3 SE +/- 40.27, N = 6 SE +/- 62.71, N = 6 1696.87 1693.10 1691.37 1496.78 1637.02 1694.04
FS-Mark Test: 4000 Files, 32 Sub Dirs, 1MB Size EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 4000 Files, 32 Sub Dirs, 1MB Size noretpoline Retpoline + GCC Retpoline no retpoline 140 280 420 560 700 SE +/- 1.13, N = 3 SE +/- 4.00, N = 3 SE +/- 1.74, N = 3 SE +/- 2.97, N = 6 SE +/- 7.23, N = 6 SE +/- 0.54, N = 3 581.87 631.30 629.27 111.02 111.17 136.67 1. (CC) gcc options: -static
FS-Mark Test: 1000 Files, 1MB Size EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org Files/s, More Is Better FS-Mark 3.3 Test: 1000 Files, 1MB Size noretpoline Retpoline + GCC Retpoline no retpoline 140 280 420 560 700 SE +/- 5.75, N = 3 SE +/- 3.59, N = 3 SE +/- 1.90, N = 3 SE +/- 2.17, N = 3 SE +/- 2.15, N = 6 SE +/- 0.20, N = 3 597.93 619.67 639.77 128.67 129.63 133.73 1. (CC) gcc options: -static
Flexible IO Tester Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 60K 120K 180K 240K 300K SE +/- 3663.70, N = 3 SE +/- 1888.66, N = 3 SE +/- 4389.73, N = 6 SE +/- 614.11, N = 3 SE +/- 361.30, N = 3 SE +/- 241.21, N = 3 276778 274530 279404 103868 104232 103308 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 200 400 600 800 1000 SE +/- 14.34, N = 3 SE +/- 7.38, N = 3 SE +/- 17.15, N = 6 SE +/- 2.46, N = 3 SE +/- 1.44, N = 3 SE +/- 0.97, N = 3 1081.23 1072.57 1091.50 415.49 416.94 413.25 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 200 400 600 800 1000 SE +/- 0.58, N = 3 SE +/- 1.86, N = 3 SE +/- 1.86, N = 3 SE +/- 4.34, N = 4 982 1075 1075 248 254 253 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 500 1000 1500 2000 2500 SE +/- 1.26, N = 3 SE +/- 4.03, N = 3 SE +/- 3.77, N = 3 SE +/- 8.78, N = 4 SE +/- 0.39, N = 3 SE +/- 0.90, N = 3 1971.10 2156.87 2158.03 514.75 527.34 526.12 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 70K 140K 210K 280K 350K SE +/- 4646.93, N = 3 SE +/- 4452.90, N = 3 SE +/- 1598.66, N = 3 SE +/- 246.03, N = 3 SE +/- 379.04, N = 3 SE +/- 33.79, N = 3 302950 305665 297144 114311 114227 113608 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 300 600 900 1200 1500 SE +/- 18.11, N = 3 SE +/- 17.58, N = 3 SE +/- 6.24, N = 3 SE +/- 0.98, N = 3 SE +/- 1.52, N = 3 SE +/- 0.14, N = 3 1183.53 1193.80 1160.83 457.26 456.92 454.44 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 1252 1306 1306 250 250 250 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Sequential Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 600 1200 1800 2400 3000 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.72, N = 3 SE +/- 0.05, N = 3 SE +/- 0.25, N = 3 2511.77 2618.20 2618.67 519.38 518.48 518.49 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 60K 120K 180K 240K 300K SE +/- 2932.09, N = 3 SE +/- 2976.09, N = 3 SE +/- 4419.58, N = 6 SE +/- 484.83, N = 3 SE +/- 451.27, N = 3 SE +/- 444.91, N = 3 271273 268542 268041 84641 85492 85249 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 200 400 600 800 1000 SE +/- 11.43, N = 3 SE +/- 11.31, N = 3 SE +/- 17.28, N = 6 SE +/- 1.94, N = 3 SE +/- 1.80, N = 3 SE +/- 1.78, N = 3 1059.73 1048.77 1046.93 338.58 341.98 341.01 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 200 400 600 800 1000 SE +/- 0.67, N = 3 SE +/- 1.67, N = 3 SE +/- 1.53, N = 3 SE +/- 0.58, N = 3 SE +/- 1.20, N = 3 SE +/- 1.20, N = 3 983 1076 1075 252 246 248 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Random Write - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 500 1000 1500 2000 2500 SE +/- 1.43, N = 3 SE +/- 3.18, N = 3 SE +/- 3.16, N = 3 SE +/- 0.94, N = 3 SE +/- 2.16, N = 3 SE +/- 2.04, N = 3 1972.70 2158.27 2157.60 523.89 512.18 514.67 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 60K 120K 180K 240K 300K SE +/- 4402.78, N = 3 SE +/- 591.14, N = 3 SE +/- 3375.56, N = 3 SE +/- 1071.43, N = 3 SE +/- 504.90, N = 3 SE +/- 309.68, N = 3 301037 288951 298428 95782 97043 96194 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 4KB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 300 600 900 1200 1500 SE +/- 17.24, N = 3 SE +/- 1.94, N = 3 SE +/- 13.29, N = 3 SE +/- 4.29, N = 3 SE +/- 2.02, N = 3 SE +/- 1.24, N = 3 1176.20 1128.57 1165.97 383.14 388.19 384.79 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org IOPS, More Is Better Flexible IO Tester 2.1.13 Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 300 600 900 1200 1500 SE +/- 11.00, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 1251 1315 1304 262 262 262 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Flexible IO Tester Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory EPYC 7601 2 x Xeon Gold 6138 OpenBenchmarking.org MB/s, More Is Better Flexible IO Tester 2.1.13 Type: Random Read - IO Engine: Libaio - Buffered: No - Direct: Yes - Block Size: 2MB - Disk Target: Default Test Directory noretpoline Retpoline + GCC Retpoline no retpoline 600 1200 1800 2400 3000 SE +/- 0.24, N = 3 SE +/- 22.27, N = 3 SE +/- 0.45, N = 3 SE +/- 1.17, N = 3 SE +/- 0.10, N = 3 SE +/- 0.14, N = 3 2509.83 2637.27 2615.60 543.05 543.37 543.18 1. (CC) gcc options: -rdynamic -std=gnu99 -O3 -ffast-math -include -lrt -laio -lz -lm -lpthread -ldl
Phoronix Test Suite v10.8.4