Java AMD EPYC Tuning Performance

Java workload tuning on AMD EPYC Turin. Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2411302-NE-JAVAAMDEP26
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Sensor Monitoring

Show Accumulated Sensor Monitoring Data For Displayed Results
Generate Power Efficiency / Performance Per Watt Results

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Stock
November 29 2024
  1 Hour, 28 Minutes
Java Throughput
November 30 2024
  2 Hours, 10 Minutes
Java Latency
November 30 2024
  1 Hour, 49 Minutes
Power Efficiency
November 30 2024
  2 Hours, 1 Minute
Invert Behavior (Only Show Selected Data)
  1 Hour, 52 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Java AMD EPYC Tuning PerformanceOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.12.0-rc7-linux-pm-next-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x768ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionJava AMD EPYC Tuning Performance BenchmarksSystem Logs- Transparent Huge Pages: madvise- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116 - OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

StockJava ThroughputJava LatencyPower EfficiencyResult OverviewPhoronix Test Suite100%113%126%140%153%Sunflow Rendering SystemApache CassandraRenaissanceDaCapo BenchmarkJava JMH

Java AMD EPYC Tuning Performancedacapobench: Eclipsedacapobench: Avrora AVR Simulation Frameworkdacapobench: Batik SVG Toolkitdacapobench: FOP Print Formatterdacapobench: Apache Tomcatdacapobench: Apache Xalan XSLTdacapobench: Apache Lucene Search Indexdacapobench: Apache Lucene Search Enginedacapobench: H2 Database Enginedacapobench: Jythondacapobench: Tradebeansdacapobench: Tradesoapdacapobench: BioJava Biological Data Frameworkdacapobench: GraphChidacapobench: jMonkeyEnginedacapobench: Apache Kafkadacapobench: Spring Bootdacapobench: Zxing 1D/2D Barcode Image Processingrenaissance: Apache Spark Bayesrenaissance: Apache Spark PageRankrenaissance: Savina Reactors.IOrenaissance: Akka Unbalanced Cobwebbed Treerenaissance: Rand Forestrenaissance: In-Memory Database Shootoutrenaissance: Scala Dottyrenaissance: Finagle HTTP Requestsrenaissance: Genetic Algorithm Using Jenetics + Futuresrenaissance: ALS Movie Lensrenaissance: Gaussian Mixture Modeljava-jmh: Throughputsunflow: Global Illumination + Image Synthesiscassandra: WritesStockJava ThroughputJava LatencyPower Efficiency63152291925338103773722464140206737645064300140472094679350602641558151.92065.35172.110860.9383.54228.7448.82713.0894.015721.03254.3312830489306.030.24246475363432259925340106974222383959213537644513274140462179679350702636543152.82101.15166.810738.8391.83851.9458.82741.8777.515592.63182.4313156984419.280.25945962462582272933344105774122564053206237604413283840252126679350632661544152.02083.55081.410956.6382.93924.3457.52792.9903.615599.63194.4314639272675.160.302491896947630301469564136897534445072254748895240364653013440685350863311602193.62268.59954.213573.5606.65263.2655.23052.31291.720553.93925.3315423179604.610.370356428OpenBenchmarking.org

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: EclipseJava LatencyStockJava ThroughputPower Efficiency2K4K6K8K10KSE +/- 35.23, N = 3SE +/- 38.05, N = 3SE +/- 7.97, N = 3SE +/- 72.63, N = 106258631563439476

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Avrora AVR Simulation FrameworkJava ThroughputJava LatencyStockPower Efficiency7001400210028003500SE +/- 17.33, N = 15SE +/- 14.51, N = 15SE +/- 23.08, N = 6SE +/- 27.41, N = 52259227222913030

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Batik SVG ToolkitStockJava ThroughputJava LatencyPower Efficiency30060090012001500SE +/- 3.54, N = 8SE +/- 2.28, N = 8SE +/- 4.52, N = 8SE +/- 23.97, N = 129259259331469

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: FOP Print FormatterStockJava ThroughputJava LatencyPower Efficiency120240360480600SE +/- 1.47, N = 10SE +/- 1.32, N = 10SE +/- 2.24, N = 14SE +/- 6.80, N = 15338340344564

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache TomcatStockJava LatencyJava ThroughputPower Efficiency30060090012001500SE +/- 1.76, N = 7SE +/- 3.16, N = 7SE +/- 8.41, N = 7SE +/- 7.83, N = 61037105710691368

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Xalan XSLTStockJava LatencyJava ThroughputPower Efficiency2004006008001000SE +/- 3.95, N = 10SE +/- 4.42, N = 9SE +/- 4.52, N = 9SE +/- 7.58, N = 8737741742975

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Lucene Search IndexJava ThroughputStockJava LatencyPower Efficiency7001400210028003500SE +/- 15.06, N = 3SE +/- 18.03, N = 3SE +/- 1.53, N = 3SE +/- 68.10, N = 152238224622563444

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Lucene Search EngineJava ThroughputJava LatencyStockPower Efficiency11002200330044005500SE +/- 44.71, N = 15SE +/- 48.20, N = 4SE +/- 19.75, N = 4SE +/- 61.68, N = 33959405341405072

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: H2 Database EngineJava LatencyStockJava ThroughputPower Efficiency5001000150020002500SE +/- 25.05, N = 4SE +/- 22.63, N = 5SE +/- 11.05, N = 3SE +/- 27.68, N = 52062206721352547

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: JythonJava LatencyStockJava ThroughputPower Efficiency10002000300040005000SE +/- 13.83, N = 4SE +/- 3.22, N = 4SE +/- 7.80, N = 4SE +/- 92.77, N = 123760376437644889

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: TradebeansJava LatencyJava ThroughputStockPower Efficiency11002200330044005500SE +/- 134.49, N = 12SE +/- 94.53, N = 15SE +/- 96.06, N = 15SE +/- 55.56, N = 154413451350645240

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: TradesoapJava ThroughputJava LatencyStockPower Efficiency8001600240032004000SE +/- 42.48, N = 15SE +/- 25.31, N = 15SE +/- 26.56, N = 4SE +/- 26.21, N = 152741283830013646

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: BioJava Biological Data FrameworkJava LatencyJava ThroughputStockPower Efficiency11002200330044005500SE +/- 14.73, N = 4SE +/- 12.85, N = 4SE +/- 7.35, N = 4SE +/- 84.18, N = 154025404640475301

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: GraphChiStockJava LatencyJava ThroughputPower Efficiency7001400210028003500SE +/- 8.25, N = 5SE +/- 14.98, N = 5SE +/- 22.52, N = 5SE +/- 23.13, N = 32094212621793440

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: jMonkeyEngineStockJava ThroughputJava LatencyPower Efficiency15003000450060007500SE +/- 1.00, N = 3SE +/- 1.33, N = 3SE +/- 0.33, N = 3SE +/- 1.15, N = 36793679367936853

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache KafkaStockJava LatencyJava ThroughputPower Efficiency11002200330044005500SE +/- 0.58, N = 3SE +/- 5.84, N = 3SE +/- 5.51, N = 3SE +/- 5.81, N = 35060506350705086

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Spring BootJava ThroughputStockJava LatencyPower Efficiency7001400210028003500SE +/- 26.57, N = 6SE +/- 26.53, N = 5SE +/- 27.73, N = 4SE +/- 18.15, N = 42636264126613311

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Zxing 1D/2D Barcode Image ProcessingJava ThroughputJava LatencyStockPower Efficiency130260390520650SE +/- 3.60, N = 14SE +/- 3.12, N = 10SE +/- 4.02, N = 10SE +/- 4.50, N = 15543544558602

Renaissance

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Apache Spark BayesStockJava LatencyJava ThroughputPower Efficiency4080120160200SE +/- 0.82, N = 3SE +/- 1.40, N = 15SE +/- 1.64, N = 15SE +/- 1.40, N = 3151.9152.0152.8193.6MIN: 128.87 / MAX: 209.2MIN: 121.4 / MAX: 361.56MIN: 127.3 / MAX: 382.17MIN: 157.46 / MAX: 378.44

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Apache Spark PageRankStockJava LatencyJava ThroughputPower Efficiency5001000150020002500SE +/- 8.00, N = 3SE +/- 18.35, N = 3SE +/- 15.93, N = 3SE +/- 19.39, N = 32065.32083.52101.12268.5MIN: 1460.75 / MAX: 2081.13MIN: 1424.26 / MAX: 2119.52MIN: 1452.23 / MAX: 2117.08MIN: 1959.01 / MAX: 2401.72

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Savina Reactors.IOJava LatencyJava ThroughputStockPower Efficiency2K4K6K8K10KSE +/- 49.91, N = 3SE +/- 37.22, N = 3SE +/- 54.20, N = 5SE +/- 125.38, N = 35081.45166.85172.19954.2MIN: 5006.45 / MAX: 8171.58MIN: 5095.51 / MAX: 7816.47MIN: 4973.43 / MAX: 8208.11MIN: 7679.87 / MAX: 10841.86

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Akka Unbalanced Cobwebbed TreeJava ThroughputStockJava LatencyPower Efficiency3K6K9K12K15KSE +/- 48.01, N = 3SE +/- 79.02, N = 3SE +/- 89.47, N = 3SE +/- 34.93, N = 310738.810860.910956.613573.5MIN: 10253.41 / MAX: 10941.76MIN: 10242.4 / MAX: 11077.04MIN: 10463.34 / MAX: 11370.49MIN: 12956.64 / MAX: 13724.14

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Random ForestJava LatencyStockJava ThroughputPower Efficiency130260390520650SE +/- 2.98, N = 10SE +/- 4.43, N = 3SE +/- 3.80, N = 15SE +/- 4.36, N = 15382.9383.5391.8606.6MIN: 339.12 / MAX: 447.53MIN: 347.9 / MAX: 457.62MIN: 342.67 / MAX: 493.98MIN: 536.2 / MAX: 741

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: In-Memory Database ShootoutJava ThroughputJava LatencyStockPower Efficiency11002200330044005500SE +/- 8.42, N = 3SE +/- 19.30, N = 3SE +/- 7.53, N = 3SE +/- 65.42, N = 33851.93924.34228.75263.2MIN: 3402.36 / MAX: 4284.31MIN: 3683.75 / MAX: 4557.34MIN: 4037.24 / MAX: 4846.07MIN: 4666.19 / MAX: 6061.56

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Scala DottyStockJava LatencyJava ThroughputPower Efficiency140280420560700SE +/- 8.07, N = 15SE +/- 4.40, N = 15SE +/- 6.53, N = 15SE +/- 20.51, N = 12448.8457.5458.8655.2MIN: 353.63 / MAX: 587.33MIN: 349.58 / MAX: 587.42MIN: 336.82 / MAX: 624.11MIN: 430.6 / MAX: 1104.36

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Finagle HTTP RequestsStockJava ThroughputJava LatencyPower Efficiency7001400210028003500SE +/- 20.00, N = 3SE +/- 7.56, N = 3SE +/- 20.65, N = 3SE +/- 35.51, N = 32713.02741.82792.93052.3MIN: 1897.78 / MAX: 2748.55MIN: 2037.17 / MAX: 2755.41MIN: 2055.2 / MAX: 2818.67MIN: 2255.4 / MAX: 3115.04

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Genetic Algorithm Using Jenetics + FuturesJava ThroughputStockJava LatencyPower Efficiency30060090012001500SE +/- 14.03, N = 15SE +/- 8.09, N = 3SE +/- 5.33, N = 3SE +/- 14.18, N = 3777.5894.0903.61291.7MIN: 667.17 / MAX: 1033.21MIN: 841.31 / MAX: 918.73MIN: 765.31 / MAX: 912.11MIN: 1207.53 / MAX: 1390.99

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: ALS Movie LensJava ThroughputJava LatencyStockPower Efficiency4K8K12K16K20KSE +/- 85.32, N = 3SE +/- 58.79, N = 3SE +/- 91.51, N = 3SE +/- 133.62, N = 315592.615599.615721.020553.9MIN: 15012.81 / MAX: 15741.43MIN: 15144.58 / MAX: 15795.77MIN: 15022.39 / MAX: 15859.07MIN: 19144.07 / MAX: 20820.8

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Gaussian Mixture ModelJava ThroughputJava LatencyStockPower Efficiency8001600240032004000SE +/- 9.38, N = 3SE +/- 10.95, N = 3SE +/- 24.37, N = 3SE +/- 30.43, N = 33182.43194.43254.33925.3MIN: 2333.74 / MAX: 3195.1MIN: 2370.4 / MAX: 3212.45MIN: 2372.34 / MAX: 3324.36MIN: 3849.24 / MAX: 4555.85

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughputPower EfficiencyJava LatencyJava ThroughputStock70000M140000M210000M280000M350000M315423179604.61314639272675.16313156984419.28312830489306.03

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image SynthesisStockJava ThroughputJava LatencyPower Efficiency0.08330.16660.24990.33320.4165SE +/- 0.003, N = 15SE +/- 0.005, N = 15SE +/- 0.003, N = 15SE +/- 0.007, N = 150.2420.2590.3020.370MIN: 0.14 / MAX: 0.67MIN: 0.15 / MAX: 0.66MIN: 0.18 / MAX: 0.63MIN: 0.21 / MAX: 1.01

Apache Cassandra

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 5.0Test: WritesJava LatencyStockJava ThroughputPower Efficiency110K220K330K440K550KSE +/- 2078.80, N = 3SE +/- 3783.12, N = 3SE +/- 784.82, N = 3SE +/- 5044.93, N = 3491896464753459624356428