Java AMD EPYC Tuning Performance

Java workload tuning on AMD EPYC Turin. Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2411302-NE-JAVAAMDEP26
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Sensor Monitoring

Show Accumulated Sensor Monitoring Data For Displayed Results
Generate Power Efficiency / Performance Per Watt Results

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Stock
November 29 2024
  1 Hour, 28 Minutes
Java Throughput
November 30 2024
  2 Hours, 10 Minutes
Java Latency
November 30 2024
  1 Hour, 49 Minutes
Power Efficiency
November 30 2024
  2 Hours, 1 Minute
Invert Behavior (Only Show Selected Data)
  1 Hour, 52 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Java AMD EPYC Tuning PerformanceOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.12.0-rc7-linux-pm-next-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x768ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionJava AMD EPYC Tuning Performance BenchmarksSystem Logs- Transparent Huge Pages: madvise- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116 - OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

StockJava ThroughputJava LatencyPower EfficiencyResult OverviewPhoronix Test Suite100%113%126%140%153%Sunflow Rendering SystemApache CassandraRenaissanceDaCapo BenchmarkJava JMH

Java AMD EPYC Tuning Performancecassandra: Writesdacapobench: Eclipsedacapobench: Avrora AVR Simulation Frameworkdacapobench: Batik SVG Toolkitdacapobench: FOP Print Formatterdacapobench: Apache Tomcatdacapobench: Apache Xalan XSLTdacapobench: Apache Lucene Search Indexdacapobench: Apache Lucene Search Enginedacapobench: H2 Database Enginedacapobench: Jythondacapobench: Tradebeansdacapobench: Tradesoapdacapobench: BioJava Biological Data Frameworkdacapobench: GraphChidacapobench: jMonkeyEnginedacapobench: Apache Kafkadacapobench: Spring Bootdacapobench: Zxing 1D/2D Barcode Image Processingjava-jmh: Throughputrenaissance: Apache Spark Bayesrenaissance: Apache Spark PageRankrenaissance: Savina Reactors.IOrenaissance: Akka Unbalanced Cobwebbed Treerenaissance: Rand Forestrenaissance: In-Memory Database Shootoutrenaissance: Scala Dottyrenaissance: Finagle HTTP Requestsrenaissance: Genetic Algorithm Using Jenetics + Futuresrenaissance: ALS Movie Lensrenaissance: Gaussian Mixture Modelsunflow: Global Illumination + Image SynthesisStockJava ThroughputJava LatencyPower Efficiency46475363152291925338103773722464140206737645064300140472094679350602641558312830489306.03151.92065.35172.110860.9383.54228.7448.82713.0894.015721.03254.30.24245962463432259925340106974222383959213537644513274140462179679350702636543313156984419.28152.82101.15166.810738.8391.83851.9458.82741.8777.515592.63182.40.25949189662582272933344105774122564053206237604413283840252126679350632661544314639272675.16152.02083.55081.410956.6382.93924.3457.52792.9903.615599.63194.40.302356428947630301469564136897534445072254748895240364653013440685350863311602315423179604.61193.62268.59954.213573.5606.65263.2655.23052.31291.720553.93925.30.370OpenBenchmarking.org

Apache Cassandra

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 5.0Test: WritesJava LatencyJava ThroughputPower EfficiencyStock110K220K330K440K550KSE +/- 2078.80, N = 3SE +/- 784.82, N = 3SE +/- 5044.93, N = 3SE +/- 3783.12, N = 3491896459624356428464753

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: EclipseJava LatencyJava ThroughputPower EfficiencyStock2K4K6K8K10KSE +/- 35.23, N = 3SE +/- 7.97, N = 3SE +/- 72.63, N = 10SE +/- 38.05, N = 36258634394766315

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Avrora AVR Simulation FrameworkJava LatencyJava ThroughputPower EfficiencyStock7001400210028003500SE +/- 14.51, N = 15SE +/- 17.33, N = 15SE +/- 27.41, N = 5SE +/- 23.08, N = 62272225930302291

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Batik SVG ToolkitJava LatencyJava ThroughputPower EfficiencyStock30060090012001500SE +/- 4.52, N = 8SE +/- 2.28, N = 8SE +/- 23.97, N = 12SE +/- 3.54, N = 89339251469925

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: FOP Print FormatterJava LatencyJava ThroughputPower EfficiencyStock120240360480600SE +/- 2.24, N = 14SE +/- 1.32, N = 10SE +/- 6.80, N = 15SE +/- 1.47, N = 10344340564338

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache TomcatJava LatencyJava ThroughputPower EfficiencyStock30060090012001500SE +/- 3.16, N = 7SE +/- 8.41, N = 7SE +/- 7.83, N = 6SE +/- 1.76, N = 71057106913681037

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Xalan XSLTJava LatencyJava ThroughputPower EfficiencyStock2004006008001000SE +/- 4.42, N = 9SE +/- 4.52, N = 9SE +/- 7.58, N = 8SE +/- 3.95, N = 10741742975737

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Lucene Search IndexJava LatencyJava ThroughputPower EfficiencyStock7001400210028003500SE +/- 1.53, N = 3SE +/- 15.06, N = 3SE +/- 68.10, N = 15SE +/- 18.03, N = 32256223834442246

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Lucene Search EngineJava LatencyJava ThroughputPower EfficiencyStock11002200330044005500SE +/- 48.20, N = 4SE +/- 44.71, N = 15SE +/- 61.68, N = 3SE +/- 19.75, N = 44053395950724140

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: H2 Database EngineJava LatencyJava ThroughputPower EfficiencyStock5001000150020002500SE +/- 25.05, N = 4SE +/- 11.05, N = 3SE +/- 27.68, N = 5SE +/- 22.63, N = 52062213525472067

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: JythonJava LatencyJava ThroughputPower EfficiencyStock10002000300040005000SE +/- 13.83, N = 4SE +/- 7.80, N = 4SE +/- 92.77, N = 12SE +/- 3.22, N = 43760376448893764

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: TradebeansJava LatencyJava ThroughputPower EfficiencyStock11002200330044005500SE +/- 134.49, N = 12SE +/- 94.53, N = 15SE +/- 55.56, N = 15SE +/- 96.06, N = 154413451352405064

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: TradesoapJava LatencyJava ThroughputPower EfficiencyStock8001600240032004000SE +/- 25.31, N = 15SE +/- 42.48, N = 15SE +/- 26.21, N = 15SE +/- 26.56, N = 42838274136463001

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: BioJava Biological Data FrameworkJava LatencyJava ThroughputPower EfficiencyStock11002200330044005500SE +/- 14.73, N = 4SE +/- 12.85, N = 4SE +/- 84.18, N = 15SE +/- 7.35, N = 44025404653014047

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: GraphChiJava LatencyJava ThroughputPower EfficiencyStock7001400210028003500SE +/- 14.98, N = 5SE +/- 22.52, N = 5SE +/- 23.13, N = 3SE +/- 8.25, N = 52126217934402094

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: jMonkeyEngineJava LatencyJava ThroughputPower EfficiencyStock15003000450060007500SE +/- 0.33, N = 3SE +/- 1.33, N = 3SE +/- 1.15, N = 3SE +/- 1.00, N = 36793679368536793

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache KafkaJava LatencyJava ThroughputPower EfficiencyStock11002200330044005500SE +/- 5.84, N = 3SE +/- 5.51, N = 3SE +/- 5.81, N = 3SE +/- 0.58, N = 35063507050865060

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Spring BootJava LatencyJava ThroughputPower EfficiencyStock7001400210028003500SE +/- 27.73, N = 4SE +/- 26.57, N = 6SE +/- 18.15, N = 4SE +/- 26.53, N = 52661263633112641

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Zxing 1D/2D Barcode Image ProcessingJava LatencyJava ThroughputPower EfficiencyStock130260390520650SE +/- 3.12, N = 10SE +/- 3.60, N = 14SE +/- 4.50, N = 15SE +/- 4.02, N = 10544543602558

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughputJava LatencyJava ThroughputPower EfficiencyStock70000M140000M210000M280000M350000M314639272675.16313156984419.28315423179604.61312830489306.03

Renaissance

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Apache Spark BayesJava LatencyJava ThroughputPower EfficiencyStock4080120160200SE +/- 1.40, N = 15SE +/- 1.64, N = 15SE +/- 1.40, N = 3SE +/- 0.82, N = 3152.0152.8193.6151.9MIN: 121.4 / MAX: 361.56MIN: 127.3 / MAX: 382.17MIN: 157.46 / MAX: 378.44MIN: 128.87 / MAX: 209.2

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Apache Spark PageRankJava LatencyJava ThroughputPower EfficiencyStock5001000150020002500SE +/- 18.35, N = 3SE +/- 15.93, N = 3SE +/- 19.39, N = 3SE +/- 8.00, N = 32083.52101.12268.52065.3MIN: 1424.26 / MAX: 2119.52MIN: 1452.23 / MAX: 2117.08MIN: 1959.01 / MAX: 2401.72MIN: 1460.75 / MAX: 2081.13

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Savina Reactors.IOJava LatencyJava ThroughputPower EfficiencyStock2K4K6K8K10KSE +/- 49.91, N = 3SE +/- 37.22, N = 3SE +/- 125.38, N = 3SE +/- 54.20, N = 55081.45166.89954.25172.1MIN: 5006.45 / MAX: 8171.58MIN: 5095.51 / MAX: 7816.47MIN: 7679.87 / MAX: 10841.86MIN: 4973.43 / MAX: 8208.11

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Akka Unbalanced Cobwebbed TreeJava LatencyJava ThroughputPower EfficiencyStock3K6K9K12K15KSE +/- 89.47, N = 3SE +/- 48.01, N = 3SE +/- 34.93, N = 3SE +/- 79.02, N = 310956.610738.813573.510860.9MIN: 10463.34 / MAX: 11370.49MIN: 10253.41 / MAX: 10941.76MIN: 12956.64 / MAX: 13724.14MIN: 10242.4 / MAX: 11077.04

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Random ForestJava LatencyJava ThroughputPower EfficiencyStock130260390520650SE +/- 2.98, N = 10SE +/- 3.80, N = 15SE +/- 4.36, N = 15SE +/- 4.43, N = 3382.9391.8606.6383.5MIN: 339.12 / MAX: 447.53MIN: 342.67 / MAX: 493.98MIN: 536.2 / MAX: 741MIN: 347.9 / MAX: 457.62

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: In-Memory Database ShootoutJava LatencyJava ThroughputPower EfficiencyStock11002200330044005500SE +/- 19.30, N = 3SE +/- 8.42, N = 3SE +/- 65.42, N = 3SE +/- 7.53, N = 33924.33851.95263.24228.7MIN: 3683.75 / MAX: 4557.34MIN: 3402.36 / MAX: 4284.31MIN: 4666.19 / MAX: 6061.56MIN: 4037.24 / MAX: 4846.07

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Scala DottyJava LatencyJava ThroughputPower EfficiencyStock140280420560700SE +/- 4.40, N = 15SE +/- 6.53, N = 15SE +/- 20.51, N = 12SE +/- 8.07, N = 15457.5458.8655.2448.8MIN: 349.58 / MAX: 587.42MIN: 336.82 / MAX: 624.11MIN: 430.6 / MAX: 1104.36MIN: 353.63 / MAX: 587.33

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Finagle HTTP RequestsJava LatencyJava ThroughputPower EfficiencyStock7001400210028003500SE +/- 20.65, N = 3SE +/- 7.56, N = 3SE +/- 35.51, N = 3SE +/- 20.00, N = 32792.92741.83052.32713.0MIN: 2055.2 / MAX: 2818.67MIN: 2037.17 / MAX: 2755.41MIN: 2255.4 / MAX: 3115.04MIN: 1897.78 / MAX: 2748.55

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Genetic Algorithm Using Jenetics + FuturesJava LatencyJava ThroughputPower EfficiencyStock30060090012001500SE +/- 5.33, N = 3SE +/- 14.03, N = 15SE +/- 14.18, N = 3SE +/- 8.09, N = 3903.6777.51291.7894.0MIN: 765.31 / MAX: 912.11MIN: 667.17 / MAX: 1033.21MIN: 1207.53 / MAX: 1390.99MIN: 841.31 / MAX: 918.73

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: ALS Movie LensJava LatencyJava ThroughputPower EfficiencyStock4K8K12K16K20KSE +/- 58.79, N = 3SE +/- 85.32, N = 3SE +/- 133.62, N = 3SE +/- 91.51, N = 315599.615592.620553.915721.0MIN: 15144.58 / MAX: 15795.77MIN: 15012.81 / MAX: 15741.43MIN: 19144.07 / MAX: 20820.8MIN: 15022.39 / MAX: 15859.07

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.16Test: Gaussian Mixture ModelJava LatencyJava ThroughputPower EfficiencyStock8001600240032004000SE +/- 10.95, N = 3SE +/- 9.38, N = 3SE +/- 30.43, N = 3SE +/- 24.37, N = 33194.43182.43925.33254.3MIN: 2370.4 / MAX: 3212.45MIN: 2333.74 / MAX: 3195.1MIN: 3849.24 / MAX: 4555.85MIN: 2372.34 / MAX: 3324.36

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image SynthesisJava LatencyJava ThroughputPower EfficiencyStock0.08330.16660.24990.33320.4165SE +/- 0.003, N = 15SE +/- 0.005, N = 15SE +/- 0.007, N = 15SE +/- 0.003, N = 150.3020.2590.3700.242MIN: 0.18 / MAX: 0.63MIN: 0.15 / MAX: 0.66MIN: 0.21 / MAX: 1.01MIN: 0.14 / MAX: 0.67