20210524

Intel 0000 testing with a Gigabyte H81-D3 (F3 BIOS) and Gigabyte NVIDIA Quadro RTX 3000 6GB on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2105245-HA-20210524876
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Gigabyte NVIDIA Quadro RTX 3000
May 23 2021
  1 Hour, 21 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


20210524OpenBenchmarking.orgPhoronix Test SuiteIntel 0000 @ 3.00GHz (4 Cores)Gigabyte H81-D3 (F3 BIOS)Intel 4th Gen Core DRAM1 x 2048 MB DDR3-1333MT/s120GB ZTSSD-S11-120G-MGigabyte NVIDIA Quadro RTX 3000 6GBIntel Xeon E3-1200 v3/4thASUS PB277Realtek RTL8111/8168/8411Ubuntu 20.045.8.0-53-generic (x86_64)GNOME Shell 3.36.7X Server 1.20.9NVIDIA 465.19.014.6.0OpenCL 3.0 CUDA 11.3.551.2.168GCC 7.5.0 + CUDA 10.1ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution20210524 BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xffff0006 - Thermald 1.9.1 - GPU Compute Cores: 1920- Python 3.8.5- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Vulnerable: Clear buffers attempted no microcode; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Vulnerable: No microcode + tsx_async_abort: Vulnerable: Clear buffers attempted no microcode; SMT disabled

20210524vkpeak: fp32-scalarvkpeak: fp32-vec4vkpeak: fp64-scalarvkpeak: fp64-vec4vkpeak: int32-scalarvkpeak: int32-vec4realsr-ncnn: 4x - Norealsr-ncnn: 4x - Yeswaifu2x-ncnn: 2x - 3 - Nowaifu2x-ncnn: 2x - 3 - Yeshashcat: MD5hashcat: SHA1hashcat: 7-Ziphashcat: SHA-512hashcat: TrueCrypt RIPEMD160 + XTSmixbench: OpenCL - Integermixbench: NVIDIA CUDA - Integermixbench: OpenCL - Double Precisionmixbench: OpenCL - Single Precisionmixbench: NVIDIA CUDA - Half Precisionmixbench: NVIDIA CUDA - Double Precisionmixbench: NVIDIA CUDA - Single Precisionshoc: OpenCL - S3Dshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writenamd-cuda: ATPase Simulation - 327,506 Atomsbetsy: ETC1 - Highestbetsy: ETC2 RGB - Highestvkresample: 2x - DoubleGigabyte NVIDIA Quadro RTX 3000152.62197.3249.1855.4030.9131.6318.830115.7352.5427.5531951796666765950000003087679050333332422337212.736811.79193.37220.6912462.01193.177240.77129.7046.1162662.38512.7735265.3311970.137157.076.73946.70001002.710241.5297.5245.40.7461810.72212.476358.249OpenBenchmarking.org

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarGigabyte NVIDIA Quadro RTX 3000306090120150SE +/- 0.02, N = 3152.62

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-vec4Gigabyte NVIDIA Quadro RTX 30004080120160200SE +/- 0.00, N = 3197.32

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-scalarGigabyte NVIDIA Quadro RTX 30001122334455SE +/- 0.00, N = 349.18

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-vec4Gigabyte NVIDIA Quadro RTX 30001224364860SE +/- 0.01, N = 355.40

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-scalarGigabyte NVIDIA Quadro RTX 3000714212835SE +/- 0.00, N = 330.91

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-vec4Gigabyte NVIDIA Quadro RTX 3000714212835SE +/- 0.00, N = 331.63

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoGigabyte NVIDIA Quadro RTX 3000510152025SE +/- 0.18, N = 318.83

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesGigabyte NVIDIA Quadro RTX 3000306090120150SE +/- 0.04, N = 3115.74

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: NoGigabyte NVIDIA Quadro RTX 30000.5721.1441.7162.2882.86SE +/- 0.019, N = 32.542

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesGigabyte NVIDIA Quadro RTX 3000246810SE +/- 0.014, N = 37.553

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: MD5Gigabyte NVIDIA Quadro RTX 30004000M8000M12000M16000M20000MSE +/- 36703420.97, N = 319517966667

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA1Gigabyte NVIDIA Quadro RTX 30001400M2800M4200M5600M7000MSE +/- 3329164.06, N = 36595000000

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: 7-ZipGigabyte NVIDIA Quadro RTX 300070K140K210K280K350KSE +/- 145.30, N = 3308767

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: SHA-512Gigabyte NVIDIA Quadro RTX 3000200M400M600M800M1000MSE +/- 1086789.36, N = 3905033333

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.1.1Benchmark: TrueCrypt RIPEMD160 + XTSGigabyte NVIDIA Quadro RTX 300050K100K150K200K250KSE +/- 218.58, N = 3242233

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: IntegerGigabyte NVIDIA Quadro RTX 300015003000450060007500SE +/- 5.86, N = 37212.731. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: IntegerGigabyte NVIDIA Quadro RTX 300015003000450060007500SE +/- 6.13, N = 36811.791. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: Double PrecisionGigabyte NVIDIA Quadro RTX 30004080120160200SE +/- 0.00, N = 3193.31. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: OpenCL - Benchmark: Single PrecisionGigabyte NVIDIA Quadro RTX 300015003000450060007500SE +/- 1.90, N = 37220.691. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Half PrecisionGigabyte NVIDIA Quadro RTX 30003K6K9K12K15KSE +/- 33.00, N = 312462.011. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Double PrecisionGigabyte NVIDIA Quadro RTX 30004080120160200SE +/- 0.13, N = 3193.171. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2020-06-23Backend: NVIDIA CUDA - Benchmark: Single PrecisionGigabyte NVIDIA Quadro RTX 300016003200480064008000SE +/- 0.97, N = 37240.771. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DGigabyte NVIDIA Quadro RTX 3000306090120150SE +/- 0.03, N = 3129.701. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadGigabyte NVIDIA Quadro RTX 3000246810SE +/- 0.0027, N = 36.11621. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPGigabyte NVIDIA Quadro RTX 3000140280420560700SE +/- 28.89, N = 12662.391. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashGigabyte NVIDIA Quadro RTX 30003691215SE +/- 0.01, N = 312.771. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionGigabyte NVIDIA Quadro RTX 300060120180240300SE +/- 0.08, N = 3265.331. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NGigabyte NVIDIA Quadro RTX 3000400800120016002000SE +/- 4.48, N = 31970.131. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsGigabyte NVIDIA Quadro RTX 300015003000450060007500SE +/- 15.17, N = 37157.071. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadGigabyte NVIDIA Quadro RTX 3000246810SE +/- 0.0000, N = 36.73941. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackGigabyte NVIDIA Quadro RTX 3000246810SE +/- 0.0000, N = 36.70001. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthGigabyte NVIDIA Quadro RTX 30002004006008001000SE +/- 4.12, N = 31002.711. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGigabyte NVIDIA Quadro RTX 300050100150200250SE +/- 0.12, N = 3241.51. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGigabyte NVIDIA Quadro RTX 300060120180240300SE +/- 0.15, N = 3297.51. (CC) gcc options: -O2 -flto -lOpenCL

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGigabyte NVIDIA Quadro RTX 300050100150200250SE +/- 0.82, N = 3245.41. (CC) gcc options: -O2 -flto -lOpenCL

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsGigabyte NVIDIA Quadro RTX 30000.16790.33580.50370.67160.8395SE +/- 0.00755, N = 30.74618

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: HighestGigabyte NVIDIA Quadro RTX 30003691215SE +/- 1.23, N = 1510.721. (CXX) g++ options: -O3 -O2 -lpthread -ldl

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: HighestGigabyte NVIDIA Quadro RTX 30003691215SE +/- 0.02, N = 312.481. (CXX) g++ options: -O3 -O2 -lpthread -ldl

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleGigabyte NVIDIA Quadro RTX 300080160240320400SE +/- 0.30, N = 3358.251. (CXX) g++ options: -O3 -pthread