AMD EPYC 7763 Cooling Performance

AMD EPYC 7763 64-Core CPU benchmarks by Michael Larabel evaluating some heatsink fans in a 4U server.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2104096-IB-HEATSINK430

Run Management

Result Identifier | Date Run | Test Duration
Noctua NH-U9 TR4-SP3 | April 08 2021 | 8 Hours, 12 Minutes
Dynatron A26 | April 09 2021 | 8 Hours, 45 Minutes
Dynatron A38 | April 09 2021 | 11 Hours, 17 Minutes
Average | | 9 Hours, 25 Minutes
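
The 9 Hours, 25 Minutes figure that closes the run table appears to be the mean of the three test durations. A minimal Python check of that reading follows; the `mean_duration` helper is hypothetical and only illustrates the arithmetic.

```python
from datetime import timedelta

# Test durations as listed in the run table.
durations = {
    "Noctua NH-U9 TR4-SP3": timedelta(hours=8, minutes=12),
    "Dynatron A26": timedelta(hours=8, minutes=45),
    "Dynatron A38": timedelta(hours=11, minutes=17),
}


def mean_duration(values):
    """Arithmetic mean of a collection of timedeltas (hypothetical helper)."""
    values = list(values)
    return sum(values, timedelta()) / len(values)


average = mean_duration(durations.values())
# Round to the nearest whole minute, then split into hours and minutes.
total_minutes = round(average.total_seconds() / 60)
hours, minutes = divmod(total_minutes, 60)
print(f"{hours} Hours, {minutes} Minutes")
```

Rounded to the nearest minute, the mean matches the listed figure, which supports reading it as an average rather than a fourth run.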



AMD EPYC 7763 Cooling Performance Benchmarks (OpenBenchmarking.org, Phoronix Test Suite)

Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)
Motherboard: Supermicro H12SSL-i v1.01 (2.0 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: llvmpipe
Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe
OS: Ubuntu 20.04
Kernel: 5.12.0-051200rc6daily20210408-generic (x86_64) 20210407
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
OpenGL: 3.3 Mesa 20.0.8 (LLVM 10.0.0 128 bits)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1024x768

System Logs
- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
- CPU Microcode: 0xa001119
- Python 3.8.2
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Result Overview (chart, Phoronix Test Suite): relative performance of the Noctua NH-U9 TR4-SP3, Dynatron A26, and Dynatron A38 on an axis running from 100% to 102%, covering Stockfish, Xcompact3d Incompact3d, Timed Erlang/OTP Compilation, Chaos Group V-RAY, ViennaCL, OpenSCAD, Mobile Neural Network, Timed Node.js Compilation, ASTC Encoder, Timed GDB GNU Debugger Compilation, LuaRadio, GROMACS, IndigoBench, AOM AV1, Timed Apache Compilation, SVT-AV1, Timed Linux Kernel Compilation, NAMD, simdjson, srsLTE, GNU Radio, SVT-HEVC, Liquid-DSP, GNU GMP GMPbench, Timed Mesa Compilation, SVT-VP9, Blender, and oneDNN.
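
The Result Overview chart expresses each cooler's results as a percentage relative to a baseline. The exact normalization used by the Phoronix Test Suite is not stated on this page, so the Python sketch below is only one plausible reading: each test is scaled so that the slowest cooler scores 1.0 (inverting "Fewer Is Better" results), and the per-test scores are combined with a geometric mean. The three tests are a small sample taken from this result file; `relative_scores` and `geometric_mean` are hypothetical helper names.

```python
from math import prod

# Value order in every tuple: Noctua NH-U9 TR4-SP3, Dynatron A26, Dynatron A38.
COOLERS = ("Noctua NH-U9 TR4-SP3", "Dynatron A26", "Dynatron A38")

# test name -> (values, higher_is_better); numbers copied from this result file.
RESULTS = {
    "stockfish: Total Time": ((156656685, 160107651, 158512004), True),
    "build-erlang: Time To Compile": ((134.328, 132.497, 133.119), False),
    "indigobench: CPU - Supercar": ((24.190, 24.256, 24.360), True),
}


def relative_scores(values, higher_is_better):
    """Scale results so the slowest entry is 1.0 and larger means faster."""
    perf = list(values) if higher_is_better else [1.0 / v for v in values]
    worst = min(perf)
    return [p / worst for p in perf]


def geometric_mean(xs):
    """Geometric mean of positive numbers."""
    xs = list(xs)
    return prod(xs) ** (1.0 / len(xs))


per_test = [relative_scores(v, hib) for v, hib in RESULTS.values()]
# zip(*per_test) regroups the scores by cooler instead of by test.
overall = [geometric_mean(column) for column in zip(*per_test)]

for name, score in zip(COOLERS, overall):
    print(f"{name}: {score:.1%}")
```

A geometric mean is a common choice for combining ratios because it treats a 2% gain in one test and a 2% loss in another symmetrically; whether the chart above uses exactly this approach is an assumption.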

AMD EPYC 7763 Cooling Performance - result table

Test | Noctua NH-U9 TR4-SP3 | Dynatron A26 | Dynatron A38
indigobench: CPU - Supercar | 24.190 | 24.256 | 24.360
indigobench: CPU - Bedroom | 11.394 | 11.399 | 11.404
v-ray: CPU | 57912 | 58270 | 58504
blender: BMW27 - CPU-Only | 31.92 | 31.85 | 31.92
blender: Classroom - CPU-Only | 80.96 | 80.86 | 80.92
blender: Fishy Cat - CPU-Only | 45.98 | 45.80 | 46.01
blender: Pabellon Barcelona - CPU-Only | 93.59 | 93.72 | 93.72
blender: Barbershop - CPU-Only | 111.61 | 111.48 | 111.53
build-linux-kernel: Time To Compile | 26.930 | 26.877 | 26.844
build-gdb: Time To Compile | 98.814 | 99.326 | 99.086
build-apache: Time To Compile | 23.590 | 23.673 | 23.586
build-mesa: Time To Compile | 19.835 | 19.796 | 19.800
build-nodejs: Time To Compile | 110.706 | 111.318 | 110.941
build-erlang: Time To Compile | 134.328 | 132.497 | 133.119
aom-av1: Speed 9 Realtime - Bosphorus 1080p | 104.57 | 101.70 | 101.27
aom-av1: Speed 9 Realtime - Bosphorus 4K | 37.45 | 38.06 | 37.46
aom-av1: Speed 8 Realtime - Bosphorus 1080p | 86.13 | 86.56 | 87.63
aom-av1: Speed 8 Realtime - Bosphorus 4K | 34.1 | 34.29 | 34.73
aom-av1: Speed 6 Realtime - Bosphorus 1080p | 24.61 | 24.71 | 24.82
aom-av1: Speed 6 Realtime - Bosphorus 4K | 16.09 | 16.25 | 16.14
aom-av1: Speed 6 Two-Pass - Bosphorus 1080p | 21.31 | 21.47 | 21.46
aom-av1: Speed 6 Two-Pass - Bosphorus 4K | 9.15 | 9.16 | 9.35
aom-av1: Speed 4 Two-Pass - Bosphorus 1080p | 6.74 | 6.71 | 6.68
aom-av1: Speed 4 Two-Pass - Bosphorus 4K | 4.72 | 4.83 | 4.77
aom-av1: Speed 0 Two-Pass - Bosphorus 1080p | 0.5 | 0.50 | 0.50
aom-av1: Speed 0 Two-Pass - Bosphorus 4K | 0.2 | 0.2 | 0.2
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 345.24 | 347.78 | 346.82
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 471.60 | 471.09 | 470.69
svt-vp9: VMAF Optimized - Bosphorus 1080p | 468.76 | 468.59 | 467.96
svt-hevc: 1 - Bosphorus 1080p | 37.76 | 37.89 | 37.92
svt-hevc: 7 - Bosphorus 1080p | 324.42 | 323.60 | 322.43
svt-hevc: 10 - Bosphorus 1080p | 607.02 | 602.55 | 605.97
svt-av1: Enc Mode 8 - 1080p | 92.861 | 92.347 | 93.048
svt-av1: Enc Mode 4 - 1080p | 9.473 | 9.452 | 9.355
svt-av1: Enc Mode 0 - 1080p | 0.130 | 0.130 | 0.130
viennacl: CPU BLAS - sCOPY | 1035 | 1052 | 1044
viennacl: CPU BLAS - sAXPY | 627 | 640 | 640
viennacl: CPU BLAS - dCOPY | 1401 | 1409 | 1399
viennacl: CPU BLAS - dAXPY | 1222 | 1205 | 1204
viennacl: CPU BLAS - dDOT | 1115 | 1116 | 1112
viennacl: CPU BLAS - dGEMV-N | 88.6 | 82.6 | 78.9
viennacl: CPU BLAS - dGEMV-T | 793 | 781 | 779
viennacl: CPU BLAS - dGEMM-NN | 88.1 | 88.6 | 88.2
viennacl: CPU BLAS - dGEMM-NT | 86.1 | 86.6 | 86.3
viennacl: CPU BLAS - dGEMM-TN | 92.1 | 92.1 | 91.9
viennacl: CPU BLAS - dGEMM-TT | 89.7 | 89.9 | 89.9
viennacl: CPU BLAS - sDOT | 637 | 640 | 636
gmpbench: Total Time | 5098.8 | 5099.2 | 5089.1
openscad: Projector Mount Swivel | 100.902 | 100.723 | 101.045
openscad: Leonardo Phone Case Slim | 18.699 | 18.525 | 18.734
openscad: Pistol | 109.833 | 108.617 | 108.794
openscad: Retro Car | 19.063 | 18.886 | 18.927
openscad: Mini-ITX Case | 45.841 | 45.232 | 45.482
stockfish: Total Time | 156656685 | 160107651 | 158512004
incompact3d: input.i3d 129 Cells Per Direction | 5.15653072 | 5.15065159 | 5.15051767
incompact3d: input.i3d 193 Cells Per Direction | 22.5096487 | 22.3308512 | 22.7087644
incompact3d: X3D-benchmarking input.i3d | 625.812337 | 667.268494 | 627.667114
onednn: Convolution Batch Shapes Auto - f32 - CPU | 0.880047 | 0.879752 | 0.879480
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 1.65220 | 1.65671 | 1.67256
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 7.13608 | 7.17839 | 7.18002
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 0.605058 | 0.607655 | 0.603621
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 3.03820 | 3.02459 | 3.04352
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 0.781851 | 0.783464 | 0.781060
onednn: IP Shapes 1D - f32 - CPU | 1.17796 | 1.17863 | 1.18087
onednn: IP Shapes 1D - u8s8f32 - CPU | 1.17758 | 1.18685 | 1.17906
onednn: IP Shapes 3D - f32 - CPU | 3.66307 | 3.64941 | 3.61762
onednn: IP Shapes 3D - u8s8f32 - CPU | 0.663059 | 0.650624 | 0.640331
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.384929 | 0.381495 | 0.385207
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 0.720928 | 0.726789 | 0.721586
onednn: Recurrent Neural Network Training - f32 - CPU | 1373.91 | 1378.86 | 1400.19
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 1385.32 | 1369.98 | 1377.49
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 1370.82 | 1391.14 | 1381.77
onednn: Recurrent Neural Network Inference - f32 - CPU | 665.228 | 666.049 | 665.526
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 664.988 | 664.807 | 665.265
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 664.680 | 666.483 | 666.130
mnn: SqueezeNetV1.0 | 5.785 | 5.797 | 5.878
mnn: resnet-v2-50 | 22.130 | 22.397 | 22.269
mnn: MobileNetV2_224 | 3.757 | 3.789 | 3.780
mnn: mobilenet-v1-1.0 | 2.328 | 2.342 | 2.332
mnn: inception-v3 | 28.171 | 28.451 | 28.328
astcenc: Medium | 4.9176 | 4.9412 | 4.9424
astcenc: Thorough | 7.9925 | 8.0062 | 7.9898
astcenc: Exhaustive | 20.4428 | 20.6196 | 20.5591
simdjson: PartialTweets | 3.64 | 3.63 | 3.63
simdjson: LargeRand | 0.96 | 0.96 | 0.96
simdjson: Kostya | 2.83 | 2.83 | 2.83
simdjson: DistinctUserID | 4.01 | 4.00 | 3.98
srslte: OFDM_Test | 115633333 | 116233333 | 117866667
srslte: PHY_DL_Test | 257.4 | 257.1 | 255.9
srslte: PHY_DL_Test | 94.3 | 94.3 | 93.7
gnuradio: Five Back to Back FIR Filters | 560.9 | 567.7 | 560.1
gnuradio: Signal Source (Cosine) | 3322.2 | 3324.7 | 3303.6
gnuradio: FIR Filter | 639.0 | 641.6 | 642.0
gnuradio: IIR Filter | 608.6 | 604.7 | 607.0
gnuradio: FM Deemphasis Filter | 765.3 | 760.3 | 764.1
gnuradio: Hilbert Transform | 378.2 | 377.3 | 376.2
luaradio: Five Back to Back FIR Filters | 1101.6 | 1110.6 | 1097.4
luaradio: FM Deemphasis Filter | 344.7 | 344.2 | 343.4
luaradio: Hilbert Transform | 93.5 | 93.6 | 93.5
luaradio: Complex Phase | 591.8 | 591.6 | 591.0
liquid-dsp: 16 - 256 - 57 | 803310000 | 800190000 | 804146667
liquid-dsp: 32 - 256 - 57 | 1613766667 | 1614933333 | 1614566667
liquid-dsp: 64 - 256 - 57 | 2792400000 | 2782866667 | 2793733333
liquid-dsp: 128 - 256 - 57 | 3017933333 | 3028066667 | 3028133333
gromacs: water_GMX50_bare | 5.577 | 5.582 | 5.599
namd: ATPase Simulation - 327,506 Atoms | 0.38110 | 0.38164 | 0.38215

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better)
Noctua NH-U9 TR4-SP3: 24.19 (SE +/- 0.03, N = 3)
Dynatron A38: 24.36 (SE +/- 0.06, N = 3)
Dynatron A26: 24.26 (SE +/- 0.03, N = 3)

IndigoBench 4.4 - Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better)
Noctua NH-U9 TR4-SP3: 11.39 (SE +/- 0.03, N = 3)
Dynatron A38: 11.40 (SE +/- 0.01, N = 3)
Dynatron A26: 11.40 (SE +/- 0.03, N = 3)

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

Chaos Group V-RAY 5 - Mode: CPU (vsamples, More Is Better)
Noctua NH-U9 TR4-SP3: 57912 (SE +/- 265.36, N = 3)
Dynatron A38: 58504 (SE +/- 477.68, N = 3)
Dynatron A26: 58270 (SE +/- 814.01, N = 3)

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.92 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 31.92 (SE +/- 0.10, N = 3)
Dynatron A38: 31.92 (SE +/- 0.07, N = 3)
Dynatron A26: 31.85 (SE +/- 0.04, N = 3)

Blender 2.92 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 80.96 (SE +/- 0.08, N = 3)
Dynatron A38: 80.92 (SE +/- 0.08, N = 3)
Dynatron A26: 80.86 (SE +/- 0.07, N = 3)

Blender 2.92 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 45.98 (SE +/- 0.08, N = 3)
Dynatron A38: 46.01 (SE +/- 0.06, N = 3)
Dynatron A26: 45.80 (SE +/- 0.09, N = 3)

Blender 2.92 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 93.59 (SE +/- 0.06, N = 3)
Dynatron A38: 93.72 (SE +/- 0.01, N = 3)
Dynatron A26: 93.72 (SE +/- 0.08, N = 3)

Blender 2.92 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 111.61 (SE +/- 0.05, N = 3)
Dynatron A38: 111.53 (SE +/- 0.10, N = 3)
Dynatron A26: 111.48 (SE +/- 0.08, N = 3)

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.10.20 - Time To Compile (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 26.93 (SE +/- 0.28, N = 8)
Dynatron A38: 26.84 (SE +/- 0.26, N = 9)
Dynatron A26: 26.88 (SE +/- 0.26, N = 9)

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed GDB GNU Debugger Compilation 9.1 - Time To Compile (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 98.81 (SE +/- 0.12, N = 3)
Dynatron A38: 99.09 (SE +/- 0.07, N = 3)
Dynatron A26: 99.33 (SE +/- 0.12, N = 3)

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

Timed Apache Compilation 2.4.41 - Time To Compile (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 23.59 (SE +/- 0.01, N = 3)
Dynatron A38: 23.59 (SE +/- 0.01, N = 3)
Dynatron A26: 23.67 (SE +/- 0.02, N = 3)

Timed Mesa Compilation

This test profile times how long it takes to compile Mesa with Meson/Ninja. To minimize build dependencies and avoid versioning conflicts, this test builds only the core of Mesa, without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.

Timed Mesa Compilation 21.0 - Time To Compile (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 19.84 (SE +/- 0.09, N = 3)
Dynatron A38: 19.80 (SE +/- 0.04, N = 3)
Dynatron A26: 19.80 (SE +/- 0.06, N = 3)

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 15.11 - Time To Compile (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 110.71 (SE +/- 0.28, N = 3)
Dynatron A38: 110.94 (SE +/- 0.30, N = 3)
Dynatron A26: 111.32 (SE +/- 0.11, N = 3)

Timed Erlang/OTP Compilation

This test times how long it takes to compile Erlang/OTP. Erlang is a programming language and run-time for massively scalable soft real-time systems with high availability requirements. Learn more via the OpenBenchmarking.org test page.

Timed Erlang/OTP Compilation 23.2 - Time To Compile (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 134.33 (SE +/- 0.26, N = 3)
Dynatron A38: 133.12 (SE +/- 0.29, N = 3)
Dynatron A26: 132.50 (SE +/- 0.39, N = 3)

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google. Learn more via the OpenBenchmarking.org test page.

AOM AV1 3.0 - Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 104.57 (SE +/- 1.05, N = 7)
Dynatron A38: 101.27 (SE +/- 0.67, N = 6)
Dynatron A26: 101.70 (SE +/- 1.09, N = 6)

AOM AV1 3.0 - Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 37.45 (SE +/- 0.43, N = 3)
Dynatron A38: 37.46 (SE +/- 0.48, N = 5)
Dynatron A26: 38.06 (SE +/- 0.46, N = 6)

AOM AV1 3.0 - Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 86.13 (SE +/- 0.36, N = 6)
Dynatron A38: 87.63 (SE +/- 0.58, N = 6)
Dynatron A26: 86.56 (SE +/- 0.59, N = 6)

AOM AV1 3.0 - Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 34.10 (SE +/- 0.07, N = 3)
Dynatron A38: 34.73 (SE +/- 0.28, N = 3)
Dynatron A26: 34.29 (SE +/- 0.22, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 24.61 (SE +/- 0.05, N = 3)
Dynatron A38: 24.82 (SE +/- 0.10, N = 3)
Dynatron A26: 24.71 (SE +/- 0.18, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 16.09 (SE +/- 0.03, N = 3)
Dynatron A38: 16.14 (SE +/- 0.05, N = 3)
Dynatron A26: 16.25 (SE +/- 0.06, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 21.31 (SE +/- 0.04, N = 3)
Dynatron A38: 21.46 (SE +/- 0.03, N = 3)
Dynatron A26: 21.47 (SE +/- 0.06, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 9.15 (SE +/- 0.07, N = 3)
Dynatron A38: 9.35 (SE +/- 0.11, N = 6)
Dynatron A26: 9.16 (SE +/- 0.10, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 6.74 (SE +/- 0.02, N = 3)
Dynatron A38: 6.68 (SE +/- 0.00, N = 3)
Dynatron A26: 6.71 (SE +/- 0.01, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 4.72 (SE +/- 0.03, N = 3)
Dynatron A38: 4.77 (SE +/- 0.04, N = 3)
Dynatron A26: 4.83 (SE +/- 0.02, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 0.50 (SE +/- 0.00, N = 3)
Dynatron A38: 0.50 (SE +/- 0.00, N = 3)
Dynatron A26: 0.50 (SE +/- 0.00, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 0.2 (SE +/- 0.00, N = 3)
Dynatron A38: 0.2 (SE +/- 0.00, N = 3)
Dynatron A26: 0.2 (SE +/- 0.00, N = 3)

1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 345.24 (SE +/- 5.87, N = 15)
Dynatron A38: 346.82 (SE +/- 6.16, N = 15)
Dynatron A26: 347.78 (SE +/- 6.37, N = 15)

SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 471.60 (SE +/- 1.14, N = 10)
Dynatron A38: 470.69 (SE +/- 1.32, N = 10)
Dynatron A26: 471.09 (SE +/- 1.26, N = 10)

SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 468.76 (SE +/- 1.02, N = 10)
Dynatron A38: 467.96 (SE +/- 0.90, N = 10)
Dynatron A26: 468.59 (SE +/- 1.01, N = 10)

1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.5.0 - Tuning: 1 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 37.76 (SE +/- 0.05, N = 4)
Dynatron A38: 37.92 (SE +/- 0.10, N = 4)
Dynatron A26: 37.89 (SE +/- 0.12, N = 4)

SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 324.42 (SE +/- 1.48, N = 10)
Dynatron A38: 322.43 (SE +/- 0.82, N = 10)
Dynatron A26: 323.60 (SE +/- 0.66, N = 10)

SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 607.02 (SE +/- 1.56, N = 12)
Dynatron A38: 605.97 (SE +/- 0.85, N = 12)
Dynatron A26: 602.55 (SE +/- 1.45, N = 12)

1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8 - Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 92.86 (SE +/- 0.39, N = 6)
Dynatron A38: 93.05 (SE +/- 0.62, N = 6)
Dynatron A26: 92.35 (SE +/- 0.38, N = 6)

SVT-AV1 0.8 - Encoder Mode: Enc Mode 4 - Input: 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 9.473 (SE +/- 0.108, N = 4)
Dynatron A38: 9.355 (SE +/- 0.114, N = 6)
Dynatron A26: 9.452 (SE +/- 0.083, N = 4)

SVT-AV1 0.8 - Encoder Mode: Enc Mode 0 - Input: 1080p (Frames Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 0.130 (SE +/- 0.001, N = 3)
Dynatron A38: 0.130 (SE +/- 0.001, N = 3)
Dynatron A26: 0.130 (SE +/- 0.000, N = 3)

1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.7.1 - Test: CPU BLAS - sCOPY (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 1035 (SE +/- 30.05, N = 15)
Dynatron A38: 1044 (SE +/- 27.57, N = 15)
Dynatron A26: 1052 (SE +/- 28.51, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - sAXPY (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 627 (SE +/- 2.32, N = 15)
Dynatron A38: 640 (SE +/- 1.86, N = 15)
Dynatron A26: 640 (SE +/- 2.67, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dCOPY (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 1401 (SE +/- 5.47, N = 15)
Dynatron A38: 1399 (SE +/- 3.16, N = 15)
Dynatron A26: 1409 (SE +/- 3.55, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dAXPY (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 1222 (SE +/- 2.23, N = 15)
Dynatron A38: 1204 (SE +/- 1.31, N = 15)
Dynatron A26: 1205 (SE +/- 2.28, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dDOT (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 1115 (SE +/- 2.15, N = 15)
Dynatron A38: 1112 (SE +/- 2.00, N = 15)
Dynatron A26: 1116 (SE +/- 1.73, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-N (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 88.6 (SE +/- 7.63, N = 15)
Dynatron A38: 78.9 (SE +/- 4.88, N = 15)
Dynatron A26: 82.6 (SE +/- 9.76, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-T (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 793 (SE +/- 1.64, N = 15)
Dynatron A38: 779 (SE +/- 3.44, N = 15)
Dynatron A26: 781 (SE +/- 1.78, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
Noctua NH-U9 TR4-SP3: 88.1 (SE +/- 0.52, N = 15)
Dynatron A38: 88.2 (SE +/- 0.34, N = 15)
Dynatron A26: 88.6 (SE +/- 0.08, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
Noctua NH-U9 TR4-SP3: 86.1 (SE +/- 0.28, N = 15)
Dynatron A38: 86.3 (SE +/- 0.30, N = 15)
Dynatron A26: 86.6 (SE +/- 0.06, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
Noctua NH-U9 TR4-SP3: 92.1 (SE +/- 0.04, N = 15)
Dynatron A38: 91.9 (SE +/- 0.13, N = 15)
Dynatron A26: 92.1 (SE +/- 0.03, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
Noctua NH-U9 TR4-SP3: 89.7 (SE +/- 0.13, N = 15)
Dynatron A38: 89.9 (SE +/- 0.05, N = 15)
Dynatron A26: 89.9 (SE +/- 0.02, N = 14)

ViennaCL 1.7.1 - Test: CPU BLAS - sDOT (GB/s, More Is Better)
Noctua NH-U9 TR4-SP3: 637 (SE +/- 0.77, N = 14)
Dynatron A38: 636 (SE +/- 1.17, N = 13)
Dynatron A26: 640 (SE +/- 1.19, N = 14)

1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU GMP GMPbench

GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.

GNU GMP GMPbench 6.2.1 - Total Time (GMPbench Score, More Is Better)
Noctua NH-U9 TR4-SP3: 5098.8
Dynatron A38: 5089.1
Dynatron A26: 5099.2

1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

OpenSCAD

OpenSCAD is a programmer-focused solid 3D CAD modeller. OpenSCAD is free software and allows creating 3D CAD objects in a script-based modelling environment. This test profile uses the system-provided OpenSCAD program and times how long it takes to render different SCAD assets to PNG output. Learn more via the OpenBenchmarking.org test page.

OpenSCAD - Render: Projector Mount Swivel (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 100.90 (SE +/- 0.29, N = 3)
Dynatron A38: 101.05 (SE +/- 0.14, N = 3)
Dynatron A26: 100.72 (SE +/- 0.42, N = 3)

OpenSCAD - Render: Leonardo Phone Case Slim (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 18.70 (SE +/- 0.10, N = 3)
Dynatron A38: 18.73 (SE +/- 0.10, N = 3)
Dynatron A26: 18.53 (SE +/- 0.04, N = 3)

OpenSCAD - Render: Pistol (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 109.83 (SE +/- 0.20, N = 3)
Dynatron A38: 108.79 (SE +/- 0.31, N = 3)
Dynatron A26: 108.62 (SE +/- 0.10, N = 3)

OpenSCAD - Render: Retro Car (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 19.06 (SE +/- 0.02, N = 3)
Dynatron A38: 18.93 (SE +/- 0.03, N = 3)
Dynatron A26: 18.89 (SE +/- 0.05, N = 3)

OpenSCAD - Render: Mini-ITX Case (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 45.84 (SE +/- 0.06, N = 3)
Dynatron A38: 45.48 (SE +/- 0.17, N = 3)
Dynatron A26: 45.23 (SE +/- 0.17, N = 3)

1. OpenSCAD version 2019.05

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Stockfish 13 - Total Time (Nodes Per Second, More Is Better)
Noctua NH-U9 TR4-SP3: 156656685 (SE +/- 2161918.89, N = 4)
Dynatron A38: 158512004 (SE +/- 1246901.76, N = 3)
Dynatron A26: 160107651 (SE +/- 2061799.88, N = 15)

1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 5.15653072 (SE +/- 0.01747253, N = 7)
Dynatron A38: 5.15051767 (SE +/- 0.02393955, N = 7)
Dynatron A26: 5.15065159 (SE +/- 0.02227690, N = 7)

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 22.51 (SE +/- 0.04, N = 3)
Dynatron A38: 22.71 (SE +/- 0.08, N = 3)
Dynatron A26: 22.33 (SE +/- 0.30, N = 3)

Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d (Seconds, Fewer Is Better)
Noctua NH-U9 TR4-SP3: 625.81 (SE +/- 0.48, N = 3)
Dynatron A38: 627.67 (SE +/- 0.23, N = 3)
Dynatron A26: 667.27 (SE +/- 11.65, N = 9)

1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
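
Each result is reported with a standard error (SE) and a run count (N), which allows a rough check of whether a gap between coolers exceeds run-to-run noise. The Python sketch below applies this to the X3D-benchmarking input, where the Dynatron A26 result stands apart from the other two. It is not part of the original analysis: it assumes the runs are independent and that SE is the sample standard deviation divided by the square root of N, and the helper names are hypothetical.

```python
from math import sqrt


def combined_se_gap(mean_a, se_a, mean_b, se_b):
    """Difference of two means, in units of their combined standard error."""
    return (mean_a - mean_b) / sqrt(se_a ** 2 + se_b ** 2)


def std_dev_from_se(se, n):
    """Recover the sample standard deviation, assuming SE = SD / sqrt(N)."""
    return se * sqrt(n)


# X3D-benchmarking input.i3d, seconds (fewer is better): (mean, SE, N).
noctua = (625.81, 0.48, 3)
a26 = (667.27, 11.65, 9)
a38 = (627.67, 0.23, 3)

gap_a26 = combined_se_gap(a26[0], a26[1], noctua[0], noctua[1])
gap_a38 = combined_se_gap(a38[0], a38[1], noctua[0], noctua[1])
sd_a26 = std_dev_from_se(a26[1], a26[2])

print(f"Dynatron A26 vs Noctua: {gap_a26:.2f} combined SEs")
print(f"Dynatron A38 vs Noctua: {gap_a38:.2f} combined SEs")
print(f"Dynatron A26 run-to-run SD: {sd_a26:.2f} s")
```

With only three to nine runs per configuration, a figure like this is a rough guide rather than a formal significance test. Note also that the Dynatron A26 run count of 9 versus 3 for the others suggests the test suite repeated that configuration because of its higher variance.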

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 0.880047 (SE +/- 0.000667, N = 7, MIN: 0.84)
  Dynatron A38: 0.879480 (SE +/- 0.000819, N = 7, MIN: 0.84)
  Dynatron A26: 0.879752 (SE +/- 0.000564, N = 7, MIN: 0.84)

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 1.65220 (SE +/- 0.00301, N = 7, MIN: 1.57)
  Dynatron A38: 1.67256 (SE +/- 0.01439, N = 7, MIN: 1.57)
  Dynatron A26: 1.65671 (SE +/- 0.00304, N = 7, MIN: 1.57)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 7.13608 (SE +/- 0.03799, N = 3, MIN: 6.04)
  Dynatron A38: 7.18002 (SE +/- 0.01979, N = 3, MIN: 6.18)
  Dynatron A26: 7.17839 (SE +/- 0.02988, N = 3, MIN: 6.17)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 0.605058 (SE +/- 0.001828, N = 3, MIN: 0.56)
  Dynatron A38: 0.603621 (SE +/- 0.000705, N = 3, MIN: 0.56)
  Dynatron A26: 0.607655 (SE +/- 0.001607, N = 3, MIN: 0.56)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 3.03820 (SE +/- 0.00904, N = 9, MIN: 2.21)
  Dynatron A38: 3.04352 (SE +/- 0.00693, N = 9, MIN: 2.34)
  Dynatron A26: 3.02459 (SE +/- 0.00911, N = 9, MIN: 2.24)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 0.781851 (SE +/- 0.002596, N = 9, MIN: 0.72)
  Dynatron A38: 0.781060 (SE +/- 0.002274, N = 9, MIN: 0.72)
  Dynatron A26: 0.783464 (SE +/- 0.002597, N = 9, MIN: 0.72)

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 1.17796 (SE +/- 0.00196, N = 4, MIN: 1.1)
  Dynatron A38: 1.18087 (SE +/- 0.00303, N = 4, MIN: 1.1)
  Dynatron A26: 1.17863 (SE +/- 0.00176, N = 4, MIN: 1.12)

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 1.17758 (SE +/- 0.00777, N = 4, MIN: 1)
  Dynatron A38: 1.17906 (SE +/- 0.01187, N = 4, MIN: 0.99)
  Dynatron A26: 1.18685 (SE +/- 0.00760, N = 4, MIN: 0.98)

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 3.66307 (SE +/- 0.04102, N = 5, MIN: 3.37)
  Dynatron A38: 3.61762 (SE +/- 0.03350, N = 5, MIN: 3.36)
  Dynatron A26: 3.64941 (SE +/- 0.02260, N = 5, MIN: 3.4)

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 0.663059 (SE +/- 0.006455, N = 5, MIN: 0.61)
  Dynatron A38: 0.640331 (SE +/- 0.005445, N = 5, MIN: 0.58)
  Dynatron A26: 0.650624 (SE +/- 0.004042, N = 5, MIN: 0.59)

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 0.384929 (SE +/- 0.005034, N = 4, MIN: 0.36)
  Dynatron A38: 0.385207 (SE +/- 0.004169, N = 4, MIN: 0.36)
  Dynatron A26: 0.381495 (SE +/- 0.001495, N = 4, MIN: 0.36)

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 0.720928 (SE +/- 0.002487, N = 4, MIN: 0.67)
  Dynatron A38: 0.721586 (SE +/- 0.001725, N = 4, MIN: 0.67)
  Dynatron A26: 0.726789 (SE +/- 0.001153, N = 4, MIN: 0.67)

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 1373.91 (SE +/- 3.07, N = 3, MIN: 1332.45)
  Dynatron A38: 1400.19 (SE +/- 14.96, N = 3, MIN: 1335.97)
  Dynatron A26: 1378.86 (SE +/- 9.11, N = 3, MIN: 1326.59)

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 1385.32 (SE +/- 3.58, N = 3, MIN: 1350.61)
  Dynatron A38: 1377.49 (SE +/- 4.57, N = 3, MIN: 1335.99)
  Dynatron A26: 1369.98 (SE +/- 3.15, N = 3, MIN: 1325.96)

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 1370.82 (SE +/- 8.02, N = 3, MIN: 1322.76)
  Dynatron A38: 1381.77 (SE +/- 8.30, N = 3, MIN: 1330.53)
  Dynatron A26: 1391.14 (SE +/- 4.55, N = 3, MIN: 1343.15)

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 665.23 (SE +/- 0.56, N = 3, MIN: 638.88)
  Dynatron A38: 665.53 (SE +/- 1.18, N = 3, MIN: 639.97)
  Dynatron A26: 666.05 (SE +/- 1.29, N = 3, MIN: 638.64)

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 664.99 (SE +/- 1.05, N = 3, MIN: 636.72)
  Dynatron A38: 665.27 (SE +/- 0.96, N = 3, MIN: 637.88)
  Dynatron A26: 664.81 (SE +/- 1.25, N = 3, MIN: 637.03)

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 664.68 (SE +/- 1.52, N = 3, MIN: 637.44)
  Dynatron A38: 666.13 (SE +/- 1.16, N = 3, MIN: 639.05)
  Dynatron A26: 666.48 (SE +/- 1.24, N = 3, MIN: 640.16)

(CXX) g++ options for all oneDNN results: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.3 - Model: SqueezeNetV1.0 (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 5.785 (SE +/- 0.039, N = 3, MIN: 5.58 / MAX: 6.64)
  Dynatron A38: 5.878 (SE +/- 0.015, N = 3, MIN: 5.64 / MAX: 6.84)
  Dynatron A26: 5.797 (SE +/- 0.027, N = 3, MIN: 5.54 / MAX: 7.52)

Mobile Neural Network 1.1.3 - Model: resnet-v2-50 (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 22.13 (SE +/- 0.07, N = 3, MIN: 21.58 / MAX: 32.33)
  Dynatron A38: 22.27 (SE +/- 0.10, N = 3, MIN: 21.57 / MAX: 30.86)
  Dynatron A26: 22.40 (SE +/- 0.09, N = 3, MIN: 21.64 / MAX: 41.29)

Mobile Neural Network 1.1.3 - Model: MobileNetV2_224 (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 3.757 (SE +/- 0.017, N = 3, MIN: 3.63 / MAX: 6.09)
  Dynatron A38: 3.780 (SE +/- 0.016, N = 3, MIN: 3.66 / MAX: 6.67)
  Dynatron A26: 3.789 (SE +/- 0.021, N = 3, MIN: 3.66 / MAX: 4.64)

Mobile Neural Network 1.1.3 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 2.328 (SE +/- 0.010, N = 3, MIN: 2.28 / MAX: 2.55)
  Dynatron A38: 2.332 (SE +/- 0.014, N = 3, MIN: 2.28 / MAX: 2.65)
  Dynatron A26: 2.342 (SE +/- 0.012, N = 3, MIN: 2.29 / MAX: 2.64)

Mobile Neural Network 1.1.3 - Model: inception-v3 (ms, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 28.17 (SE +/- 0.03, N = 3, MIN: 27.06 / MAX: 43.23)
  Dynatron A38: 28.33 (SE +/- 0.10, N = 3, MIN: 27.14 / MAX: 44.36)
  Dynatron A26: 28.45 (SE +/- 0.13, N = 3, MIN: 27.16 / MAX: 42.86)

(CXX) g++ options for all Mobile Neural Network results: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ASTC Encoder

ASTC Encoder (astcenc) is an encoder for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile runs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.4 - Preset: Medium (Seconds, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 4.9176 (SE +/- 0.0039, N = 7)
  Dynatron A38: 4.9424 (SE +/- 0.0054, N = 7)
  Dynatron A26: 4.9412 (SE +/- 0.0032, N = 7)

ASTC Encoder 2.4 - Preset: Thorough (Seconds, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 7.9925 (SE +/- 0.0055, N = 6)
  Dynatron A38: 7.9898 (SE +/- 0.0064, N = 6)
  Dynatron A26: 8.0062 (SE +/- 0.0081, N = 6)

ASTC Encoder 2.4 - Preset: Exhaustive (Seconds, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 20.44 (SE +/- 0.05, N = 3)
  Dynatron A38: 20.56 (SE +/- 0.02, N = 3)
  Dynatron A26: 20.62 (SE +/- 0.02, N = 3)

(CXX) g++ options for all ASTC Encoder results: -O3 -flto -pthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.8.2 - Throughput Test: PartialTweets (GB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 3.64 (SE +/- 0.01, N = 3)
  Dynatron A38: 3.63 (SE +/- 0.00, N = 3)
  Dynatron A26: 3.63 (SE +/- 0.00, N = 3)

simdjson 0.8.2 - Throughput Test: LargeRandom (GB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 0.96 (SE +/- 0.00, N = 3)
  Dynatron A38: 0.96 (SE +/- 0.00, N = 3)
  Dynatron A26: 0.96 (SE +/- 0.00, N = 3)

simdjson 0.8.2 - Throughput Test: Kostya (GB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 2.83 (SE +/- 0.00, N = 3)
  Dynatron A38: 2.83 (SE +/- 0.01, N = 3)
  Dynatron A26: 2.83 (SE +/- 0.00, N = 3)

simdjson 0.8.2 - Throughput Test: DistinctUserID (GB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 4.01 (SE +/- 0.01, N = 3)
  Dynatron A38: 3.98 (SE +/- 0.01, N = 3)
  Dynatron A26: 4.00 (SE +/- 0.01, N = 3)

(CXX) g++ options for all simdjson results: -O3 -pthread

srsLTE

srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software-defined radio (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.

srsLTE 20.10.1 - Test: OFDM_Test (Samples / Second, More Is Better)
  Noctua NH-U9 TR4-SP3: 115633333 (SE +/- 1770436.23, N = 3)
  Dynatron A38: 117866667 (SE +/- 1178039.80, N = 3)
  Dynatron A26: 116233333 (SE +/- 1902921.73, N = 3)

srsLTE 20.10.1 - Test: PHY_DL_Test (eNb Mb/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 257.4 (SE +/- 0.22, N = 3)
  Dynatron A38: 255.9 (SE +/- 0.84, N = 3)
  Dynatron A26: 257.1 (SE +/- 0.18, N = 3)

srsLTE 20.10.1 - Test: PHY_DL_Test (UE Mb/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 94.3 (SE +/- 0.46, N = 3)
  Dynatron A38: 93.7 (SE +/- 0.52, N = 3)
  Dynatron A26: 94.3 (SE +/- 0.23, N = 3)

(CXX) g++ options for all srsLTE results: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

GNU Radio - Test: Five Back to Back FIR Filters (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 560.9 (SE +/- 8.06, N = 4)
  Dynatron A38: 560.1 (SE +/- 5.99, N = 9)
  Dynatron A26: 567.7 (SE +/- 5.98, N = 3)

GNU Radio - Test: Signal Source (Cosine) (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 3322.2 (SE +/- 6.28, N = 4)
  Dynatron A38: 3303.6 (SE +/- 17.69, N = 9)
  Dynatron A26: 3324.7 (SE +/- 26.65, N = 3)

GNU Radio - Test: FIR Filter (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 639.0 (SE +/- 0.72, N = 4)
  Dynatron A38: 642.0 (SE +/- 0.99, N = 9)
  Dynatron A26: 641.6 (SE +/- 1.06, N = 3)

GNU Radio - Test: IIR Filter (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 608.6 (SE +/- 1.08, N = 4)
  Dynatron A38: 607.0 (SE +/- 1.20, N = 9)
  Dynatron A26: 604.7 (SE +/- 1.25, N = 3)

GNU Radio - Test: FM Deemphasis Filter (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 765.3 (SE +/- 1.86, N = 4)
  Dynatron A38: 764.1 (SE +/- 1.58, N = 9)
  Dynatron A26: 760.3 (SE +/- 1.05, N = 3)

GNU Radio - Test: Hilbert Transform (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 378.2 (SE +/- 1.21, N = 4)
  Dynatron A38: 376.2 (SE +/- 0.76, N = 9)
  Dynatron A26: 377.3 (SE +/- 1.82, N = 3)

Footnote on all GNU Radio results: 3.8.1.0

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

LuaRadio 0.9.1 - Test: Five Back to Back FIR Filters (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 1101.6 (SE +/- 3.75, N = 3)
  Dynatron A38: 1097.4 (SE +/- 2.63, N = 3)
  Dynatron A26: 1110.6 (SE +/- 1.05, N = 3)

LuaRadio 0.9.1 - Test: FM Deemphasis Filter (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 344.7 (SE +/- 0.20, N = 3)
  Dynatron A38: 343.4 (SE +/- 0.41, N = 3)
  Dynatron A26: 344.2 (SE +/- 0.22, N = 3)

LuaRadio 0.9.1 - Test: Hilbert Transform (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 93.5 (SE +/- 0.06, N = 3)
  Dynatron A38: 93.5 (SE +/- 0.03, N = 3)
  Dynatron A26: 93.6 (SE +/- 0.03, N = 3)

LuaRadio 0.9.1 - Test: Complex Phase (MiB/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 591.8 (SE +/- 0.62, N = 3)
  Dynatron A38: 591.0 (SE +/- 0.49, N = 3)
  Dynatron A26: 591.6 (SE +/- 0.72, N = 3)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 803310000 (SE +/- 6476302.96, N = 3)
  Dynatron A38: 804146667 (SE +/- 2740148.01, N = 3)
  Dynatron A26: 800190000 (SE +/- 5535858.86, N = 3)

Liquid-DSP 2021.01.31 - Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 1613766667 (SE +/- 3887729.99, N = 3)
  Dynatron A38: 1614566667 (SE +/- 2380709.51, N = 3)
  Dynatron A26: 1614933333 (SE +/- 3637917.60, N = 3)

Liquid-DSP 2021.01.31 - Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 2792400000 (SE +/- 5372460.64, N = 3)
  Dynatron A38: 2793733333 (SE +/- 3347304.06, N = 3)
  Dynatron A26: 2782866667 (SE +/- 3268196.92, N = 3)

Liquid-DSP 2021.01.31 - Threads: 128 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Noctua NH-U9 TR4-SP3: 3017933333 (SE +/- 2577035.33, N = 3)
  Dynatron A38: 3028133333 (SE +/- 266666.67, N = 3)
  Dynatron A26: 3028066667 (SE +/- 448454.13, N = 3)

(CC) gcc options for all Liquid-DSP results: -O3 -pthread -lm -lc -lliquid
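
Because Liquid-DSP is run at four thread counts, the results can also be read as a scaling curve. The sketch below does this for the Noctua NH-U9 TR4-SP3 run only, assuming the first value in each chart belongs to that cooler; the variable names are illustrative and not part of the test profile.

```python
# Noctua NH-U9 TR4-SP3 throughput in samples/s, keyed by thread count.
# Assumes the first value listed in each Liquid-DSP chart is the Noctua run.
throughput = {
    16: 803310000,
    32: 1613766667,
    64: 2792400000,
    128: 3017933333,
}

base_threads = 16
base_rate = throughput[base_threads]

# For each thread count: (speedup over 16 threads, fraction of ideal linear scaling).
scaling = {}
for threads, rate in throughput.items():
    speedup = rate / base_rate
    ideal = threads / base_threads
    scaling[threads] = (speedup, speedup / ideal)
    print(f"{threads:>3} threads: {speedup:.2f}x speedup, {speedup / ideal:.0%} of linear")
```

Scaling stays close to linear up to 32 threads and flattens between 64 and 128, which is unsurprising on a 64-core part where the last doubling can only come from SMT siblings.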

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2021 - Input: water_GMX50_bare (Ns Per Day, More Is Better)
  Noctua NH-U9 TR4-SP3: 5.577 (SE +/- 0.003, N = 3)
  Dynatron A38: 5.599 (SE +/- 0.010, N = 3)
  Dynatron A26: 5.582 (SE +/- 0.018, N = 3)
(CXX) g++ options: -O3 -pthread

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  Noctua NH-U9 TR4-SP3: 0.38110 (SE +/- 0.00051, N = 3)
  Dynatron A38: 0.38215 (SE +/- 0.00076, N = 3)
  Dynatron A26: 0.38164 (SE +/- 0.00041, N = 3)
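
NAMD reports days/ns (fewer is better), while GROMACS reports ns per day (more is better). Taking the reciprocal puts the NAMD numbers in the more familiar ns/day form. This is only a unit conversion, sketched under the assumption that the values are listed in label order; the two packages simulate different systems, so the converted figures are not comparable with the GROMACS result.

```python
# NAMD results in days/ns as reported for the ATPase simulation;
# the label-to-value order is an assumption.
namd_days_per_ns = {
    "Noctua NH-U9 TR4-SP3": 0.38110,
    "Dynatron A38": 0.38215,
    "Dynatron A26": 0.38164,
}

# ns/day is simply the reciprocal of days/ns.
namd_ns_per_day = {name: 1.0 / value for name, value in namd_days_per_ns.items()}

for name, value in namd_ns_per_day.items():
    print(f"{name}: {value:.3f} ns/day")
```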

CPU Temperature Monitor

CPU Temperature Monitor - Phoronix Test Suite System Monitoring (Celsius)
  Noctua NH-U9 TR4-SP3: Min: 41.5 / Avg: 56.86 / Max: 79.5
  Dynatron A38: Min: 40.25 / Avg: 51.96 / Max: 70.25
  Dynatron A26: Min: 41 / Avg: 59.01 / Max: 79.25
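
Since the performance results are nearly identical across coolers, the temperature readings are where the heatsinks differ. The sketch below computes each Dynatron cooler's difference from the Noctua NH-U9 TR4-SP3 in degrees Celsius, assuming the Min/Avg/Max triples are listed in the same order as the cooler labels; the dictionary layout is illustrative only.

```python
# Min / Avg / Max CPU temperature in Celsius over the whole run;
# the label-to-value order is an assumption.
temps = {
    "Noctua NH-U9 TR4-SP3": {"min": 41.5, "avg": 56.86, "max": 79.5},
    "Dynatron A38": {"min": 40.25, "avg": 51.96, "max": 70.25},
    "Dynatron A26": {"min": 41.0, "avg": 59.01, "max": 79.25},
}

baseline = "Noctua NH-U9 TR4-SP3"

# Difference from the baseline cooler; negative means cooler than the Noctua.
delta = {
    name: {key: round(reading[key] - temps[baseline][key], 2) for key in reading}
    for name, reading in temps.items()
    if name != baseline
}

for name, diff in delta.items():
    print(f"{name}: avg {diff['avg']:+.2f} C, max {diff['max']:+.2f} C vs {baseline}")
```

Keep in mind that the three runs had different total durations, so the averages cover slightly different workloads over time.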

107 Results Shown

IndigoBench:
  CPU - Supercar
  CPU - Bedroom
Chaos Group V-RAY
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Fishy Cat - CPU-Only
  Pabellon Barcelona - CPU-Only
  Barbershop - CPU-Only
Timed Linux Kernel Compilation
Timed GDB GNU Debugger Compilation
Timed Apache Compilation
Timed Mesa Compilation
Timed Node.js Compilation
Timed Erlang/OTP Compilation
AOM AV1:
  Speed 9 Realtime - Bosphorus 1080p
  Speed 9 Realtime - Bosphorus 4K
  Speed 8 Realtime - Bosphorus 1080p
  Speed 8 Realtime - Bosphorus 4K
  Speed 6 Realtime - Bosphorus 1080p
  Speed 6 Realtime - Bosphorus 4K
  Speed 6 Two-Pass - Bosphorus 1080p
  Speed 6 Two-Pass - Bosphorus 4K
  Speed 4 Two-Pass - Bosphorus 1080p
  Speed 4 Two-Pass - Bosphorus 4K
  Speed 0 Two-Pass - Bosphorus 1080p
  Speed 0 Two-Pass - Bosphorus 4K
SVT-VP9:
  Visual Quality Optimized - Bosphorus 1080p
  PSNR/SSIM Optimized - Bosphorus 1080p
  VMAF Optimized - Bosphorus 1080p
SVT-HEVC:
  1 - Bosphorus 1080p
  7 - Bosphorus 1080p
  10 - Bosphorus 1080p
SVT-AV1:
  Enc Mode 8 - 1080p
  Enc Mode 4 - 1080p
  Enc Mode 0 - 1080p
ViennaCL:
  CPU BLAS - sCOPY
  CPU BLAS - sAXPY
  CPU BLAS - dCOPY
  CPU BLAS - dAXPY
  CPU BLAS - dDOT
  CPU BLAS - dGEMV-N
  CPU BLAS - dGEMV-T
  CPU BLAS - dGEMM-NN
  CPU BLAS - dGEMM-NT
  CPU BLAS - dGEMM-TN
  CPU BLAS - dGEMM-TT
  CPU BLAS - sDOT
GNU GMP GMPbench
OpenSCAD:
  Projector Mount Swivel
  Leonardo Phone Case Slim
  Pistol
  Retro Car
  Mini-ITX Case
Stockfish
Xcompact3d Incompact3d:
  input.i3d 129 Cells Per Direction
  input.i3d 193 Cells Per Direction
  X3D-benchmarking input.i3d
oneDNN:
  Convolution Batch Shapes Auto - f32 - CPU
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  IP Shapes 1D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
  IP Shapes 3D - f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Mobile Neural Network:
  SqueezeNetV1.0
  resnet-v2-50
  MobileNetV2_224
  mobilenet-v1-1.0
  inception-v3
ASTC Encoder:
  Medium
  Thorough
  Exhaustive
simdjson:
  PartialTweets
  LargeRand
  Kostya
  DistinctUserID
srsLTE:
  OFDM_Test
  PHY_DL_Test
  PHY_DL_Test
GNU Radio:
  Five Back to Back FIR Filters
  Signal Source (Cosine)
  FIR Filter
  IIR Filter
  FM Deemphasis Filter
  Hilbert Transform
LuaRadio:
  Five Back to Back FIR Filters
  FM Deemphasis Filter
  Hilbert Transform
  Complex Phase
Liquid-DSP:
  16 - 256 - 57
  32 - 256 - 57
  64 - 256 - 57
  128 - 256 - 57
GROMACS
NAMD
CPU Temperature Monitor