AMD EPYC 7F72 2P Linux 5.11

Testing of 2 x AMD EPYC 7F72 24-Core processors, looking at CPU frequency invariance on Linux 5.11 with a patch applied. CPU power consumption was monitored via the AMD_Energy interface at 1-second polling.
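
As a rough illustration of how 1-second polling of an energy counter turns into watt figures, here is a minimal sketch. It assumes the counter is cumulative and reported in microjoules, as hwmon energy attributes conventionally are. The function names and the sample readings are hypothetical and are not taken from the Phoronix Test Suite.

```python
from typing import Dict, List, Optional


def average_watts(start_uj: int, end_uj: int, interval_s: float,
                  wrap_uj: Optional[int] = None) -> float:
    """Average power in watts between two cumulative energy readings (microjoules)."""
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    delta_uj = end_uj - start_uj
    if delta_uj < 0:
        # The cumulative counter went backwards, so assume it wrapped around.
        if wrap_uj is None:
            raise ValueError("counter decreased and no wrap value was given")
        delta_uj += wrap_uj
    # microjoules -> joules, then joules per second = watts
    return delta_uj / 1_000_000 / interval_s


def power_series(readings_uj: List[int], interval_s: float = 1.0) -> List[float]:
    """Convert consecutive cumulative readings into per-interval watt values."""
    return [average_watts(a, b, interval_s)
            for a, b in zip(readings_uj, readings_uj[1:])]


def summarize(watts: List[float]) -> Dict[str, float]:
    """Min / average / max over a series of watt samples."""
    return {"min": min(watts), "avg": sum(watts) / len(watts), "max": max(watts)}


if __name__ == "__main__":
    # Hypothetical cumulative readings taken one second apart.
    samples = power_series([0, 60_000_000, 340_000_000, 870_000_000])
    print(summarize(samples))
```

The min, average, and max of such a series are the kind of summary the power monitor graph on this page reports.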

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2101248-HA-AMDEPYC7F52

Run Management

Linux 5.10: run January 21 2021, test duration 16 Hours, 7 Minutes
Linux 5.11 Git: run January 22 2021, test duration 15 Hours
Linux 5.11 Patched: run January 23 2021, test duration 15 Hours, 14 Minutes
Average test duration: 15 Hours, 27 Minutes



AMD EPYC 7F72 2P Linux 5.11: System Details

Runs: Linux 5.10, Linux 5.11 Git, Linux 5.11 Patched

Processor: 2 x AMD EPYC 7F72 24-Core @ 3.20GHz (48 Cores / 96 Threads)
Motherboard: Supermicro H11DSi-NT v2.00 (2.1 BIOS)
Chipset: AMD Starship/Matisse
Memory: 16 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN
Disk: 1000GB Western Digital WD_BLACK SN850 1TB
Graphics: ASPEED
Monitor: VE228
Network: 2 x Intel 10G X550T
OS: Ubuntu 20.10
Kernel (Linux 5.10): 5.10.9-051009-generic (x86_64)
Kernel (Linux 5.11 Git): 5.11.0-051100rc4daily20210122-generic (x86_64) 20210121
Kernel (Linux 5.11 Patched): 5.11.0-rc4-max-boost-inv-patch (x86_64) 20210121
Desktop: GNOME Shell 3.38.1
Display Server: X Server 1.20.9
Display Driver: modesetting 1.20.9
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Details: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Details: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034
Java Details: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)
Python Details: Python 3.8.6
Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

[Chart: Result Overview, Phoronix Test Suite. Relative performance of Linux 5.10, Linux 5.11 Git, and Linux 5.11 Patched for each test, on an axis from 100% to 116%.]
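
One way to read an overview chart of this kind is that each test is normalized so the slowest run sits at 100%. The sketch below makes that assumption explicit. It is not confirmed to be the exact method the Phoronix Test Suite uses, and the function name is hypothetical. The sample figures are the TTSIOD 3D Renderer FPS results reported on this page.

```python
from typing import Dict


def relative_percent(results: Dict[str, float],
                     higher_is_better: bool = True) -> Dict[str, float]:
    """Express each run as a percentage of the worst-performing run (worst = 100%)."""
    if not results:
        raise ValueError("results must not be empty")
    if higher_is_better:
        worst = min(results.values())
        return {run: 100.0 * (value / worst) for run, value in results.items()}
    # For lower-is-better metrics (for example, seconds), invert the ratio.
    worst = max(results.values())
    return {run: 100.0 * (worst / value) for run, value in results.items()}


if __name__ == "__main__":
    ttsiod_fps = {
        "Linux 5.11 Git": 627.21,
        "Linux 5.11 Patched": 655.23,
        "Linux 5.10": 726.87,
    }
    for run, pct in relative_percent(ttsiod_fps).items():
        print(f"{run}: {pct:.1f}%")
```

Under this assumption, the Linux 5.10 TTSIOD result lands just under 116%, which matches the top of the chart axis.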

[Chart: Per Watt Result Overview, Phoronix Test Suite. Relative performance per watt of Linux 5.10, Linux 5.11 Git, and Linux 5.11 Patched for each test, on an axis from 100% to 119%.]

[Table: AMD EPYC 7F72 2P Linux 5.11 detailed results. One row per test and option, with columns for Linux 5.10, Linux 5.11 Git, and Linux 5.11 Patched.]

CPU Power Consumption Monitor

Phoronix Test Suite System Monitoring, Watts
Linux 5.11 Patched: Min: 59.88 / Avg: 280.92 / Max: 530.69
Linux 5.11 Git: Min: 60.44 / Avg: 279.2 / Max: 548.33
Linux 5.10: Min: 119.15 / Avg: 269.81 / Max: 501.96

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2
Figure Of Merit, More Is Better
Linux 5.10: 1430050667 (SE +/- 5378401.14, N = 3)
Linux 5.11 Git: 1437771333 (SE +/- 2689640.27, N = 3)
Linux 5.11 Patched: 1448718333 (SE +/- 750486.58, N = 3)
(CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
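
Each result on this page is reported with "SE +/-" and "N", that is, a standard error over N runs. A minimal sketch of the usual standard error of the mean follows. It assumes the sample standard deviation is used; whether the Phoronix Test Suite uses the sample or the population form is not stated here. The run values in the example are hypothetical.

```python
import math
import statistics
from typing import Sequence


def standard_error(samples: Sequence[float]) -> float:
    """Standard error of the mean: sample standard deviation divided by sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("at least two samples are needed")
    return statistics.stdev(samples) / math.sqrt(n)


if __name__ == "__main__":
    runs = [10.0, 12.0, 14.0]  # hypothetical per-run results, N = 3
    mean = statistics.mean(runs)
    print(f"{mean:.2f} (SE +/- {standard_error(runs):.2f}, N = {len(runs)})")
```

A small SE relative to the gap between two kernels suggests the difference is less likely to be run-to-run noise.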

BlogBench

BlogBench is designed to replicate the load of a real-world busy file server by stressing the file-system with multiple threads of random reads, writes, and rewrites. It mimics the behavior of a blog by creating blogs with content and pictures, modifying blog posts, adding comments to these blogs, and then reading the content of the blogs. All of the generated blogs are created locally with fake content and pictures. Learn more via the OpenBenchmarking.org test page.

BlogBench 1.1, Test: Read
Final Score, More Is Better
Linux 5.10: 981673 (SE +/- 4087.26, N = 3)
Linux 5.11 Git: 1084405 (SE +/- 10984.18, N = 9)
Linux 5.11 Patched: 1103118 (SE +/- 1738.41, N = 3)
(CC) gcc options: -O2 -pthread

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

dav1d 0.8.1, Video Input: Summer Nature 4K
FPS, More Is Better
Linux 5.11 Git: 308.29 (SE +/- 1.86, N = 3; MIN: 163.13 / MAX: 334.13)
Linux 5.10: 310.21 (SE +/- 1.19, N = 3; MIN: 160.01 / MAX: 335.18)
Linux 5.11 Patched: 317.45 (SE +/- 0.53, N = 3; MIN: 173.69 / MAX: 340.43)
(CC) gcc options: -pthread

dav1d 0.8.1, Video Input: Chimera 1080p 10-bit
FPS, More Is Better
Linux 5.11 Git: 130.61 (SE +/- 0.23, N = 3; MIN: 90.23 / MAX: 199.74)
Linux 5.10: 130.62 (SE +/- 0.47, N = 3; MIN: 89.55 / MAX: 202.25)
Linux 5.11 Patched: 133.37 (SE +/- 0.14, N = 3; MIN: 92.59 / MAX: 205.11)
(CC) gcc options: -pthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: Magnetic Reconnection - Renderer: SciVis
FPS, More Is Better
Linux 5.11 Git: 32.26 (SE +/- 0.00, N = 6; MIN: 12.66 / MAX: 33.33)
Linux 5.10: 32.62 (SE +/- 0.23, N = 6; MIN: 12.05 / MAX: 34.48)
Linux 5.11 Patched: 32.62 (SE +/- 0.23, N = 6; MIN: 12.82 / MAX: 33.33)

OSPray 1.8.5, Demo: Magnetic Reconnection - Renderer: Path Tracer
FPS, More Is Better
Linux 5.10: 250 (MIN: 100 / MAX: 333.33)
Linux 5.11 Git: 250 (MIN: 90.91 / MAX: 500)
Linux 5.11 Patched: 250 (MIN: 90.91 / MAX: 333.33)

OSPray 1.8.5, Demo: XFrog Forest - Renderer: SciVis
FPS, More Is Better
Linux 5.11 Git: 11.11 (SE +/- 0.00, N = 3; MIN: 9.62 / MAX: 11.24)
Linux 5.10: 11.19 (SE +/- 0.04, N = 3; MIN: 9.35 / MAX: 11.36)
Linux 5.11 Patched: 11.19 (SE +/- 0.04, N = 3; MIN: 8.2 / MAX: 11.36)

OSPray 1.8.5, Demo: XFrog Forest - Renderer: Path Tracer
FPS Per Watt, More Is Better
Linux 5.10: 0.01
Linux 5.11 Git: 0.01
Linux 5.11 Patched: 0.01

OSPray 1.8.5, Demo: XFrog Forest - Renderer: Path Tracer
FPS, More Is Better
Linux 5.11 Git: 5.91 (SE +/- 0.01, N = 3; MIN: 5.18 / MAX: 5.99)
Linux 5.10: 5.95 (SE +/- 0.00, N = 3; MIN: 5.46 / MAX: 6.02)
Linux 5.11 Patched: 5.95 (SE +/- 0.00, N = 3; MIN: 5.35 / MAX: 6.02)

OSPray 1.8.5, Demo: NASA Streamlines - Renderer: SciVis
FPS, More Is Better
Linux 5.10: 71.43 (SE +/- 0.00, N = 7; MIN: 21.74 / MAX: 76.92)
Linux 5.11 Git: 71.43 (SE +/- 0.00, N = 7; MIN: 21.28 / MAX: 76.92)
Linux 5.11 Patched: 71.43 (SE +/- 0.00, N = 7; MIN: 19.61 / MAX: 76.92)

TTSIOD 3D Renderer

TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping
FPS Per Watt, More Is Better
Linux 5.11 Git: 2.59
Linux 5.11 Patched: 2.70
Linux 5.10: 2.99
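
A per-watt figure can be read as the raw result divided by the average CPU power drawn while that test ran. The sketch below assumes that definition, which this page does not spell out, and uses hypothetical function names. Dividing the TTSIOD FPS result reported on this page by its FPS-per-watt value gives a rough estimate of the average power during the test. The per-watt values are rounded to two decimals, so the estimate is approximate.

```python
def per_watt(result: float, avg_watts: float) -> float:
    """Performance per watt: raw result divided by average power during the test."""
    if avg_watts <= 0:
        raise ValueError("avg_watts must be positive")
    return result / avg_watts


def implied_watts(result: float, result_per_watt: float) -> float:
    """Back out the average power implied by a result and its per-watt figure."""
    if result_per_watt <= 0:
        raise ValueError("result_per_watt must be positive")
    return result / result_per_watt


if __name__ == "__main__":
    # TTSIOD 3D Renderer on Linux 5.10: 726.87 FPS at 2.99 FPS per watt.
    watts = implied_watts(726.87, 2.99)
    print(f"Implied average CPU power: about {watts:.0f} W")
```

For all three kernels the implied power comes out in the low 240 W range, so the per-watt ordering for this test follows the raw FPS ordering.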

OSPray

OSPray 1.8.5, Demo: NASA Streamlines - Renderer: Path Tracer
FPS, More Is Better
Linux 5.10: 16.39 (SE +/- 0.00, N = 4; MIN: 10.64 / MAX: 16.95)
Linux 5.11 Git: 16.39 (SE +/- 0.00, N = 4; MIN: 10.31 / MAX: 16.67)
Linux 5.11 Patched: 16.39 (SE +/- 0.00, N = 4; MIN: 10.99 / MAX: 16.95)

OSPray 1.8.5, Demo: San Miguel - Renderer: SciVis
FPS, More Is Better
Linux 5.10: 52.63 (SE +/- 0.00, N = 3; MIN: 24.39 / MAX: 58.82)
Linux 5.11 Git: 52.63 (SE +/- 0.00, N = 3; MIN: 27.03 / MAX: 58.82)
Linux 5.11 Patched: 54.97 (SE +/- 0.58, N = 5; MIN: 31.25 / MAX: 58.82)

OSPray 1.8.5, Demo: San Miguel - Renderer: Path Tracer
FPS, More Is Better
Linux 5.11 Git: 4.30 (SE +/- 0.01, N = 3; MIN: 3.38 / MAX: 4.35)
Linux 5.11 Patched: 4.32 (SE +/- 0.01, N = 3; MIN: 3.76 / MAX: 4.37)
Linux 5.10: 4.33 (SE +/- 0.01, N = 3; MIN: 3.44 / MAX: 4.37)

PlaidML

This test profile uses the PlaidML deep learning framework developed by Intel to offer various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML, FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU
FPS, More Is Better
Linux 5.10: 4.54 (SE +/- 0.02, N = 3)
Linux 5.11 Patched: 4.63 (SE +/- 0.03, N = 3)
Linux 5.11 Git: 4.65 (SE +/- 0.02, N = 3)

PlaidML, FP16: No - Mode: Inference - Network: VGG16 - Device: CPU
FPS, More Is Better
Linux 5.11 Git: 25.12 (SE +/- 0.22, N = 15)
Linux 5.10: 25.36 (SE +/- 0.14, N = 3)
Linux 5.11 Patched: 25.42 (SE +/- 0.30, N = 15)

PlaidML, FP16: No - Mode: Inference - Network: VGG19 - Device: CPU
FPS, More Is Better
Linux 5.10: 21.63 (SE +/- 0.23, N = 15)
Linux 5.11 Git: 22.09 (SE +/- 0.20, N = 15)
Linux 5.11 Patched: 22.49 (SE +/- 0.16, N = 15)

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping
FPS, More Is Better
Linux 5.11 Git: 627.21 (SE +/- 9.04, N = 15)
Linux 5.11 Patched: 655.23 (SE +/- 3.22, N = 3)
Linux 5.10: 726.87 (SE +/- 10.33, N = 3)
(CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

rav1e

rav1e 0.4, Speed: 10
Frames Per Second Per Watt, More Is Better
Linux 5.10: 0.02
Linux 5.11 Git: 0.02
Linux 5.11 Patched: 0.02

rav1e 0.4, Speed: 6
Frames Per Second Per Watt, More Is Better
Linux 5.10: 0.01
Linux 5.11 Git: 0.01
Linux 5.11 Patched: 0.01

rav1e 0.4, Speed: 5
Frames Per Second Per Watt, More Is Better
Linux 5.10: 0.01
Linux 5.11 Git: 0.01
Linux 5.11 Patched: 0.01

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 8 - Input: 1080p
Frames Per Second, More Is Better
Linux 5.11 Git: 68.23 (SE +/- 1.18, N = 15)
Linux 5.11 Patched: 68.23 (SE +/- 1.05, N = 15)
Linux 5.10: 68.89 (SE +/- 0.13, N = 5)
(CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1 0.8, Encoder Mode: Enc Mode 4 - Input: 1080p
Frames Per Second, More Is Better
Linux 5.11 Patched: 7.648 (SE +/- 0.029, N = 4)
Linux 5.10: 7.663 (SE +/- 0.018, N = 4)
Linux 5.11 Git: 7.663 (SE +/- 0.049, N = 4)
(CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.1, Tuning: Visual Quality Optimized - Input: Bosphorus 1080p
Frames Per Second, More Is Better
Linux 5.11 Git: 311.85 (SE +/- 16.16, N = 15)
Linux 5.10: 315.24 (SE +/- 10.05, N = 15)
Linux 5.11 Patched: 323.81 (SE +/- 4.21, N = 15)
(CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9 0.1, Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
Frames Per Second, More Is Better
Linux 5.10: 365.96 (SE +/- 2.11, N = 10)
Linux 5.11 Patched: 371.48 (SE +/- 1.70, N = 9)
Linux 5.11 Git: 381.08 (SE +/- 2.00, N = 10)
(CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

rav1e

rav1e 0.4, Speed: 1
Frames Per Second Per Watt, More Is Better
Runs: Linux 5.10, Linux 5.11 Git, Linux 5.11 Patched

SVT-AV1

SVT-AV1 0.8, Encoder Mode: Enc Mode 0 - Input: 1080p
Frames Per Second, More Is Better
Linux 5.11 Patched: 0.091 (SE +/- 0.001, N = 12)
Linux 5.11 Git: 0.092 (SE +/- 0.001, N = 3)
Linux 5.10: 0.094 (SE +/- 0.001, N = 3)
(CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4, Video Input: Bosphorus 1080p
Frames Per Second, More Is Better
Linux 5.11 Git: 47.66 (SE +/- 0.42, N = 7)
Linux 5.10: 48.48 (SE +/- 0.26, N = 4)
Linux 5.11 Patched: 49.45 (SE +/- 0.52, N = 4)
(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

SVT-VP9

SVT-VP9 0.1, Tuning: VMAF Optimized - Input: Bosphorus 1080p
Frames Per Second, More Is Better
Linux 5.10: 357.42 (SE +/- 1.89, N = 10)
Linux 5.11 Patched: 364.81 (SE +/- 0.91, N = 10)
Linux 5.11 Git: 369.01 (SE +/- 1.11, N = 10)
(CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

x265 3.4, Video Input: Bosphorus 4K
Frames Per Second, More Is Better
Linux 5.11 Git: 18.63 (SE +/- 0.10, N = 3)
Linux 5.10: 19.26 (SE +/- 0.13, N = 3)
Linux 5.11 Patched: 19.74 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4, Speed: 10
Frames Per Second, More Is Better
Linux 5.10: 2.837 (SE +/- 0.024, N = 3)
Linux 5.11 Git: 2.902 (SE +/- 0.016, N = 3)
Linux 5.11 Patched: 3.054 (SE +/- 0.008, N = 3)

rav1e 0.4, Speed: 6
Frames Per Second, More Is Better
Linux 5.10: 1.346 (SE +/- 0.005, N = 3)
Linux 5.11 Git: 1.370 (SE +/- 0.002, N = 3)
Linux 5.11 Patched: 1.408 (SE +/- 0.003, N = 3)

rav1e 0.4, Speed: 5
Frames Per Second, More Is Better
Linux 5.10: 1.018 (SE +/- 0.006, N = 3)
Linux 5.11 Git: 1.045 (SE +/- 0.001, N = 3)
Linux 5.11 Patched: 1.068 (SE +/- 0.001, N = 3)

rav1e 0.4, Speed: 1
Frames Per Second, More Is Better
Linux 5.10: 0.351 (SE +/- 0.003, N = 3)
Linux 5.11 Git: 0.368 (SE +/- 0.002, N = 3)
Linux 5.11 Patched: 0.372 (SE +/- 0.001, N = 3)

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1, Throughput Test: LargeRandom
GB/s, More Is Better
Linux 5.11 Patched: 0.36 (SE +/- 0.00, N = 3)
Linux 5.10: 0.37 (SE +/- 0.00, N = 3)
Linux 5.11 Git: 0.37 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson 0.7.1, Throughput Test: Kostya
GB/s, More Is Better
Linux 5.10: 0.57 (SE +/- 0.00, N = 3)
Linux 5.11 Git: 0.57 (SE +/- 0.00, N = 3)
Linux 5.11 Patched: 0.57 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson 0.7.1, Throughput Test: PartialTweets
GB/s Per Watt, More Is Better
Runs: Linux 5.10, Linux 5.11 Git, Linux 5.11 Patched

simdjson 0.7.1, Throughput Test: DistinctUserID
GB/s Per Watt, More Is Better
Runs: Linux 5.11 Git, Linux 5.10, Linux 5.11 Patched
Values shown: 0.01, 0.01

simdjson 0.7.1, Throughput Test: PartialTweets
GB/s, More Is Better
Linux 5.11 Git: 0.62 (SE +/- 0.00, N = 3)
Linux 5.10: 0.63 (SE +/- 0.00, N = 3)
Linux 5.11 Patched: 0.63 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson 0.7.1, Throughput Test: DistinctUserID
GB/s, More Is Better
Linux 5.10: 0.65 (SE +/- 0.00, N = 3)
Linux 5.11 Git: 0.65 (SE +/- 0.00, N = 3)
Linux 5.11 Patched: 0.65 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a new scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1
GFLOP/s, More Is Better
Linux 5.11 Git: 30.24 (SE +/- 0.12, N = 3)
Linux 5.11 Patched: 30.83 (SE +/- 0.13, N = 3)
Linux 5.10: 31.13 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 1.2.0, Scene: Memorial
Images / Sec, More Is Better
Linux 5.11 Git: 28.09 (SE +/- 0.09, N = 6)
Linux 5.11 Patched: 28.39 (SE +/- 0.04, N = 6)
Linux 5.10: 28.51 (SE +/- 0.06, N = 6)

ONNX Runtime

ONNX Runtime 1.6, Model: yolov4 - Device: OpenMP CPU
Inferences Per Minute Per Watt, More Is Better
Linux 5.11 Git: 0.67
Linux 5.10: 0.69
Linux 5.11 Patched: 0.69

ONNX Runtime 1.6, Model: super-resolution-10 - Device: OpenMP CPU
Inferences Per Minute Per Watt, More Is Better
Linux 5.11 Patched: 15.26
Linux 5.10: 15.42
Linux 5.11 Git: 15.72

ONNX Runtime 1.6, Model: yolov4 - Device: OpenMP CPU
Inferences Per Minute, More Is Better
Linux 5.11 Git: 175 (SE +/- 1.60, N = 12)
Linux 5.11 Patched: 181 (SE +/- 1.86, N = 3)
Linux 5.10: 182 (SE +/- 1.89, N = 12)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime 1.6, Model: super-resolution-10 - Device: OpenMP CPU
Inferences Per Minute, More Is Better
Linux 5.10: 4172 (SE +/- 84.65, N = 12)
Linux 5.11 Patched: 4210 (SE +/- 44.10, N = 3)
Linux 5.11 Git: 4393 (SE +/- 78.33, N = 9)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

Cpuminer-Opt 3.15.5, Algorithm: Quad SHA-256, Pyrite
kH/s, More Is Better
Linux 5.11 Patched: 296995 (SE +/- 5314.19, N = 12)
Linux 5.11 Git: 297502 (SE +/- 2892.28, N = 6)
Linux 5.10: 301373 (SE +/- 1776.67, N = 3)
(CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt 3.15.5, Algorithm: x25x
kH/s, More Is Better
Linux 5.10: 1503.10 (SE +/- 11.78, N = 15)
Linux 5.11 Git: 1524.21 (SE +/- 21.93, N = 14)
Linux 5.11 Patched: 1541.76 (SE +/- 17.73, N = 15)
(CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt 3.15.5, Algorithm: Garlicoin
kH/s, More Is Better
Linux 5.10: 9912.85 (SE +/- 62.19, N = 13)
Linux 5.11 Git: 9937.31 (SE +/- 88.01, N = 14)
Linux 5.11 Patched: 9949.88 (SE +/- 99.38, N = 15)
(CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt 3.15.5, Algorithm: Skeincoin
kH/s, More Is Better
Linux 5.10: 363346 (SE +/- 3597.25, N = 15)
Linux 5.11 Git: 363784 (SE +/- 3597.13, N = 15)
Linux 5.11 Patched: 364017 (SE +/- 5604.68, N = 12)
(CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt 3.15.5, Algorithm: LBC, LBRY Credits
kH/s, More Is Better
Linux 5.11 Git: 132477 (SE +/- 1036.73, N = 3)
Linux 5.10: 135602 (SE +/- 1088.59, N = 15)
Linux 5.11 Patched: 139037 (SE +/- 1380.06, N = 3)
(CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Chaos Group V-RAY

Chaos Group V-RAY 4.10.07, Mode: CPU
Ksamples Per Watt, More Is Better
Linux 5.11 Patched: 127.59
Linux 5.10: 129.30
Linux 5.11 Git: 130.07

Chaos Group V-RAY 4.10.07, Mode: CPU
Ksamples, More Is Better
Linux 5.11 Patched: 53460 (SE +/- 1018.74, N = 13)
Linux 5.10: 54137 (SE +/- 624.59, N = 3)
Linux 5.11 Git: 54803 (SE +/- 504.01, N = 3)

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

BYTE Unix Benchmark 3.6, Computational Test: Dhrystone 2
LPS, More Is Better
Linux 5.10: 37386556.6 (SE +/- 402627.98, N = 3)
Linux 5.11 Git: 38181643.6 (SE +/- 411547.50, N = 3)
Linux 5.11 Patched: 38319339.8 (SE +/- 341040.65, N = 3)

LuxCoreRender

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on the CPU as opposed to the OpenCL version. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.3Scene: DLSCLinux 5.10Linux 5.11 PatchedLinux 5.11 Git246810SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 37.757.807.81MIN: 7.57 / MAX: 8.18MIN: 7.61 / MAX: 8.59MIN: 7.68 / MAX: 8.29

LuxCoreRender 2.3, Scene: Rainbow Colors and Prism (M samples/sec, More Is Better)
Linux 5.11 Git: 8.72 (SE +/- 0.09, N = 5; MIN: 8.07 / MAX: 9.01)
Linux 5.11 Patched: 8.76 (SE +/- 0.09, N = 3; MIN: 8.31 / MAX: 8.97)
Linux 5.10: 8.83 (SE +/- 0.04, N = 3; MIN: 8.34 / MAX: 8.96)

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

IOR 3.3.0, Block Size: 2MB - Disk Target: Default Test Directory (MB/s, More Is Better)
Linux 5.11 Patched: 475.25 (SE +/- 2.06, N = 3; MIN: 400.96 / MAX: 971.55)
Linux 5.11 Git: 505.19 (SE +/- 1.77, N = 3; MIN: 457.62 / MAX: 951.11)
Linux 5.10: 517.13 (SE +/- 1.76, N = 3; MIN: 453.79 / MAX: 894.75)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR 3.3.0, Block Size: 8MB - Disk Target: Default Test Directory (MB/s, More Is Better)
Linux 5.11 Patched: 520.72 (SE +/- 2.63, N = 3; MIN: 176.53 / MAX: 1089.46)
Linux 5.10: 529.43 (SE +/- 5.81, N = 3; MIN: 260.25 / MAX: 972.23)
Linux 5.11 Git: 531.09 (SE +/- 2.21, N = 3; MIN: 489.6 / MAX: 1034.89)
1. (CC) gcc options: -O2 -lm -pthread -lmpi

Zstd Compression

Zstd Compression 1.4.5, Compression Level: 3 (MB/s Per Watt, More Is Better)
Linux 5.11 Git: 53.45
Linux 5.10: 53.66
Linux 5.11 Patched: 54.88

Zstd Compression 1.4.5, Compression Level: 3 (MB/s, More Is Better)
Linux 5.10: 8195.3 (SE +/- 59.38, N = 3)
Linux 5.11 Git: 8205.2 (SE +/- 30.36, N = 3)
Linux 5.11 Patched: 8270.5 (SE +/- 69.07, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

LZ4 Compression

LZ4 Compression 1.9.3, Compression Level: 1 - Decompression Speed (MB/s Per Watt, More Is Better)
Linux 5.11 Git: 81.09
Linux 5.11 Patched: 81.34
Linux 5.10: 81.53

LZ4 Compression 1.9.3, Compression Level: 1 - Compression Speed (MB/s, More Is Better)
Linux 5.11 Git: 9622.99 (SE +/- 96.44, N = 3)
Linux 5.10: 9668.50 (SE +/- 75.81, N = 3)
Linux 5.11 Patched: 9757.17 (SE +/- 19.43, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression 1.9.3, Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
Linux 5.10: 11264.4 (SE +/- 46.95, N = 3)
Linux 5.11 Patched: 11305.0 (SE +/- 25.21, N = 3)
Linux 5.11 Git: 11334.0 (SE +/- 46.17, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression 1.9.3, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
Linux 5.10: 48.89 (SE +/- 0.03, N = 3)
Linux 5.11 Patched: 48.95 (SE +/- 0.12, N = 3)
Linux 5.11 Git: 49.43 (SE +/- 0.56, N = 4)
1. (CC) gcc options: -O3

LZ4 Compression 1.9.3, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
Linux 5.11 Git: 10404.5 (SE +/- 38.32, N = 4)
Linux 5.10: 10527.2 (SE +/- 24.78, N = 3)
Linux 5.11 Patched: 10666.0 (SE +/- 60.61, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression 1.9.3, Compression Level: 9 - Compression Speed (MB/s, More Is Better)
Linux 5.11 Git: 47.63 (SE +/- 0.03, N = 3)
Linux 5.11 Patched: 47.76 (SE +/- 0.15, N = 3)
Linux 5.10: 47.79 (SE +/- 0.41, N = 15)
1. (CC) gcc options: -O3

LZ4 Compression 1.9.3, Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
Linux 5.11 Git: 10488.0 (SE +/- 172.84, N = 3)
Linux 5.11 Patched: 10489.8 (SE +/- 31.24, N = 3)
Linux 5.10: 10505.7 (SE +/- 33.18, N = 15)
1. (CC) gcc options: -O3

QuantLib

QuantLib is an open-source library/framework for quantitative finance, covering modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.21 (MFLOPS, More Is Better)
Linux 5.11 Git: 2149.8 (SE +/- 13.13, N = 3)
Linux 5.10: 2156.4 (SE +/- 14.69, N = 3)
Linux 5.11 Patched: 2157.2 (SE +/- 8.27, N = 3)
1. (CXX) g++ options: -O3 -march=native -rdynamic

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

FFTE 7.0, N=256, 3D Complex FFT Routine (MFLOPS, More Is Better)
Linux 5.11 Git: 174206.13 (SE +/- 1640.30, N = 15)
Linux 5.11 Patched: 178738.12 (SE +/- 1760.31, N = 15)
Linux 5.10: 182254.44 (SE +/- 1647.45, N = 15)
1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)
Linux 5.11 Patched: 17015 (SE +/- 213.45, N = 3)
Linux 5.10: 17440 (SE +/- 280.66, N = 6)
Linux 5.11 Git: 18468 (SE +/- 24.98, N = 3)
1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
Linux 5.10: 4191.65 (SE +/- 52.07, N = 4)
Linux 5.11 Git: 4233.51 (SE +/- 35.16, N = 8)
Linux 5.11 Patched: 4286.63 (SE +/- 25.26, N = 3)
1. (CC) gcc options: -O3 -mavx2

ASKAP

This is a CUDA benchmark of ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP 2018-11-10, Test: tConvolve MPI - Gridding (Million Grid Points Per Second, More Is Better)
Linux 5.11 Git: 7426.08 (SE +/- 2.49, N = 3)
Linux 5.11 Patched: 7453.51 (SE +/- 2.90, N = 3)
Linux 5.10: 7498.71 (SE +/- 3.88, N = 3)
1. (CXX) g++ options: -lpthread

ASKAP 2018-11-10, Test: tConvolve MPI - Degridding (Million Grid Points Per Second, More Is Better)
Linux 5.11 Git: 11870.3 (SE +/- 7.33, N = 3)
Linux 5.10: 11918.2 (SE +/- 3.70, N = 3)
Linux 5.11 Patched: 11944.2 (SE +/- 6.47, N = 3)
1. (CXX) g++ options: -lpthread

Etcpak

Etcpak 0.7, Configuration: ETC1 (Mpx/s Per Watt, More Is Better)
Linux 5.11 Git: 2.07
Linux 5.11 Patched: 2.10
Linux 5.10: 2.11

Etcpak 0.7, Configuration: ETC1 + Dithering (Mpx/s, More Is Better)
Linux 5.11 Git: 244.79 (SE +/- 0.10, N = 3)
Linux 5.10: 245.50 (SE +/- 0.11, N = 3)
Linux 5.11 Patched: 245.60 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7, Configuration: ETC1 (Mpx/s, More Is Better)
Linux 5.11 Git: 266.32 (SE +/- 0.24, N = 3)
Linux 5.10: 267.32 (SE +/- 0.24, N = 3)
Linux 5.11 Patched: 267.59 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak 0.7, Configuration: ETC2 (Mpx/s, More Is Better)
Linux 5.11 Git: 155.18 (SE +/- 0.05, N = 3)
Linux 5.10: 155.79 (SE +/- 0.03, N = 3)
Linux 5.11 Patched: 155.80 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LeelaChessZero

LeelaChessZero 0.26, Backend: BLAS (Nodes Per Second Per Watt, More Is Better)
Linux 5.10: 9.64
Linux 5.11 Git: 9.69
Linux 5.11 Patched: 9.72

LeelaChessZero 0.26, Backend: BLAS (Nodes Per Second, More Is Better)
Linux 5.11 Patched: 4061 (SE +/- 49.90, N = 9)
Linux 5.11 Git: 4106 (SE +/- 50.84, N = 3)
Linux 5.10: 4112 (SE +/- 11.55, N = 3)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.26, Backend: Eigen (Nodes Per Second, More Is Better)
Linux 5.11 Git: 4284 (SE +/- 49.20, N = 4)
Linux 5.10: 4379 (SE +/- 32.10, N = 3)
Linux 5.11 Patched: 4433 (SE +/- 36.23, N = 3)
1. (CXX) g++ options: -flto -pthread

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 12, Total Time (Nodes Per Second, More Is Better)
Linux 5.10: 95353263 (SE +/- 1090218.44, N = 3)
Linux 5.11 Patched: 97042601 (SE +/- 769788.53, N = 3)
Linux 5.11 Git: 97945089 (SE +/- 1123146.37, N = 3)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
Linux 5.11 Patched: 1114562 (SE +/- 609.97, N = 12)
Linux 5.10: 1115125 (SE +/- 516.70, N = 12)
Linux 5.11 Git: 1115133 (SE +/- 1015.01, N = 12)
1. (CC) gcc options: -O3 -march=native

asmFish

asmFish 2018-07-23, 1024 Hash Memory, 26 Depth (Nodes/second Per Watt, More Is Better)
Linux 5.10: 246434.29
Linux 5.11 Git: 251756.74
Linux 5.11 Patched: 259519.18

asmFish 2018-07-23, 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
Linux 5.10: 116201937 (SE +/- 1158651.23, N = 3)
Linux 5.11 Git: 117310985 (SE +/- 865236.83, N = 3)
Linux 5.11 Patched: 117632955 (SE +/- 358515.89, N = 3)

GROMACS

GROMACS 2020.3, Water Benchmark (Ns Per Day Per Watt, More Is Better)
Linux 5.10: 0.01
Linux 5.11 Git: 0.01
Linux 5.11 Patched: 0.01

GROMACS 2020.3, Water Benchmark (Ns Per Day, More Is Better)
Linux 5.11 Git: 5.239 (SE +/- 0.039, N = 3)
Linux 5.10: 5.257 (SE +/- 0.013, N = 3)
Linux 5.11 Patched: 5.261 (SE +/- 0.022, N = 3)
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: Rhodopsin Protein (ns/day Per Watt, More Is Better)
Linux 5.11 Git: 0.12
Linux 5.10: 0.13
Linux 5.11 Patched: 0.13

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: Rhodopsin Protein (ns/day, More Is Better)
Linux 5.11 Git: 21.13 (SE +/- 0.23, N = 15)
Linux 5.10: 23.57 (SE +/- 0.23, N = 15)
Linux 5.11 Patched: 23.79 (SE +/- 0.17, N = 12)
1. (CXX) g++ options: -O3 -pthread -lm

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: 20k Atoms (ns/day, More Is Better)
Linux 5.11 Git: 24.99 (SE +/- 0.09, N = 3)
Linux 5.10: 25.02 (SE +/- 0.06, N = 3)
Linux 5.11 Patched: 25.08 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -pthread -lm

Swet

Swet is a synthetic CPU/RAM benchmark that includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.

Swet 1.5.16, Average (Operations Per Second, More Is Better)
Linux 5.10: 683534408 (SE +/- 4611428.30, N = 3)
Linux 5.11 Git: 685888863 (SE +/- 5135867.19, N = 3)
Linux 5.11 Patched: 687480262 (SE +/- 1732499.41, N = 3)
1. (CC) gcc options: -lm -lpthread -lcurses -lrt

KeyDB

KeyDB 6.0.16 (Ops/sec Per Watt, More Is Better)
Linux 5.10: 1736.19
Linux 5.11 Patched: 1865.16
Linux 5.11 Git: 1929.56

KeyDB 6.0.16 (Ops/sec, More Is Better)
Linux 5.10: 280163.95 (SE +/- 3843.85, N = 3)
Linux 5.11 Patched: 294214.37 (SE +/- 3012.50, N = 15)
Linux 5.11 Git: 302893.56 (SE +/- 4239.68, N = 15)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
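
Where a test lists both a raw result and a "Per Watt" result, the two can be combined to estimate the average CPU power draw during the run. The sketch below does this for the KeyDB figures. It assumes the per-watt value is simply the raw result divided by the average power reported by the AMD_Energy interface, which this page does not state explicitly; the helper name `implied_average_watts` is hypothetical.

```python
def implied_average_watts(result, result_per_watt):
    """Estimate average power as result / (result per watt).

    Assumption: the per-watt figure was computed as result / average watts.
    """
    return result / result_per_watt


# (Ops/sec, Ops/sec Per Watt) pairs copied from the KeyDB 6.0.16 results.
keydb = {
    "Linux 5.10": (280163.95, 1736.19),
    "Linux 5.11 Patched": (294214.37, 1865.16),
    "Linux 5.11 Git": (302893.56, 1929.56),
}

implied = {
    kernel: implied_average_watts(ops, ops_per_watt)
    for kernel, (ops, ops_per_watt) in keydb.items()
}

for kernel, watts in implied.items():
    print(f"{kernel}: about {watts:.1f} W")
```

Under that assumption, all three kernels land within a few watts of each other for this test, so the per-watt differences mostly track the throughput differences.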

Hierarchical INTegration

Hierarchical INTegration 1.0, Test: FLOAT (QUIPs Per Watt, More Is Better)
Linux 5.11 Patched: 2496420.85
Linux 5.10: 2514775.52
Linux 5.11 Git: 2518730.77

Hierarchical INTegration 1.0, Test: FLOAT (QUIPs, More Is Better)
Linux 5.11 Git: 322702844.18 (SE +/- 121683.80, N = 3)
Linux 5.10: 322922449.17 (SE +/- 102987.33, N = 3)
Linux 5.11 Patched: 323144417.17 (SE +/- 122621.60, N = 3)
1. (CC) gcc options: -O3 -march=native -lm

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 1.9.0-jumbo-1, Test: MD5 (Real C/S, More Is Better)
Linux 5.11 Git: 4550333 (SE +/- 49184.46, N = 3)
Linux 5.11 Patched: 4612308 (SE +/- 54344.04, N = 13)
Linux 5.10: 4780667 (SE +/- 8171.77, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

John The Ripper 1.9.0-jumbo-1, Test: Blowfish (Real C/S, More Is Better)
Linux 5.11 Git: 70824 (SE +/- 327.40, N = 3)
Linux 5.10: 72626 (SE +/- 25.05, N = 3)
Linux 5.11 Patched: 72636 (SE +/- 73.45, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

Redis

Redis 6.0.9, Test: SET (Requests Per Second Per Watt, More Is Better)
Linux 5.11 Git: 11015.92
Linux 5.11 Patched: 11500.61
Linux 5.10: 11558.75

Redis 6.0.9, Test: GET (Requests Per Second Per Watt, More Is Better)
Linux 5.11 Git: 13515.17
Linux 5.11 Patched: 13846.04
Linux 5.10: 13922.68

Redis 6.0.9, Test: SET (Requests Per Second, More Is Better)
Linux 5.11 Git: 1380890.22 (SE +/- 10410.66, N = 15)
Linux 5.11 Patched: 1427348.10 (SE +/- 13176.39, N = 15)
Linux 5.10: 1429370.71 (SE +/- 13292.58, N = 7)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9, Test: GET (Requests Per Second, More Is Better)
Linux 5.11 Git: 1689203.10 (SE +/- 6519.22, N = 4)
Linux 5.11 Patched: 1711621.52 (SE +/- 12716.89, N = 11)
Linux 5.10: 1716829.23 (SE +/- 17472.07, N = 15)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9, Test: LPUSH (Requests Per Second Per Watt, More Is Better)
Linux 5.11 Git: 9715.03
Linux 5.10: 9731.23
Linux 5.11 Patched: 9780.17

Redis 6.0.9, Test: SADD (Requests Per Second Per Watt, More Is Better)
Linux 5.11 Git: 12420.91
Linux 5.10: 12654.69
Linux 5.11 Patched: 13036.25

Redis 6.0.9, Test: LPUSH (Requests Per Second, More Is Better)
Linux 5.10: 1206370.59 (SE +/- 19431.53, N = 14)
Linux 5.11 Patched: 1217218.75 (SE +/- 13782.67, N = 3)
Linux 5.11 Git: 1220973.71 (SE +/- 11246.95, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis 6.0.9, Test: SADD (Requests Per Second, More Is Better)
Linux 5.11 Git: 1539146.21 (SE +/- 16361.41, N = 3)
Linux 5.10: 1563597.66 (SE +/- 16159.15, N = 3)
Linux 5.11 Patched: 1611164.34 (SE +/- 15585.71, N = 4)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Numpy Benchmark

Numpy Benchmark (Score Per Watt, More Is Better)
Linux 5.11 Git: 2.42
Linux 5.11 Patched: 2.43
Linux 5.10: 2.45

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha 0.1.2, Device Inference Score (Score, More Is Better)
Linux 5.10: 1593
Linux 5.11 Git: 1697
Linux 5.11 Patched: 1720

AI Benchmark Alpha 0.1.2, Device Training Score (Score, More Is Better)
Linux 5.10: 1028
Linux 5.11 Git: 1059
Linux 5.11 Patched: 1067

AI Benchmark Alpha 0.1.2, Device AI Score (Score, More Is Better)
Linux 5.10: 2621
Linux 5.11 Git: 2756
Linux 5.11 Patched: 2787

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)
Linux 5.10: 322.74 (SE +/- 0.06, N = 3)
Linux 5.11 Patched: 323.00 (SE +/- 0.23, N = 3)
Linux 5.11 Git: 323.93 (SE +/- 1.74, N = 3)

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

CLOMP 1.2, Static OMP Speedup (Speedup, More Is Better)
Linux 5.10: 41.6 (SE +/- 0.78, N = 15)
Linux 5.11 Git: 43.9 (SE +/- 0.60, N = 3)
Linux 5.11 Patched: 47.8 (SE +/- 0.47, N = 3)
1. (CC) gcc options: -fopenmp -O3 -lm

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s Per Watt, More Is Better)
Linux 5.11 Git: 17.43
Linux 5.10: 17.52
Linux 5.11 Patched: 17.62

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s Per Watt, More Is Better)
Linux 5.11 Patched: 10.24
Linux 5.11 Git: 10.25
Linux 5.10: 10.42

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s Per Watt, More Is Better)
Linux 5.11 Git: 440.23
Linux 5.10: 449.55
Linux 5.11 Patched: 469.62

NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
Linux 5.11 Git: 3788.88 (SE +/- 10.26, N = 10)
Linux 5.10: 3825.27 (SE +/- 4.54, N = 10)
Linux 5.11 Patched: 3841.48 (SE +/- 5.01, N = 10)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
Linux 5.11 Git: 3854.60 (SE +/- 8.23, N = 3)
Linux 5.11 Patched: 3863.45 (SE +/- 2.97, N = 3)
Linux 5.10: 3871.58 (SE +/- 4.54, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
Linux 5.11 Git: 147443.86 (SE +/- 1780.52, N = 15)
Linux 5.10: 152840.60 (SE +/- 547.25, N = 4)
Linux 5.11 Patched: 154376.76 (SE +/- 509.59, N = 4)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

InfluxDB

This is a benchmark of the InfluxDB open-source time-series database optimized for fast, high-availability storage for IoT and other use-cases. The InfluxDB test profile makes use of InfluxDB Inch for facilitating the benchmarks. Learn more via the OpenBenchmarking.org test page.

InfluxDB 1.8.2, Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 (val/sec, More Is Better)
Linux 5.10: 1223604.8 (SE +/- 993.12, N = 3)
Linux 5.11 Git: 1231991.2 (SE +/- 6204.63, N = 3)
Linux 5.11 Patched: 1256112.1 (SE +/- 2545.78, N = 3)

InfluxDB 1.8.2, Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 (val/sec Per Watt, More Is Better)
Linux 5.11 Patched: 4857.31
Linux 5.10: 4903.04
Linux 5.11 Git: 5060.30

InfluxDB 1.8.2, Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 (val/sec, More Is Better)
Linux 5.11 Git: 807463.1 (SE +/- 2183.04, N = 3)
Linux 5.11 Patched: 812193.6 (SE +/- 1525.09, N = 3)
Linux 5.10: 818275.3 (SE +/- 1079.29, N = 3)

BRL-CAD

BRL-CAD 7.30.8, VGR Performance Metric (VGR Performance Metric Per Watt, More Is Better)
Linux 5.10: 1313.98
Linux 5.11 Git: 1355.95
Linux 5.11 Patched: 1369.51

BRL-CAD 7.30.8, VGR Performance Metric (VGR Performance Metric, More Is Better)
Linux 5.10: 615930
Linux 5.11 Patched: 636521
Linux 5.11 Git: 638971
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Google SynthMark 20201109, Test: VoiceMark_100 (Voices, More Is Better)
Linux 5.11 Git: 712.25 (SE +/- 1.12, N = 3)
Linux 5.11 Patched: 714.91 (SE +/- 0.04, N = 3)
Linux 5.10: 715.50 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, More Is Better)
Linux 5.10: 18424.98 (SE +/- 149.06, N = 5)
Linux 5.11 Git: 19576.12 (SE +/- 67.78, N = 5)
Linux 5.11 Patched: 19771.22 (SE +/- 171.84, N = 5)
1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
Linux 5.11 Git: 0.45451 (SE +/- 0.00311, N = 3)
Linux 5.11 Patched: 0.44472 (SE +/- 0.00005, N = 3)
Linux 5.10: 0.44418 (SE +/- 0.00029, N = 3)

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2020-08-23, Model: Mobilenet Float (Microseconds, Fewer Is Better)
Linux 5.11 Git: 46659.2 (SE +/- 1144.94, N = 15)
Linux 5.10: 43015.3 (SE +/- 344.62, N = 15)
Linux 5.11 Patched: 39523.5 (SE +/- 395.37, N = 3)

TensorFlow Lite 2020-08-23, Model: Mobilenet Quant (Microseconds, Fewer Is Better)
Linux 5.11 Git: 45083.9 (SE +/- 759.19, N = 15)
Linux 5.10: 43716.3 (SE +/- 465.03, N = 4)
Linux 5.11 Patched: 41034.0 (SE +/- 400.94, N = 6)

TensorFlow Lite 2020-08-23, Model: NASNet Mobile (Microseconds, Fewer Is Better)
Linux 5.11 Git: 189771 (SE +/- 7366.43, N = 15)
Linux 5.10: 157245 (SE +/- 3714.56, N = 15)
Linux 5.11 Patched: 134044 (SE +/- 2393.85, N = 15)

TensorFlow Lite 2020-08-23, Model: SqueezeNet (Microseconds, Fewer Is Better)
Linux 5.11 Git: 65193.0 (SE +/- 690.93, N = 3)
Linux 5.11 Patched: 62195.4 (SE +/- 412.91, N = 15)
Linux 5.10: 61294.2 (SE +/- 72.63, N = 3)

TensorFlow Lite 2020-08-23, Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
Linux 5.11 Git: 765726 (SE +/- 4257.59, N = 3)
Linux 5.10: 755450 (SE +/- 1447.51, N = 3)
Linux 5.11 Patched: 736285 (SE +/- 5824.36, N = 9)

TensorFlow Lite 2020-08-23, Model: Inception V4 (Microseconds, Fewer Is Better)
Linux 5.11 Git: 894640 (SE +/- 2435.29, N = 3)
Linux 5.10: 835208 (SE +/- 4174.26, N = 3)
Linux 5.11 Patched: 810750 (SE +/- 1163.43, N = 3)
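
For "Fewer Is Better" results such as these inference times, a negative percent change against the Linux 5.10 baseline means the newer kernel is faster. The following is a minimal sketch of that calculation using two of the TensorFlow Lite results; the helper name `percent_change` is hypothetical.

```python
def percent_change(baseline, candidate):
    """Percent change of candidate relative to baseline (negative = lower)."""
    return (candidate - baseline) / baseline * 100.0


# NASNet Mobile (microseconds): Linux 5.10 vs. Linux 5.11 Patched.
nasnet = percent_change(157245, 134044)

# Inception V4 (microseconds): Linux 5.10 vs. Linux 5.11 Patched.
inception_v4 = percent_change(835208, 810750)

print(f"NASNet Mobile: {nasnet:+.2f}%")
print(f"Inception V4: {inception_v4:+.2f}%")
```

The same function works for "More Is Better" metrics; only the interpretation of the sign flips.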

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU with OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with an analytic European option engine, a QMC (Sobol) Monte-Carlo method (equity option example), bonds (fixed-rate bond with flat forward curve), and repo (securities repurchase agreement). FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25, Benchmark: Bonds OpenMP (ms, Fewer Is Better)
Linux 5.11 Git: 57601.36 (SE +/- 721.39, N = 3)
Linux 5.10: 57310.35 (SE +/- 245.29, N = 3)
Linux 5.11 Patched: 56769.45 (SE +/- 598.50, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench 2016-07-25, Benchmark: Repo OpenMP (ms, Fewer Is Better)
Linux 5.10: 41287.10 (SE +/- 215.27, N = 3)
Linux 5.11 Git: 40124.37 (SE +/- 319.03, N = 3)
Linux 5.11 Patched: 39406.76 (SE +/- 393.10, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3, Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
Linux 5.11 Git: 303.45 (SE +/- 3.80, N = 3; MIN: 284.51 / MAX: 461.21)
Linux 5.10: 291.43 (SE +/- 0.66, N = 3; MIN: 283.33 / MAX: 459.62)
Linux 5.11 Patched: 289.76 (SE +/- 2.83, N = 3; MIN: 283.65 / MAX: 458.79)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN 0.2.3, Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
Linux 5.11 Git: 274.94 (SE +/- 0.39, N = 3; MIN: 273.57 / MAX: 276.21)
Linux 5.11 Patched: 274.87 (SE +/- 0.68, N = 3; MIN: 273.07 / MAX: 276.74)
Linux 5.10: 274.28 (SE +/- 0.07, N = 3; MIN: 273.58 / MAX: 275.28)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.0, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Linux 5.11 Git: 0.914198 (SE +/- 0.006064, N = 7; MIN: 0.78)
Linux 5.10: 0.872441 (SE +/- 0.003739, N = 7; MIN: 0.79)
Linux 5.11 Patched: 0.863782 (SE +/- 0.001510, N = 7; MIN: 0.79)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0, Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Linux 5.11 Git: 2.40549 (SE +/- 0.03372, N = 3; MIN: 1.92)
Linux 5.11 Patched: 2.33290 (SE +/- 0.01587, N = 3; MIN: 2)
Linux 5.10: 2.32899 (SE +/- 0.02538, N = 3; MIN: 1.93)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Linux 5.11 Git: 1.62088 (SE +/- 0.01518, N = 4; MIN: 1.31)
Linux 5.10: 1.56970 (SE +/- 0.00943, N = 4; MIN: 1.31)
Linux 5.11 Patched: 1.55447 (SE +/- 0.01340, N = 4; MIN: 1.29)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0, Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Linux 5.10: 0.888428 (SE +/- 0.002999, N = 5; MIN: 0.77)
Linux 5.11 Git: 0.881348 (SE +/- 0.005127, N = 5; MIN: 0.71)
Linux 5.11 Patched: 0.849248 (SE +/- 0.004000, N = 5; MIN: 0.73)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Linux 5.11 Git: 0.547674 (SE +/- 0.005010, N = 4; MIN: 0.43)
Linux 5.11 Patched: 0.521968 (SE +/- 0.004601, N = 4; MIN: 0.43)
Linux 5.10: 0.519015 (SE +/- 0.004888, N = 4; MIN: 0.43)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0, Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Linux 5.11 Git: 1317.40 (SE +/- 26.23, N = 15; MIN: 1147.91)
Linux 5.10: 1158.35 (SE +/- 14.39, N = 15; MIN: 1073.97)
Linux 5.11 Patched: 1123.32 (SE +/- 3.00, N = 3; MIN: 1077.56)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: H2 (msec, Fewer Is Better)
Linux 5.10: 5654 (SE +/- 132.95, N = 20)
Linux 5.11 Git: 5310 (SE +/- 36.45, N = 20)
Linux 5.11 Patched: 5217 (SE +/- 73.65, N = 20)

DaCapo Benchmark 9.12-MR1, Java Test: Jython (msec, Fewer Is Better)
Linux 5.10: 4951 (SE +/- 47.90, N = 6)
Linux 5.11 Git: 4897 (SE +/- 28.66, N = 18)
Linux 5.11 Patched: 4778 (SE +/- 43.93, N = 6)

DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans (msec, Fewer Is Better)
Linux 5.10: 6032 (SE +/- 65.72, N = 4)
Linux 5.11 Git: 5954 (SE +/- 50.83, N = 20)
Linux 5.11 Patched: 5591 (SE +/- 66.39, N = 20)

DaCapo Benchmark 9.12-MR1, Java Test: Tradesoap (msec, Fewer Is Better)
Linux 5.11 Git: 5170 (SE +/- 44.82, N = 4)
Linux 5.10: 5150 (SE +/- 38.07, N = 20)
Linux 5.11 Patched: 5148 (SE +/- 61.21, N = 4)

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation 3.2.3, Time To Compile (Seconds, Fewer Is Better)
Linux 5.10: 61.44 (SE +/- 0.28, N = 3)
Linux 5.11 Git: 60.86 (SE +/- 0.10, N = 3)
Linux 5.11 Patched: 59.18 (SE +/- 0.17, N = 3)

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed GDB GNU Debugger Compilation 9.1, Time To Compile (Seconds, Fewer Is Better)
Linux 5.10: 102.50 (SE +/- 0.64, N = 3)
Linux 5.11 Git: 97.64 (SE +/- 0.40, N = 3)
Linux 5.11 Patched: 92.92 (SE +/- 0.43, N = 3)

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.4, Time To Compile (Seconds, Fewer Is Better)
Linux 5.10: 26.11 (SE +/- 0.16, N = 13)
Linux 5.11 Git: 25.92 (SE +/- 0.17, N = 12)
Linux 5.11 Patched: 25.75 (SE +/- 0.20, N = 9)

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 10.0, Time To Compile (Seconds, Fewer Is Better)
Linux 5.11 Git: 210.35 (SE +/- 1.36, N = 3)
Linux 5.10: 209.57 (SE +/- 0.93, N = 3)
Linux 5.11 Patched: 208.79 (SE +/- 0.79, N = 3)

Cython Benchmark

Cython provides a superset of Python that is geared to deliver C-like levels of performance. This test profile makes use of Cython's bundled benchmark tests and runs an N-Queens sample test as a simple benchmark to the system's Cython performance. Learn more via the OpenBenchmarking.org test page.

Cython Benchmark 0.29.21, Test: N-Queens (Seconds, Fewer Is Better)
Linux 5.11 Git: 26.95 (SE +/- 0.16, N = 3)
Linux 5.10: 26.89 (SE +/- 0.28, N = 3)
Linux 5.11 Patched: 26.60 (SE +/- 0.22, N = 3)

Gcrypt Library

Libgcrypt is a general-purpose cryptographic library developed as part of the GnuPG project. This test runs libgcrypt's integrated benchmark and measures the time to run the benchmark command with a cipher/mac/hash repetition count of 50, as a simple, high-level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

Gcrypt Library 1.9 (Seconds, Fewer Is Better)
Linux 5.11 Git: 233.83 (SE +/- 0.85, N = 3)
Linux 5.10: 233.10 (SE +/- 0.16, N = 3)
Linux 5.11 Patched: 232.54 (SE +/- 0.81, N = 3)
1. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

GnuPG

This test times how long it takes to encrypt a sample file using GnuPG. Learn more via the OpenBenchmarking.org test page.

GnuPG 2.2.27, 2.7GB Sample File Encryption (Seconds, Fewer Is Better)
Linux 5.11 Git: 77.30 (SE +/- 0.20, N = 3)
Linux 5.11 Patched: 77.18 (SE +/- 0.20, N = 3)
Linux 5.10: 77.11 (SE +/- 0.47, N = 3)
1. (CC) gcc options: -O2

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenFOAM 8 - Input: Motorbike 30M (Seconds, Fewer Is Better)
  Linux 5.11 Git: 18.71 (SE +/- 0.14, N = 3)
  Linux 5.10: 18.32 (SE +/- 0.08, N = 3)
  Linux 5.11 Patched: 18.30 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM 8 - Input: Motorbike 60M (Seconds, Fewer Is Better)
  Linux 5.10: 130.09 (SE +/- 0.04, N = 3)
  Linux 5.11 Git: 129.67 (SE +/- 0.09, N = 3)
  Linux 5.11 Patched: 128.28 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Quantum ESPRESSO

Quantum ESPRESSO is an integrated suite of Open-Source computer codes for electronic-structure calculations and materials modeling at the nanoscale. It is based on density-functional theory, plane waves, and pseudopotentials. Learn more via the OpenBenchmarking.org test page.

Quantum ESPRESSO 6.7 - Input: AUSURF112 (Seconds, Fewer Is Better)
  Linux 5.11 Git: 1217.49 (SE +/- 11.28, N = 3)
  Linux 5.10: 1197.60 (SE +/- 17.89, N = 9)
  Linux 5.11 Patched: 1171.03 (SE +/- 12.21, N = 4)
1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

RELION

RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.

RELION 3.1.1 - Test: Basic - Device: CPU (Seconds, Fewer Is Better)
  Linux 5.10: 351.12 (SE +/- 3.11, N = 3)
  Linux 5.11 Git: 349.85 (SE +/- 2.94, N = 9)
  Linux 5.11 Patched: 348.29 (SE +/- 2.97, N = 9)
1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

ASTC Encoder

ASTC Encoder (astcenc) is an encoder for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile runs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Thorough (Seconds, Fewer Is Better)
  Linux 5.11 Git: 5.69 (SE +/- 0.00, N = 5)
  Linux 5.10: 5.69 (SE +/- 0.00, N = 5)
  Linux 5.11 Patched: 5.65 (SE +/- 0.00, N = 5)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder 2.0 - Preset: Exhaustive (Seconds, Fewer Is Better)
  Linux 5.11 Git: 41.17 (SE +/- 0.03, N = 3)
  Linux 5.10: 41.12 (SE +/- 0.04, N = 3)
  Linux 5.11 Patched: 40.97 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.90 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
  Linux 5.11 Git: 158.64 (SE +/- 0.14, N = 3)
  Linux 5.10: 158.16 (SE +/- 0.93, N = 3)
  Linux 5.11 Patched: 156.83 (SE +/- 0.13, N = 3)

Build2

This test profile measures the time to bootstrap and install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code that offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.13 - Time To Compile (Seconds, Fewer Is Better)
  Linux 5.11 Git: 68.14 (SE +/- 0.67, N = 3)
  Linux 5.10: 67.94 (SE +/- 0.41, N = 3)
  Linux 5.11 Patched: 67.32 (SE +/- 0.52, N = 3)

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code built on modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the computational fluid dynamics demos bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

Dolfyn 0.527 - Computational Fluid Dynamics (Seconds, Fewer Is Better)
  Linux 5.11 Git: 18.72 (SE +/- 0.01, N = 3)
  Linux 5.10: 18.71 (SE +/- 0.01, N = 3)
  Linux 5.11 Patched: 18.65 (SE +/- 0.03, N = 3)

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7 - Trace Time (Seconds, Fewer Is Better)
  Linux 5.11 Git: 11.49 (SE +/- 0.05, N = 4)
  Linux 5.10: 11.41 (SE +/- 0.07, N = 4)
  Linux 5.11 Patched: 11.31 (SE +/- 0.05, N = 4)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

YafaRay

YafaRay is an open-source, physically based Monte Carlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.

YafaRay 3.4.1 - Total Time For Sample Scene (Seconds, Fewer Is Better)
  Linux 5.11 Patched: 87.14 (SE +/- 0.85, N = 15)
  Linux 5.10: 85.79 (SE +/- 1.01, N = 15)
  Linux 5.11 Git: 84.74 (SE +/- 0.38, N = 3)
1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

GPAW 20.1 - Input: Carbon Nanotube (Seconds, Fewer Is Better)
  Linux 5.11 Git: 60.70 (SE +/- 0.18, N = 3)
  Linux 5.11 Patched: 59.85 (SE +/- 0.39, N = 3)
  Linux 5.10: 59.52 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -pthread -shared -fwrapv -O2 -lxc -lblas -lmpi

Timed MrBayes Analysis

This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  Linux 5.11 Git: 82.65 (SE +/- 0.08, N = 3)
  Linux 5.11 Patched: 82.04 (SE +/- 0.29, N = 3)
  Linux 5.10: 81.85 (SE +/- 0.13, N = 3)
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

Nebular Empirical Analysis Tool

NEAT is the Nebular Empirical Analysis Tool for empirical analysis of ionised nebulae, with uncertainty propagation. Learn more via the OpenBenchmarking.org test page.

Nebular Empirical Analysis Tool 2020-02-29 (Seconds, Fewer Is Better)
  Linux 5.10: 28.27 (SE +/- 0.32, N = 3)
  Linux 5.11 Git: 27.07 (SE +/- 0.26, N = 15)
  Linux 5.11 Patched: 24.63 (SE +/- 0.56, N = 12)
1. (F9X) gfortran options: -cpp -ffree-line-length-0 -Jsource/ -fopenmp -O3 -fno-backtrace

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

Primesieve 7.4 - 1e12 Prime Number Generation (Seconds, Fewer Is Better)
  Linux 5.10: 4.612 (SE +/- 0.012, N = 8)
  Linux 5.11 Git: 4.540 (SE +/- 0.012, N = 8)
  Linux 5.11 Patched: 4.535 (SE +/- 0.015, N = 8)
1. (CXX) g++ options: -O3 -lpthread
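As background on the algorithm named in the Primesieve description, the following is a textbook sieve of Eratosthenes in Python. It is only a sketch of the basic idea. Primesieve itself is a heavily optimized implementation, and nothing here reflects its actual code. The function name `primes_up_to` is a placeholder chosen for this example.

```python
def primes_up_to(limit):
    """Return all primes <= limit using a basic sieve of Eratosthenes."""
    if limit < 2:
        return []
    # One flag per integer 0..limit; 1 means "still considered prime".
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    p = 2
    while p * p <= limit:
        if is_prime[p]:
            # Cross out multiples of p, starting at p*p (smaller ones are already marked).
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p::p] = bytes(count)
        p += 1
    return [n for n, flag in enumerate(is_prime) if flag]

print(primes_up_to(30))  # prints [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
```

The flag array is touched in strided passes, which is why a sieve workload tends to be sensitive to cache behavior, as the test description notes.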

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
  Linux 5.11 Git: 9.255 (SE +/- 0.148, N = 15)
  Linux 5.11 Patched: 8.882 (SE +/- 0.141, N = 15)
  Linux 5.10: 8.109 (SE +/- 0.055, N = 6)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP LavaMD (Seconds, Fewer Is Better)
  Linux 5.11 Git: 52.79 (SE +/- 0.06, N = 3)
  Linux 5.10: 52.39 (SE +/- 0.02, N = 3)
  Linux 5.11 Patched: 52.09 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP Leukocyte (Seconds, Fewer Is Better)
  Linux 5.10: 54.97 (SE +/- 0.36, N = 15)
  Linux 5.11 Git: 53.86 (SE +/- 0.35, N = 3)
  Linux 5.11 Patched: 52.68 (SE +/- 0.69, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
  Linux 5.11 Git: 11.21 (SE +/- 0.22, N = 15)
  Linux 5.10: 10.60 (SE +/- 0.08, N = 5)
  Linux 5.11 Patched: 10.34 (SE +/- 0.03, N = 5)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP HotSpot3D (Seconds, Fewer Is Better)
  Linux 5.11 Git: 97.94 (SE +/- 0.20, N = 3)
  Linux 5.10: 96.76 (SE +/- 0.62, N = 3)
  Linux 5.11 Patched: 96.60 (SE +/- 0.59, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

SQLite Speedtest

This test runs SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, Fewer Is Better)
  Linux 5.11 Patched: 70.54 (SE +/- 0.13, N = 3)
  Linux 5.10: 70.31 (SE +/- 0.09, N = 3)
  Linux 5.11 Git: 70.16 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

Tachyon 0.99b6 - Total Time (Seconds, Fewer Is Better)
  Linux 5.10: 18.12 (SE +/- 0.04, N = 3)
  Linux 5.11 Patched: 18.06 (SE +/- 0.06, N = 3)
  Linux 5.11 Git: 17.94 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Tungsten Renderer

Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.

Tungsten Renderer 0.2.2 - Scene: Hair (Seconds, Fewer Is Better)
  Linux 5.10: 6.69634 (SE +/- 0.03288, N = 6)
  Linux 5.11 Patched: 6.69036 (SE +/- 0.01480, N = 6)
  Linux 5.11 Git: 6.60054 (SE +/- 0.05969, N = 6)
1. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Tungsten Renderer 0.2.2 - Scene: Water Caustic (Seconds, Fewer Is Better)
  Linux 5.11 Patched: 21.33 (SE +/- 0.21, N = 15)
  Linux 5.11 Git: 21.33 (SE +/- 0.25, N = 3)
  Linux 5.10: 21.19 (SE +/- 0.14, N = 15)
1. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Tungsten Renderer 0.2.2 - Scene: Volumetric Caustic (Seconds, Fewer Is Better)
  Linux 5.10: 5.31202 (SE +/- 0.03115, N = 7)
  Linux 5.11 Git: 5.30105 (SE +/- 0.01185, N = 7)
  Linux 5.11 Patched: 5.26235 (SE +/- 0.03676, N = 7)
1. (CXX) g++ options: -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

QMCPACK

QMCPACK is a modern, high-performance, open-source, production-level many-body ab initio Quantum Monte Carlo (QMC) code for computing the electronic structure of atoms, molecules, and solids. This benchmark uses MPI to run the H2O example code. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

QMCPACK 3.10 - Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
  Linux 5.11 Git: 31.18 (SE +/- 0.53, N = 15)
  Linux 5.10: 29.48 (SE +/- 0.04, N = 3)
  Linux 5.11 Patched: 29.28 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
