MGLRU Kernel Tests

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 21.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2201118-NE-MGLRUKERN50&grr&rdt.

MGLRU Kernel TestsProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionMGLRU EnabledMGLRU Disabled2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse128GB280GB INTEL SSDPE21D280GAASPEEDVE2282 x Intel 10G X550TUbuntu 21.105.16.0-rc8-mglru-pts (x86_64)GNOME Shell 40.5X Server1.1.182GCC 11.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034 Java Details- MGLRU Enabled: OpenJDK Runtime Environment (build 11.0.12+7-Ubuntu-0ubuntu3)- MGLRU Disabled: OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.10)Python Details- Python 3.9.7Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

MGLRU Kernel Testsopenvkl: vklBenchmark ISPCopenvkl: vklBenchmark Scalarjava-gradle-perf: Reactornwchem: C240 Buckyballplaidml: No - Inference - ResNet 50 - CPUincompact3d: X3D-benchmarking input.i3dplaidml: No - Inference - VGG16 - CPUqe: AUSURF112luxcorerender: Danish Mood - CPUplaidml: No - Inference - VGG19 - CPUonnx: fcn-resnet101-11 - CPUonnx: yolov4 - CPUonnx: super-resolution-10 - CPUmocassin: Dust 2D tau100.0build-llvm: Unix Makefilesbuild-linux-kernel: allmodconfigapache: 1000pgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Onlypgbench: 100 - 500 - Read Only - Average Latencypgbench: 100 - 500 - Read Onlyrodinia: OpenMP Leukocyteonnx: shufflenet-v2-10 - CPUbuild-llvm: Ninjarodinia: OpenMP HotSpot3Dapache: 500nginx: 500nginx: 1000build-linux-kernel: Time To Compileospray: San Miguel - Path Tracerluxcorerender: DLSC - CPUcompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingbuild-godot: Time To Compileamg: rodinia: OpenMP Streamclustersvt-av1: Preset 4 - Bosphorus 4Kstockfish: Total Timeospray: San Miguel - SciVisrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverxmrig: Monero - 1Mnamd: ATPase Simulation - 327,506 Atomsbuild-mesa: Time To Compilexmrig: Wownero - 1Mbuild-linux-kernel: defconfigliquid-dsp: 256 - 256 - 57liquid-dsp: 128 - 256 - 57svt-av1: Preset 8 - Bosphorus 4Knpb: EP.Dincompact3d: input.i3d 193 Cells Per Directionembree: Pathtracer ISPC - Crownembree: Pathtracer - Crownmt-dgemm: Sustained Floating-Point Ratenpb: MG.CMGLRU EnabledMGLRU Disabled175118374.7492161.24.49463.57981428.09330.625.3524.191802125923230196.919157.84494120.000.12919351500.260192228546.1105553110.353104.60176184.7389324.7491024.3119.8286.5210.3659418540997557.65112491416679.7294.47624965205883.3333.0408.96940029.90.2715821.14453546.020.4535511766667510073333352.6558589.0313.374322359.859466.204928.60169874668.52176118380.4412154.74.40463.23082526.28330.965.2523.531772306711230196.729159.28087890.860.12520004450.258193666347.0775447110.285105.13780632.5989985.5191601.6519.8046.6510.2859438439775558.11212446050009.8034.53025012546583.3333.2959.28840078.30.2725721.13053670.320.6185526200000509333333352.2888547.9113.380874360.101166.228728.82657574790.12OpenBenchmarking.org

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPCMGLRU EnabledMGLRU Disabled4080120160200SE +/- 0.67, N = 3SE +/- 0.00, N = 3175176MIN: 14 / MAX: 2362MIN: 16 / MAX: 2455

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ScalarMGLRU EnabledMGLRU Disabled306090120150SE +/- 0.33, N = 3SE +/- 1.00, N = 3118118MIN: 11 / MAX: 2529MIN: 11 / MAX: 2528

Java Gradle Build

Gradle Build: Reactor

OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: ReactorMGLRU EnabledMGLRU Disabled80160240320400SE +/- 5.21, N = 9SE +/- 5.30, N = 3374.75380.44

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballMGLRU EnabledMGLRU Disabled50010001500200025002161.22154.71. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUMGLRU EnabledMGLRU Disabled1.01032.02063.03094.04125.0515SE +/- 0.04, N = 3SE +/- 0.02, N = 34.494.40

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dMGLRU EnabledMGLRU Disabled100200300400500SE +/- 0.82, N = 3SE +/- 1.49, N = 3463.58463.231. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUMGLRU EnabledMGLRU Disabled714212835SE +/- 0.29, N = 15SE +/- 0.25, N = 1528.0926.28

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF112MGLRU EnabledMGLRU Disabled70140210280350SE +/- 0.20, N = 3SE +/- 0.36, N = 3330.62330.961. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUMGLRU EnabledMGLRU Disabled1.20382.40763.61144.81526.019SE +/- 0.09, N = 15SE +/- 0.08, N = 155.355.25MIN: 1.85 / MAX: 7.13MIN: 1.73 / MAX: 7.07

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUMGLRU EnabledMGLRU Disabled612182430SE +/- 0.23, N = 3SE +/- 0.23, N = 1524.1923.53

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: fcn-resnet101-11 - Device: CPUMGLRU EnabledMGLRU Disabled4080120160200SE +/- 4.29, N = 12SE +/- 1.80, N = 31801771. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: yolov4 - Device: CPUMGLRU EnabledMGLRU Disabled50100150200250SE +/- 1.95, N = 12SE +/- 0.87, N = 32122301. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: super-resolution-10 - Device: CPUMGLRU EnabledMGLRU Disabled14002800420056007000SE +/- 176.40, N = 12SE +/- 5.04, N = 3592367111. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.0MGLRU EnabledMGLRU Disabled50100150200250SE +/- 0.67, N = 3SE +/- 0.00, N = 32302301. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Unix MakefilesMGLRU EnabledMGLRU Disabled4080120160200SE +/- 0.16, N = 3SE +/- 0.47, N = 3196.92196.73

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: allmodconfigMGLRU EnabledMGLRU Disabled4080120160200SE +/- 0.55, N = 3SE +/- 0.79, N = 3157.84159.28

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000MGLRU EnabledMGLRU Disabled20K40K60K80K100KSE +/- 908.68, N = 6SE +/- 1001.52, N = 494120.0087890.861. (CC) gcc options: -shared -fPIC -O2

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyMGLRU EnabledMGLRU Disabled0.0290.0580.0870.1160.145SE +/- 0.000, N = 3SE +/- 0.001, N = 30.1290.1251. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyMGLRU EnabledMGLRU Disabled400K800K1200K1600K2000KSE +/- 5692.58, N = 3SE +/- 19866.33, N = 3193515020004451. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average LatencyMGLRU EnabledMGLRU Disabled0.05850.1170.17550.2340.2925SE +/- 0.002, N = 3SE +/- 0.003, N = 30.2600.2581. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read OnlyMGLRU EnabledMGLRU Disabled400K800K1200K1600K2000KSE +/- 11345.70, N = 3SE +/- 21901.77, N = 3192228519366631. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteMGLRU EnabledMGLRU Disabled1122334455SE +/- 0.32, N = 13SE +/- 0.20, N = 346.1147.081. (CXX) g++ options: -O2 -lOpenCL

ONNX Runtime

Model: shufflenet-v2-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: shufflenet-v2-10 - Device: CPUMGLRU EnabledMGLRU Disabled12002400360048006000SE +/- 54.18, N = 3SE +/- 71.82, N = 3555354471. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: NinjaMGLRU EnabledMGLRU Disabled20406080100SE +/- 0.14, N = 3SE +/- 0.23, N = 3110.35110.29

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DMGLRU EnabledMGLRU Disabled20406080100SE +/- 0.54, N = 3SE +/- 0.89, N = 3104.60105.141. (CXX) g++ options: -O2 -lOpenCL

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500MGLRU EnabledMGLRU Disabled20K40K60K80K100KSE +/- 976.98, N = 3SE +/- 232.53, N = 376184.7380632.591. (CC) gcc options: -shared -fPIC -O2

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500MGLRU EnabledMGLRU Disabled20K40K60K80K100KSE +/- 134.98, N = 3SE +/- 202.70, N = 389324.7489985.511. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000MGLRU EnabledMGLRU Disabled20K40K60K80K100KSE +/- 265.34, N = 3SE +/- 270.06, N = 391024.3191601.651. (CC) gcc options: -lcrypt -lz -O3 -march=native

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.14Time To CompileMGLRU EnabledMGLRU Disabled510152025SE +/- 0.14, N = 13SE +/- 0.13, N = 1419.8319.80

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerMGLRU EnabledMGLRU Disabled246810SE +/- 0.08, N = 3SE +/- 0.08, N = 36.526.65MIN: 5.41 / MAX: 7.09MIN: 5.46 / MAX: 7.14

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUMGLRU EnabledMGLRU Disabled3691215SE +/- 0.07, N = 3SE +/- 0.07, N = 310.3610.28MIN: 9.71 / MAX: 14.1MIN: 9.66 / MAX: 14.1

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Decompression RatingMGLRU EnabledMGLRU Disabled130K260K390K520K650KSE +/- 6316.43, N = 3SE +/- 7344.83, N = 35941855943841. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Compression RatingMGLRU EnabledMGLRU Disabled90K180K270K360K450KSE +/- 5744.62, N = 3SE +/- 1852.98, N = 34099753977551. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileMGLRU EnabledMGLRU Disabled1326395265SE +/- 0.08, N = 3SE +/- 0.12, N = 357.6558.11

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2MGLRU EnabledMGLRU Disabled300M600M900M1200M1500MSE +/- 738959.93, N = 3SE +/- 2472880.37, N = 3124914166712446050001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterMGLRU EnabledMGLRU Disabled3691215SE +/- 0.134, N = 15SE +/- 0.201, N = 149.7299.8031. (CXX) g++ options: -O2 -lOpenCL

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KMGLRU EnabledMGLRU Disabled1.01932.03863.05794.07725.0965SE +/- 0.008, N = 3SE +/- 0.005, N = 34.4764.5301. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeMGLRU EnabledMGLRU Disabled50M100M150M200M250MSE +/- 2261974.30, N = 3SE +/- 2532421.76, N = 62496520582501254651. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

OSPray

Demo: San Miguel - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisMGLRU EnabledMGLRU Disabled20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 383.3383.33MIN: 35.71 / MAX: 100MIN: 47.62 / MAX: 100

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDMGLRU EnabledMGLRU Disabled816243240SE +/- 0.15, N = 3SE +/- 0.17, N = 333.0433.301. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverMGLRU EnabledMGLRU Disabled3691215SE +/- 0.083, N = 6SE +/- 0.124, N = 128.9699.2881. (CXX) g++ options: -O2 -lOpenCL

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1MMGLRU EnabledMGLRU Disabled9K18K27K36K45KSE +/- 178.74, N = 3SE +/- 67.91, N = 340029.940078.31. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsMGLRU EnabledMGLRU Disabled0.06130.12260.18390.24520.3065SE +/- 0.00314, N = 3SE +/- 0.00229, N = 30.271580.27257

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To CompileMGLRU EnabledMGLRU Disabled510152025SE +/- 0.02, N = 3SE +/- 0.05, N = 321.1421.13

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1MMGLRU EnabledMGLRU Disabled11K22K33K44K55KSE +/- 192.97, N = 3SE +/- 176.64, N = 353546.053670.31. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: defconfigMGLRU EnabledMGLRU Disabled510152025SE +/- 0.07, N = 3SE +/- 0.02, N = 320.4520.62

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57MGLRU EnabledMGLRU Disabled1200M2400M3600M4800M6000MSE +/- 18720428.53, N = 3SE +/- 16977730.51, N = 3551176666755262000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57MGLRU EnabledMGLRU Disabled1100M2200M3300M4400M5500MSE +/- 11970148.05, N = 3SE +/- 7872808.34, N = 3510073333350933333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KMGLRU EnabledMGLRU Disabled1224364860SE +/- 0.18, N = 3SE +/- 0.18, N = 352.6652.291. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DMGLRU EnabledMGLRU Disabled2K4K6K8K10KSE +/- 4.21, N = 3SE +/- 47.65, N = 38589.038547.911. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionMGLRU EnabledMGLRU Disabled3691215SE +/- 0.03, N = 3SE +/- 0.02, N = 313.3713.381. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: CrownMGLRU EnabledMGLRU Disabled1326395265SE +/- 0.42, N = 3SE +/- 0.18, N = 359.8660.10MIN: 55.99 / MAX: 67.12MIN: 56.51 / MAX: 68.5

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer - Model: CrownMGLRU EnabledMGLRU Disabled1530456075SE +/- 0.15, N = 3SE +/- 0.24, N = 366.2066.23MIN: 61.78 / MAX: 73.44MIN: 61.28 / MAX: 74.94

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateMGLRU EnabledMGLRU Disabled714212835SE +/- 0.17, N = 3SE +/- 0.17, N = 328.6028.831. (CC) gcc options: -O3 -march=native -fopenmp

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CMGLRU EnabledMGLRU Disabled16K32K48K64K80KSE +/- 370.74, N = 3SE +/- 406.38, N = 374668.5274790.121. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0


Phoronix Test Suite v10.8.5