MGLRU Kernel Tests

2 x AMD EPYC 7742 64-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 21.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2201118-NE-MGLRUKERN50&grs&sor.

MGLRU Kernel TestsProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionMGLRU EnabledMGLRU Disabled2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse128GB280GB INTEL SSDPE21D280GAASPEEDVE2282 x Intel 10G X550TUbuntu 21.105.16.0-rc8-mglru-pts (x86_64)GNOME Shell 40.5X Server1.1.182GCC 11.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034 Java Details- MGLRU Enabled: OpenJDK Runtime Environment (build 11.0.12+7-Ubuntu-0ubuntu3)- MGLRU Disabled: OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.21.10)Python Details- Python 3.9.7Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

MGLRU Kernel Testsonnx: yolov4 - CPUapache: 1000plaidml: No - Inference - VGG16 - CPUapache: 500rodinia: OpenMP CFD Solverpgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Only - Average Latencycompress-7zip: Compression Ratingplaidml: No - Inference - VGG19 - CPUrodinia: OpenMP Leukocyteplaidml: No - Inference - ResNet 50 - CPUospray: San Miguel - Path Traceronnx: shufflenet-v2-10 - CPUjava-gradle-perf: Reactorsvt-av1: Preset 4 - Bosphorus 4Kbuild-linux-kernel: allmodconfigbuild-linux-kernel: defconfigbuild-godot: Time To Compilemt-dgemm: Sustained Floating-Point Rateluxcorerender: DLSC - CPUpgbench: 100 - 500 - Read Only - Average Latencyrodinia: OpenMP LavaMDpgbench: 100 - 500 - Read Onlynginx: 500svt-av1: Preset 8 - Bosphorus 4Knginx: 1000openvkl: vklBenchmark ISPCrodinia: OpenMP HotSpot3Dnpb: EP.Dembree: Pathtracer ISPC - Crownnamd: ATPase Simulation - 327,506 Atomsamg: nwchem: C240 Buckyballliquid-dsp: 256 - 256 - 57xmrig: Wownero - 1Mstockfish: Total Timenpb: MG.Cliquid-dsp: 128 - 256 - 57build-linux-kernel: Time To Compilexmrig: Monero - 1Mqe: AUSURF112build-llvm: Unix Makefilesincompact3d: X3D-benchmarking input.i3dbuild-mesa: Time To Compilebuild-llvm: Ninjaincompact3d: input.i3d 193 Cells Per Directionembree: Pathtracer - Crowncompress-7zip: Decompression Ratingopenvkl: vklBenchmark Scalarospray: San Miguel - SciVismocassin: Dust 2D tau100.0onnx: super-resolution-10 - CPUonnx: fcn-resnet101-11 - CPUluxcorerender: Danish Mood - CPUrodinia: OpenMP StreamclusterMGLRU EnabledMGLRU Disabled21294120.0028.0976184.738.96919351500.12940997524.1946.1104.496.525553374.7494.476157.84420.45357.65128.60169810.360.26033.040192228589324.7452.65591024.31175104.6018589.0359.85940.2715812491416672161.2551176666753546.024965205874668.52510073333319.82840029.9330.62196.919463.57981421.144110.35313.374322366.204959418511883.3323059231805.359.72923087890.8626.2880632.599.28820004450.12539775523.5347.0774.406.655447380.4414.530159.28020.61858.11228.82657510.280.25833.295193666389985.5152.28891601.65176105.1378547.9160.10110.2725712446050002154.7552620000053670.325012546574790.12509333333319.80440078.3330.96196.729463.23082521.130110.28513.380874366.228759438411883.3323067111775.259.803OpenBenchmarking.org

ONNX Runtime

Model: yolov4 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: yolov4 - Device: CPUMGLRU DisabledMGLRU Enabled50100150200250SE +/- 0.87, N = 3SE +/- 1.95, N = 122302121. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000MGLRU EnabledMGLRU Disabled20K40K60K80K100KSE +/- 908.68, N = 6SE +/- 1001.52, N = 494120.0087890.861. (CC) gcc options: -shared -fPIC -O2

PlaidML

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUMGLRU EnabledMGLRU Disabled714212835SE +/- 0.29, N = 15SE +/- 0.25, N = 1528.0926.28

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 232.53, N = 3SE +/- 976.98, N = 380632.5976184.731. (CC) gcc options: -shared -fPIC -O2

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD SolverMGLRU EnabledMGLRU Disabled3691215SE +/- 0.083, N = 6SE +/- 0.124, N = 128.9699.2881. (CXX) g++ options: -O2 -lOpenCL

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyMGLRU DisabledMGLRU Enabled400K800K1200K1600K2000KSE +/- 19866.33, N = 3SE +/- 5692.58, N = 3200044519351501. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyMGLRU DisabledMGLRU Enabled0.0290.0580.0870.1160.145SE +/- 0.001, N = 3SE +/- 0.000, N = 30.1250.1291. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Compression RatingMGLRU EnabledMGLRU Disabled90K180K270K360K450KSE +/- 5744.62, N = 3SE +/- 1852.98, N = 34099753977551. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

PlaidML

FP16: No - Mode: Inference - Network: VGG19 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG19 - Device: CPUMGLRU EnabledMGLRU Disabled612182430SE +/- 0.23, N = 3SE +/- 0.23, N = 1524.1923.53

Rodinia

Test: OpenMP Leukocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LeukocyteMGLRU EnabledMGLRU Disabled1122334455SE +/- 0.32, N = 13SE +/- 0.20, N = 346.1147.081. (CXX) g++ options: -O2 -lOpenCL

PlaidML

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUMGLRU EnabledMGLRU Disabled1.01032.02063.03094.04125.0515SE +/- 0.04, N = 3SE +/- 0.02, N = 34.494.40

OSPray

Demo: San Miguel - Renderer: Path Tracer

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: Path TracerMGLRU DisabledMGLRU Enabled246810SE +/- 0.08, N = 3SE +/- 0.08, N = 36.656.52MIN: 5.46 / MAX: 7.14MIN: 5.41 / MAX: 7.09

ONNX Runtime

Model: shufflenet-v2-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: shufflenet-v2-10 - Device: CPUMGLRU EnabledMGLRU Disabled12002400360048006000SE +/- 54.18, N = 3SE +/- 71.82, N = 3555354471. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Java Gradle Build

Gradle Build: Reactor

OpenBenchmarking.orgSeconds, Fewer Is BetterJava Gradle BuildGradle Build: ReactorMGLRU EnabledMGLRU Disabled80160240320400SE +/- 5.21, N = 9SE +/- 5.30, N = 3374.75380.44

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 4 - Input: Bosphorus 4KMGLRU DisabledMGLRU Enabled1.01932.03863.05794.07725.0965SE +/- 0.005, N = 3SE +/- 0.008, N = 34.5304.4761. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: allmodconfigMGLRU EnabledMGLRU Disabled4080120160200SE +/- 0.55, N = 3SE +/- 0.79, N = 3157.84159.28

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.16Build: defconfigMGLRU EnabledMGLRU Disabled510152025SE +/- 0.07, N = 3SE +/- 0.02, N = 320.4520.62

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To CompileMGLRU EnabledMGLRU Disabled1326395265SE +/- 0.08, N = 3SE +/- 0.12, N = 357.6558.11

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateMGLRU DisabledMGLRU Enabled714212835SE +/- 0.17, N = 3SE +/- 0.17, N = 328.8328.601. (CC) gcc options: -O3 -march=native -fopenmp

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUMGLRU EnabledMGLRU Disabled3691215SE +/- 0.07, N = 3SE +/- 0.07, N = 310.3610.28MIN: 9.71 / MAX: 14.1MIN: 9.66 / MAX: 14.1

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average LatencyMGLRU DisabledMGLRU Enabled0.05850.1170.17550.2340.2925SE +/- 0.003, N = 3SE +/- 0.002, N = 30.2580.2601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDMGLRU EnabledMGLRU Disabled816243240SE +/- 0.15, N = 3SE +/- 0.17, N = 333.0433.301. (CXX) g++ options: -O2 -lOpenCL

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 500 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 500 - Mode: Read OnlyMGLRU DisabledMGLRU Enabled400K800K1200K1600K2000KSE +/- 21901.77, N = 3SE +/- 11345.70, N = 3193666319222851. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 202.70, N = 3SE +/- 134.98, N = 389985.5189324.741. (CC) gcc options: -lcrypt -lz -O3 -march=native

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8.7Encoder Mode: Preset 8 - Input: Bosphorus 4KMGLRU EnabledMGLRU Disabled1224364860SE +/- 0.18, N = 3SE +/- 0.18, N = 352.6652.291. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000MGLRU DisabledMGLRU Enabled20K40K60K80K100KSE +/- 270.06, N = 3SE +/- 265.34, N = 391601.6591024.311. (CC) gcc options: -lcrypt -lz -O3 -march=native

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ISPCMGLRU DisabledMGLRU Enabled4080120160200SE +/- 0.00, N = 3SE +/- 0.67, N = 3176175MIN: 16 / MAX: 2455MIN: 14 / MAX: 2362

Rodinia

Test: OpenMP HotSpot3D

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3DMGLRU EnabledMGLRU Disabled20406080100SE +/- 0.54, N = 3SE +/- 0.89, N = 3104.60105.141. (CXX) g++ options: -O2 -lOpenCL

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DMGLRU EnabledMGLRU Disabled2K4K6K8K10KSE +/- 4.21, N = 3SE +/- 47.65, N = 38589.038547.911. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: CrownMGLRU DisabledMGLRU Enabled1326395265SE +/- 0.18, N = 3SE +/- 0.42, N = 360.1059.86MIN: 56.51 / MAX: 68.5MIN: 55.99 / MAX: 67.12

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsMGLRU EnabledMGLRU Disabled0.06130.12260.18390.24520.3065SE +/- 0.00314, N = 3SE +/- 0.00229, N = 30.271580.27257

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2MGLRU EnabledMGLRU Disabled300M600M900M1200M1500MSE +/- 738959.93, N = 3SE +/- 2472880.37, N = 3124914166712446050001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 BuckyballMGLRU DisabledMGLRU Enabled50010001500200025002154.72161.21. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 57MGLRU DisabledMGLRU Enabled1200M2400M3600M4800M6000MSE +/- 16977730.51, N = 3SE +/- 18720428.53, N = 3552620000055117666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1MMGLRU DisabledMGLRU Enabled11K22K33K44K55KSE +/- 176.64, N = 3SE +/- 192.97, N = 353670.353546.01. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total TimeMGLRU DisabledMGLRU Enabled50M100M150M200M250MSE +/- 2532421.76, N = 6SE +/- 2261974.30, N = 32501254652496520581. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CMGLRU DisabledMGLRU Enabled16K32K48K64K80KSE +/- 406.38, N = 3SE +/- 370.74, N = 374790.1274668.521. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.0

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57MGLRU EnabledMGLRU Disabled1100M2200M3300M4400M5500MSE +/- 11970148.05, N = 3SE +/- 7872808.34, N = 3510073333350933333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.14Time To CompileMGLRU DisabledMGLRU Enabled510152025SE +/- 0.13, N = 14SE +/- 0.14, N = 1319.8019.83

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1MMGLRU DisabledMGLRU Enabled9K18K27K36K45KSE +/- 67.91, N = 3SE +/- 178.74, N = 340078.340029.91. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 7.0Input: AUSURF112MGLRU EnabledMGLRU Disabled70140210280350SE +/- 0.20, N = 3SE +/- 0.36, N = 3330.62330.961. (F9X) gfortran options: -pthread -fopenmp -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3_omp -lfftw3 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Unix MakefilesMGLRU DisabledMGLRU Enabled4080120160200SE +/- 0.47, N = 3SE +/- 0.16, N = 3196.73196.92

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dMGLRU DisabledMGLRU Enabled100200300400500SE +/- 1.49, N = 3SE +/- 0.82, N = 3463.23463.581. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To CompileMGLRU DisabledMGLRU Enabled510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 321.1321.14

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: NinjaMGLRU DisabledMGLRU Enabled20406080100SE +/- 0.23, N = 3SE +/- 0.14, N = 3110.29110.35

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionMGLRU EnabledMGLRU Disabled3691215SE +/- 0.03, N = 3SE +/- 0.02, N = 313.3713.381. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer - Model: CrownMGLRU DisabledMGLRU Enabled1530456075SE +/- 0.24, N = 3SE +/- 0.15, N = 366.2366.20MIN: 61.28 / MAX: 74.94MIN: 61.78 / MAX: 73.44

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Decompression RatingMGLRU DisabledMGLRU Enabled130K260K390K520K650KSE +/- 7344.83, N = 3SE +/- 6316.43, N = 35943845941851. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenVKL

Benchmark: vklBenchmark Scalar

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.0Benchmark: vklBenchmark ScalarMGLRU DisabledMGLRU Enabled306090120150SE +/- 1.00, N = 3SE +/- 0.33, N = 3118118MIN: 11 / MAX: 2528MIN: 11 / MAX: 2529

OSPray

Demo: San Miguel - Renderer: SciVis

OpenBenchmarking.orgFPS, More Is BetterOSPray 1.8.5Demo: San Miguel - Renderer: SciVisMGLRU DisabledMGLRU Enabled20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 383.3383.33MIN: 47.62 / MAX: 100MIN: 35.71 / MAX: 100

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2019-03-24Input: Dust 2D tau100.0MGLRU EnabledMGLRU Disabled50100150200250SE +/- 0.67, N = 3SE +/- 0.00, N = 32302301. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

ONNX Runtime

Model: super-resolution-10 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: super-resolution-10 - Device: CPUMGLRU DisabledMGLRU Enabled14002800420056007000SE +/- 5.04, N = 3SE +/- 176.40, N = 12671159231. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: fcn-resnet101-11 - Device: CPUMGLRU EnabledMGLRU Disabled4080120160200SE +/- 4.29, N = 12SE +/- 1.80, N = 31801771. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUMGLRU EnabledMGLRU Disabled1.20382.40763.61144.81526.019SE +/- 0.09, N = 15SE +/- 0.08, N = 155.355.25MIN: 1.85 / MAX: 7.13MIN: 1.73 / MAX: 7.07

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP StreamclusterMGLRU EnabledMGLRU Disabled3691215SE +/- 0.134, N = 15SE +/- 0.201, N = 149.7299.8031. (CXX) g++ options: -O2 -lOpenCL


Phoronix Test Suite v10.8.5