Linux 5.12 Scheduler
AMD Ryzen 9 5950X 16-Core testing with an ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2102172-PTS-LINUX51261&sro&grs
Linux 5.12 Scheduler: System Details
Configurations tested: Linux 5.11, 5.12 sched. Values apply to both configurations unless noted.
Processor: AMD Ryzen 9 5950X 16-Core @ 6.92GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS)
Chipset: AMD Starship/Matisse
Memory: 32GB
Disk: 2000GB Corsair Force MP600
Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)
Audio: AMD Navi 10 HDMI Audio
Monitor: ASUS MG28U
Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 20.10
Kernel: 5.11.0-051100-generic (x86_64) for Linux 5.11; 5.11.0-sched (x86_64) for 5.12 sched
Desktop: GNOME Shell 3.38.2
Display Server: X Server 1.20.9
OpenGL: 4.6 Mesa 21.1.0-devel (git-824ae64 2021-02-01 groovy-oibaf-ppa) (LLVM 11.0.1)
Vulkan: 1.2.145
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 3840x2160
Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled); CPU Microcode: 0xa201009
Python Details: Python 3.8.6
Security Details:
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling
  srbds: Not affected
  tsx_async_abort: Not affected
Linux 5.12 Scheduler: Results Overview
Test | Linux 5.11 | 5.12 sched
etcpak: ETC2 | 241.114 | 235.614
graphics-magick: Rotate | 1068 | 1053
askap: tConvolve MPI - Degridding | 6758.72 | 6846.93
build-linux-kernel: Time To Compile | 45.598 | 45.028
qmcpack: simple-H2O | 22.350 | 22.608
tesseract: 3840 x 2160 | 398.4990 | 394.3750
stockfish: Total Time | 43133645 | 43554003
npb: CG.C | 7018.38 | 6957.05
askap: tConvolve OpenMP - Gridding | 2722.08 | 2698.79
graphics-magick: Resizing | 1892 | 1876
financebench: Bonds OpenMP | 39971.764323 | 39641.304688
daphne: OpenMP - Points2Image | 27781.126423097 | 27567.067160148
etcpak: DXT1 | 1533.222 | 1521.611
dav1d: Summer Nature 1080p | 911.62 | 904.93
daphne: OpenMP - NDT Mapping | 882.03 | 888.36
paraview: Wavelet Volume - 3840 x 2160 (Frames / Sec) | 264.67 | 266.26
paraview: Wavelet Volume - 3840 x 2160 (MiVoxels / Sec) | 4234.713 | 4260.097
financebench: Repo OpenMP | 27369.523437 | 27208.082682
v-ray: CPU | 21506 | 21626
npb: FT.C | 12182.06 | 12122.51
build-godot: Time To Compile | 78.962 | 79.313
askap: tConvolve MPI - Gridding | 6672.02 | 6642.93
npb: LU.C | 28328.86 | 28209.45
indigobench: CPU - Bedroom | 4.139 | 4.122
indigobench: CPU - Supercar | 8.682 | 8.717
paraview: Many Spheres - 1920 x 1080 (Frames / Sec) | 66.03 | 66.29
paraview: Many Spheres - 1920 x 1080 (MiPolys / Sec) | 6620.179 | 6645.758
askap: Hogbom Clean OpenMP | 216.934 | 217.709
oidn: Memorial | 14.57 | 14.52
npb: EP.C | 1917.44 | 1910.88
openvkl: vklBenchmark | 292 | 293
dav1d: Chimera 1080p 10-bit | 121.66 | 122.06
npb: BT.C | 24129.92 | 24051.43
build-gdb: Time To Compile | 62.436 | 62.234
warsow: 3840 x 2160 | 431.7 | 430.4
paraview: Wavelet Contour - 1920 x 1080 (MiPolys / Sec) | 3896.310 | 3884.718
paraview: Wavelet Contour - 1920 x 1080 (Frames / Sec) | 373.88 | 372.77
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | 1043.96 | 1046.99
npb: IS.D | 646.03 | 647.67
n-queens: Elapsed Time | 5.622 | 5.636
askap: tConvolve OpenMP - Degridding | 3252.89 | 3260.38
webp2: Quality 75, Compression Effort 7 | 116.335 | 116.599
daphne: OpenMP - Euclidean Cluster | 1493.34 | 1496.14
paraview: Wavelet Contour - 3840 x 2160 (MiPolys / Sec) | 2653.882 | 2649.139
paraview: Wavelet Contour - 3840 x 2160 (Frames / Sec) | 254.66 | 254.21
dav1d: Summer Nature 4K | 240.77 | 241.18
rawtherapee: Total Benchmark Time | 45.764 | 45.837
gromacs: water_GMX50_bare | 1.264 | 1.266
webp2: Quality 95, Compression Effort 7 | 214.943 | 215.277
openfoam: Motorbike 30M | 97.64 | 97.49
dav1d: Chimera 1080p | 837.08 | 838.25
m-queens: Time To Solve | 30.814 | 30.855
paraview: Many Spheres - 3840 x 2160 (MiPolys / Sec) | 6294.768 | 6291.432
askap: tConvolve MT - Degridding | 1344.59 | 1343.89
paraview: Many Spheres - 3840 x 2160 (Frames / Sec) | 62.78 | 62.75
npb: MG.C | 9935.40 | 9930.81
npb: SP.B | 7886.19 | 7883.48
askap: tConvolve MT - Gridding | 785.016 | 784.775
namd: ATPase Simulation - 327,506 Atoms | 1.08110 | 1.08100
jpegxl-decode: All | 202.60 | 202.61
clomp: Static OMP Speedup | 21.2 | 21.2
simdjson: PartialTweets | 0.90 | 0.94
simdjson: Kostya | 0.67 | 0.71
paraview: Wavelet Volume - 1920 x 1080 (MiVoxels / Sec) | 7823.400 | 7416.986
paraview: Wavelet Volume - 1920 x 1080 (Frames / Sec) | 488.96 | 463.56
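As a reading aid for the overview values, the following is a minimal Python sketch of how the relative change between the two kernels could be computed. The helper name `percent_change` and its `higher_is_better` flag are hypothetical and not part of the Phoronix Test Suite; the input numbers are the etcpak ETC2 and Timed Linux Kernel Compilation results reported in this file.

```python
def percent_change(baseline, candidate, higher_is_better=True):
    """Signed percent change of candidate relative to baseline.

    A positive result means the candidate did better than the baseline.
    This is a hypothetical helper for illustration, not a PTS function.
    """
    change = (candidate - baseline) / baseline * 100.0
    # For "Fewer Is Better" metrics a lower value is an improvement,
    # so the sign is flipped.
    return change if higher_is_better else -change


# etcpak ETC2 (Mpx/s, More Is Better): Linux 5.11 vs 5.12 sched
etcpak_change = percent_change(241.114, 235.614)

# Timed Linux Kernel Compilation (Seconds, Fewer Is Better)
kernel_build_change = percent_change(45.598, 45.028, higher_is_better=False)

print(f"etcpak ETC2: {etcpak_change:+.2f}%")
print(f"kernel build: {kernel_build_change:+.2f}%")
```

Flipping the sign for time-based metrics keeps "positive means 5.12 sched is better" consistent across throughput and duration results.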
Etcpak 0.7, Configuration: ETC2 (Mpx/s, More Is Better). 5.12 sched: 235.61 (SE +/- 1.05, N = 3); Linux 5.11: 241.11 (SE +/- 3.17, N = 3). Build: (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
GraphicsMagick 1.3.33, Operation: Rotate (Iterations Per Minute, More Is Better). 5.12 sched: 1053 (SE +/- 5.86, N = 3); Linux 5.11: 1068 (SE +/- 2.91, N = 3). Build: (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
ASKAP 1.0, Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better). 5.12 sched: 6846.93 (SE +/- 79.27, N = 3); Linux 5.11: 6758.72 (SE +/- 77.23, N = 3). Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Timed Linux Kernel Compilation 5.4, Time To Compile (Seconds, Fewer Is Better). 5.12 sched: 45.03 (SE +/- 0.33, N = 3); Linux 5.11: 45.60 (SE +/- 0.33, N = 3).
QMCPACK 3.10, Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better). 5.12 sched: 22.61 (SE +/- 0.24, N = 5); Linux 5.11: 22.35 (SE +/- 0.10, N = 3). Build: (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm
Tesseract 2014-05-12, Resolution: 3840 x 2160 (Frames Per Second, More Is Better). 5.12 sched: 394.38 (SE +/- 3.88, N = 15); Linux 5.11: 398.50 (SE +/- 3.68, N = 6).
Stockfish 12, Total Time (Nodes Per Second, More Is Better). 5.12 sched: 43554003 (SE +/- 444782.82, N = 5); Linux 5.11: 43133645 (SE +/- 412700.59, N = 3). Build: (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better). 5.12 sched: 6957.05 (SE +/- 11.23, N = 3); Linux 5.11: 7018.38 (SE +/- 60.15, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
ASKAP 1.0, Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better). 5.12 sched: 2698.79 (SE +/- 18.11, N = 3); Linux 5.11: 2722.08 (SE +/- 23.99, N = 7). Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
GraphicsMagick 1.3.33, Operation: Resizing (Iterations Per Minute, More Is Better). 5.12 sched: 1876 (SE +/- 6.56, N = 3); Linux 5.11: 1892 (SE +/- 4.37, N = 3). Build: (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
FinanceBench 2016-07-25, Benchmark: Bonds OpenMP (ms, Fewer Is Better). 5.12 sched: 39641.30 (SE +/- 34.33, N = 3); Linux 5.11: 39971.76 (SE +/- 31.83, N = 3). Build: (CXX) g++ options: -O3 -march=native -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite, Backend: OpenMP - Kernel: Points2Image (Test Cases Per Minute, More Is Better). 5.12 sched: 27567.07 (SE +/- 392.26, N = 3); Linux 5.11: 27781.13 (SE +/- 342.19, N = 3). Build: (CXX) g++ options: -O3 -std=c++11 -fopenmp
Etcpak 0.7, Configuration: DXT1 (Mpx/s, More Is Better). 5.12 sched: 1521.61 (SE +/- 4.22, N = 3); Linux 5.11: 1533.22 (SE +/- 1.30, N = 3). Build: (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
dav1d 0.8.1, Video Input: Summer Nature 1080p (FPS, More Is Better). 5.12 sched: 904.93 (SE +/- 0.24, N = 3; MIN: 648.7 / MAX: 987.05); Linux 5.11: 911.62 (SE +/- 4.97, N = 3; MIN: 618.67 / MAX: 1003.6). Build: (CC) gcc options: -pthread
Darmstadt Automotive Parallel Heterogeneous Suite, Backend: OpenMP - Kernel: NDT Mapping (Test Cases Per Minute, More Is Better). 5.12 sched: 888.36 (SE +/- 2.37, N = 3); Linux 5.11: 882.03 (SE +/- 6.01, N = 3). Build: (CXX) g++ options: -O3 -std=c++11 -fopenmp
ParaView 5.9, Test: Wavelet Volume - Resolution: 3840 x 2160 (Frames / Sec, More Is Better). 5.12 sched: 266.26 (SE +/- 0.26, N = 3); Linux 5.11: 264.67 (SE +/- 1.81, N = 12).
ParaView 5.9, Test: Wavelet Volume - Resolution: 3840 x 2160 (MiVoxels / Sec, More Is Better). 5.12 sched: 4260.10 (SE +/- 4.11, N = 3); Linux 5.11: 4234.71 (SE +/- 28.98, N = 12).
FinanceBench 2016-07-25, Benchmark: Repo OpenMP (ms, Fewer Is Better). 5.12 sched: 27208.08 (SE +/- 33.25, N = 3); Linux 5.11: 27369.52 (SE +/- 288.55, N = 3). Build: (CXX) g++ options: -O3 -march=native -fopenmp
Chaos Group V-RAY 5, Mode: CPU (vsamples, More Is Better). 5.12 sched: 21626 (SE +/- 221.33, N = 3); Linux 5.11: 21506 (SE +/- 99.08, N = 3).
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better). 5.12 sched: 12122.51 (SE +/- 4.72, N = 3); Linux 5.11: 12182.06 (SE +/- 6.14, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
Timed Godot Game Engine Compilation 3.2.3, Time To Compile (Seconds, Fewer Is Better). 5.12 sched: 79.31 (SE +/- 0.12, N = 3); Linux 5.11: 78.96 (SE +/- 0.18, N = 3).
ASKAP 1.0, Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better). 5.12 sched: 6642.93 (SE +/- 0.00, N = 3); Linux 5.11: 6672.02 (SE +/- 56.07, N = 3). Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better). 5.12 sched: 28209.45 (SE +/- 19.13, N = 3); Linux 5.11: 28328.86 (SE +/- 8.24, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
IndigoBench 4.4, Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better). 5.12 sched: 4.122 (SE +/- 0.016, N = 3); Linux 5.11: 4.139 (SE +/- 0.017, N = 3).
IndigoBench 4.4, Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better). 5.12 sched: 8.717 (SE +/- 0.023, N = 3); Linux 5.11: 8.682 (SE +/- 0.011, N = 3).
ParaView 5.9, Test: Many Spheres - Resolution: 1920 x 1080 (Frames / Sec, More Is Better). 5.12 sched: 66.29 (SE +/- 0.15, N = 3); Linux 5.11: 66.03 (SE +/- 0.22, N = 3).
ParaView 5.9, Test: Many Spheres - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better). 5.12 sched: 6645.76 (SE +/- 14.64, N = 3); Linux 5.11: 6620.18 (SE +/- 22.52, N = 3).
ASKAP 1.0, Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better). 5.12 sched: 217.71 (SE +/- 0.42, N = 3); Linux 5.11: 216.93 (SE +/- 1.25, N = 3). Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Intel Open Image Denoise 1.2.0, Scene: Memorial (Images / Sec, More Is Better). 5.12 sched: 14.52 (SE +/- 0.02, N = 3); Linux 5.11: 14.57 (SE +/- 0.02, N = 3).
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better). 5.12 sched: 1910.88 (SE +/- 5.65, N = 3); Linux 5.11: 1917.44 (SE +/- 4.39, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
OpenVKL 0.9, Benchmark: vklBenchmark (Items / Sec, More Is Better). 5.12 sched: 293 (MIN: 1 / MAX: 1137); Linux 5.11: 292 (MIN: 1 / MAX: 1136).
dav1d 0.8.1, Video Input: Chimera 1080p 10-bit (FPS, More Is Better). 5.12 sched: 122.06 (SE +/- 1.09, N = 3; MIN: 86.2 / MAX: 274.09); Linux 5.11: 121.66 (SE +/- 0.66, N = 3; MIN: 87.02 / MAX: 270.79). Build: (CC) gcc options: -pthread
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better). 5.12 sched: 24051.43 (SE +/- 23.23, N = 3); Linux 5.11: 24129.92 (SE +/- 23.90, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
Timed GDB GNU Debugger Compilation 9.1, Time To Compile (Seconds, Fewer Is Better). 5.12 sched: 62.23 (SE +/- 0.31, N = 3); Linux 5.11: 62.44 (SE +/- 0.23, N = 3).
Warsow 2.5 Beta, Resolution: 3840 x 2160 (Frames Per Second, More Is Better). 5.12 sched: 430.4 (SE +/- 0.38, N = 3); Linux 5.11: 431.7 (SE +/- 0.70, N = 3).
ParaView 5.9, Test: Wavelet Contour - Resolution: 1920 x 1080 (MiPolys / Sec, More Is Better). 5.12 sched: 3884.72 (SE +/- 1.80, N = 3); Linux 5.11: 3896.31 (SE +/- 1.65, N = 3).
ParaView 5.9, Test: Wavelet Contour - Resolution: 1920 x 1080 (Frames / Sec, More Is Better). 5.12 sched: 372.77 (SE +/- 0.17, N = 3); Linux 5.11: 373.88 (SE +/- 0.16, N = 3).
TTSIOD 3D Renderer 2.3b, Phong Rendering With Soft-Shadow Mapping (FPS, More Is Better). 5.12 sched: 1046.99 (SE +/- 2.77, N = 3); Linux 5.11: 1043.96 (SE +/- 9.06, N = 3). Build: (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better). 5.12 sched: 647.67 (SE +/- 0.65, N = 3); Linux 5.11: 646.03 (SE +/- 2.89, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
N-Queens 1.0, Elapsed Time (Seconds, Fewer Is Better). 5.12 sched: 5.636 (SE +/- 0.003, N = 3); Linux 5.11: 5.622 (SE +/- 0.003, N = 3). Build: (CC) gcc options: -static -fopenmp -O3 -march=native
ASKAP 1.0, Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better). 5.12 sched: 3260.38 (SE +/- 13.36, N = 3); Linux 5.11: 3252.89 (SE +/- 10.36, N = 7). Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
WebP2 Image Encode 20210126, Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better). 5.12 sched: 116.60 (SE +/- 0.55, N = 3); Linux 5.11: 116.34 (SE +/- 1.18, N = 3). Build: (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Darmstadt Automotive Parallel Heterogeneous Suite, Backend: OpenMP - Kernel: Euclidean Cluster (Test Cases Per Minute, More Is Better). 5.12 sched: 1496.14 (SE +/- 6.52, N = 3); Linux 5.11: 1493.34 (SE +/- 3.74, N = 3). Build: (CXX) g++ options: -O3 -std=c++11 -fopenmp
ParaView 5.9, Test: Wavelet Contour - Resolution: 3840 x 2160 (MiPolys / Sec, More Is Better). 5.12 sched: 2649.14 (SE +/- 0.80, N = 3); Linux 5.11: 2653.88 (SE +/- 0.59, N = 3).
ParaView 5.9, Test: Wavelet Contour - Resolution: 3840 x 2160 (Frames / Sec, More Is Better). 5.12 sched: 254.21 (SE +/- 0.08, N = 3); Linux 5.11: 254.66 (SE +/- 0.06, N = 3).
dav1d 0.8.1, Video Input: Summer Nature 4K (FPS, More Is Better). 5.12 sched: 241.18 (SE +/- 0.13, N = 3; MIN: 181.33 / MAX: 249.24); Linux 5.11: 240.77 (SE +/- 0.34, N = 3; MIN: 181.75 / MAX: 249.17). Build: (CC) gcc options: -pthread
RawTherapee, Total Benchmark Time (Seconds, Fewer Is Better). 5.12 sched: 45.84 (SE +/- 0.22, N = 3); Linux 5.11: 45.76 (SE +/- 0.26, N = 3). Note: RawTherapee, version 5.8, command line.
GROMACS 2021, Input: water_GMX50_bare (Ns Per Day, More Is Better). 5.12 sched: 1.266 (SE +/- 0.002, N = 3); Linux 5.11: 1.264 (SE +/- 0.002, N = 3). Build: (CXX) g++ options: -O3 -pthread
WebP2 Image Encode 20210126, Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better). 5.12 sched: 215.28 (SE +/- 0.64, N = 3); Linux 5.11: 214.94 (SE +/- 0.69, N = 3). Build: (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
OpenFOAM 8, Input: Motorbike 30M (Seconds, Fewer Is Better). 5.12 sched: 97.49 (SE +/- 0.14, N = 3); Linux 5.11: 97.64 (SE +/- 0.07, N = 3). Build: (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
dav1d 0.8.1, Video Input: Chimera 1080p (FPS, More Is Better). 5.12 sched: 838.25 (SE +/- 5.53, N = 3; MIN: 588.61 / MAX: 1047.17); Linux 5.11: 837.08 (SE +/- 7.91, N = 3; MIN: 547.13 / MAX: 1054.47). Build: (CC) gcc options: -pthread
m-queens 1.2, Time To Solve (Seconds, Fewer Is Better). 5.12 sched: 30.86 (SE +/- 0.03, N = 3); Linux 5.11: 30.81 (SE +/- 0.02, N = 3). Build: (CXX) g++ options: -fopenmp -O2 -march=native
ParaView 5.9, Test: Many Spheres - Resolution: 3840 x 2160 (MiPolys / Sec, More Is Better). 5.12 sched: 6291.43 (SE +/- 3.70, N = 3); Linux 5.11: 6294.77 (SE +/- 1.21, N = 3).
ASKAP 1.0, Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better). 5.12 sched: 1343.89 (SE +/- 2.63, N = 3); Linux 5.11: 1344.59 (SE +/- 1.91, N = 3). Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ParaView 5.9, Test: Many Spheres - Resolution: 3840 x 2160 (Frames / Sec, More Is Better). 5.12 sched: 62.75 (SE +/- 0.04, N = 3); Linux 5.11: 62.78 (SE +/- 0.01, N = 3).
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better). 5.12 sched: 9930.81 (SE +/- 6.34, N = 3); Linux 5.11: 9935.40 (SE +/- 3.64, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better). 5.12 sched: 7883.48 (SE +/- 21.12, N = 3); Linux 5.11: 7886.19 (SE +/- 10.12, N = 3). Build: (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz; Open MPI 4.0.3
ASKAP 1.0, Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better). 5.12 sched: 784.78 (SE +/- 2.00, N = 3); Linux 5.11: 785.02 (SE +/- 1.97, N = 3). Build: (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better). 5.12 sched: 1.08100 (SE +/- 0.00363, N = 3); Linux 5.11: 1.08110 (SE +/- 0.00393, N = 3).
JPEG XL Decoding 0.3.1, CPU Threads: All (MP/s, More Is Better). 5.12 sched: 202.61 (SE +/- 0.33, N = 3); Linux 5.11: 202.60 (SE +/- 0.31, N = 3).
CLOMP 1.2, Static OMP Speedup (Speedup, More Is Better). 5.12 sched: 21.2 (SE +/- 0.07, N = 3); Linux 5.11: 21.2 (SE +/- 0.09, N = 3). Build: (CC) gcc options: -fopenmp -O3 -lm
simdjson 0.7.1, Throughput Test: PartialTweets (GB/s, More Is Better). 5.12 sched: 0.94 (SE +/- 0.02, N = 15); Linux 5.11: 0.90 (SE +/- 0.01, N = 15). Build: (CXX) g++ options: -O3 -pthread
simdjson 0.7.1, Throughput Test: Kostya (GB/s, More Is Better). 5.12 sched: 0.71 (SE +/- 0.03, N = 15); Linux 5.11: 0.67 (SE +/- 0.02, N = 12). Build: (CXX) g++ options: -O3 -pthread
ParaView 5.9, Test: Wavelet Volume - Resolution: 1920 x 1080 (MiVoxels / Sec, More Is Better). 5.12 sched: 7416.99 (SE +/- 229.82, N = 12); Linux 5.11: 7823.40 (SE +/- 31.61, N = 3).
ParaView 5.9, Test: Wavelet Volume - Resolution: 1920 x 1080 (Frames / Sec, More Is Better). 5.12 sched: 463.56 (SE +/- 14.36, N = 12); Linux 5.11: 488.96 (SE +/- 1.98, N = 3).
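The "SE +/-" figures give a rough way to judge whether a gap between the two kernels is larger than run-to-run noise. The Python sketch below divides the difference of two means by their combined standard error; the function name `gap_in_standard_errors` is hypothetical, and the combination rule (root sum of squares, treating the two runs as independent) is an assumption rather than something the Phoronix Test Suite reports. With sample counts as small as N = 3, this is a heuristic and not a formal significance test.

```python
import math


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Difference of two means in units of their combined standard error.

    Assumes the two results are independent, so the standard errors
    combine as a root sum of squares. Hypothetical helper for illustration.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined_se == 0.0:
        raise ValueError("both standard errors are zero")
    return (mean_a - mean_b) / combined_se


# ParaView Wavelet Volume, 1920 x 1080, Frames / Sec:
# Linux 5.11: 488.96 (SE 1.98), 5.12 sched: 463.56 (SE 14.36)
paraview_gap = gap_in_standard_errors(488.96, 1.98, 463.56, 14.36)

# Etcpak ETC2, Mpx/s:
# Linux 5.11: 241.11 (SE 3.17), 5.12 sched: 235.61 (SE 1.05)
etcpak_gap = gap_in_standard_errors(241.11, 3.17, 235.61, 1.05)

print(f"ParaView gap: {paraview_gap:.2f} combined SE")
print(f"Etcpak gap: {etcpak_gap:.2f} combined SE")
```

A gap of only one or two combined standard errors, as with the ParaView result where the 5.12 sched run needed N = 12 samples, suggests the difference may be mostly variance rather than a scheduler effect.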
Phoronix Test Suite v10.8.5