Intel Core i5-3470 testing with an ECS H61H2-I3 v1.0 (4.6.5 BIOS) and ECS Intel Ivybridge Desktop 2GB on Ubuntu 19.10 via the Phoronix Test Suite.
ivy 2021

Runs 1, 2, 3

Processor: Intel Core i5-3470 @ 3.60GHz (4 Cores)
Motherboard: ECS H61H2-I3 v1.0 (4.6.5 BIOS)
Chipset: Intel Xeon E3-1200 v2/3rd
Memory: 4096MB
Disk: 120GB Samsung SSD 850
Graphics: ECS Intel Ivybridge Desktop 2GB (1100MHz)
Audio: VIA VT1705
Monitor: DELL S2409W
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 19.10
Kernel: 5.9.0-050900rc8daily20201010-generic (x86_64) 20201009
Desktop: GNOME Shell 3.34.3
Display Server: X Server 1.20.5
OpenGL: 4.2 Mesa 19.2.8
Vulkan: 1.1.102
Compiler: GCC 9.2.1 20191008
File-System: ext4
Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x21 - Thermald 1.9
Python Notes: Python 2.7.17 + Python 3.7.5
Security Notes:
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled
  mds: Mitigation of Clear buffers; SMT disabled
  meltdown: Mitigation of PTI
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling
  srbds: Vulnerable: No microcode
  tsx_async_abort: Not affected
Result Overview (Phoronix Test Suite), runs 1, 2, 3, relative scale 100% to 103%. Benchmarks shown: NAS Parallel Benchmarks, ONNX Runtime, Chaos Group V-RAY, Ngspice, Mobile Neural Network, ASKAP, Timed Godot Game Engine Compilation, QMCPACK, JPEG XL, Pennant, WebP2 Image Encode, QuantLib, GnuPG, LAMMPS Molecular Dynamics Simulator, dav1d, TNN, rav1e, Gcrypt Library, LULESH, lzbench, Algebraic Multi-Grid Benchmark, Google SynthMark, GROMACS, CloverLeaf, CP2K Molecular Dynamics.
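A relative overview of this kind can be approximated by taking, for each run, the geometric mean of a benchmark's results and scaling so that the lowest run reads 100%. The sketch below does this for the six NAS Parallel Benchmarks results reported in this file. The normalisation used here is an assumption made for illustration and is not confirmed to be the method the Phoronix Test Suite uses for its chart; the names `geometric_mean` and `relative_overview` are hypothetical.

```python
from math import exp, log

# Total Mop/s (more is better) copied from the NAS Parallel Benchmarks
# results in this file, in the order BT.C, CG.C, EP.C, LU.C, MG.C, SP.B.
# Keys are the run identifiers 1, 2 and 3.
npb_results = {
    "1": [9596.20, 2333.10, 369.38, 9745.21, 5632.10, 3467.61],
    "2": [9575.54, 2360.80, 362.84, 9738.48, 4843.62, 3467.07],
    "3": [9603.66, 2364.66, 368.38, 9732.86, 4249.13, 3468.52],
}


def geometric_mean(values):
    """Return the geometric mean of a list of positive numbers."""
    return exp(sum(log(v) for v in values) / len(values))


def relative_overview(results):
    """Scale each run's geometric mean so the lowest run equals 100%.

    ASSUMPTION: illustrative normalisation only, valid for
    more-is-better metrics; not taken from the test suite itself.
    """
    means = {run: geometric_mean(vals) for run, vals in results.items()}
    baseline = min(means.values())
    return {run: 100.0 * m / baseline for run, m in means.items()}


overview = relative_overview(npb_results)
for run, pct in sorted(overview.items()):
    print(f"run {run}: {pct:.1f}%")
```

For fewer-is-better metrics (seconds, ms) the ratio would need to be inverted before averaging; that case is left out of this sketch.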
ivy 2021

Test | 1 | 2 | 3
amg | 137441333 | 137311467 | 137385133
dav1d: Chimera 1080p | 159.32 | 159.46 | 159.81
dav1d: Summer Nature 4K | 41.68 | 41.84 | 41.87
dav1d: Summer Nature 1080p | 148.69 | 149.47 | 149.63
dav1d: Chimera 1080p 10-bit | 46.95 | 47.00 | 47.00
rav1e: 6 | 0.586 | 0.586 | 0.587
rav1e: 10 | 1.621 | 1.613 | 1.621
onnx: yolov4 - OpenMP CPU | 107 | 108 | 108
onnx: bertsquad-10 - OpenMP CPU | 142 | 144 | 148
onnx: fcn-resnet101-11 - OpenMP CPU | 15 | 15 | 16
onnx: shufflenet-v2-10 - OpenMP CPU | 8477 | 8456 | 8486
onnx: super-resolution-10 - OpenMP CPU | 1027 | 1033 | 1039
askap: Hogbom Clean OpenMP | 95.6633 | 95.6938 | 95.7243
lzbench: XZ 0 - Compression | 31 | 31 | 30
lzbench: XZ 0 - Decompression | 96 | 95 | 96
lzbench: Zstd 1 - Compression | 375 | 376 | 376
lzbench: Zstd 1 - Decompression | 1247 | 1247 | 1248
lzbench: Zstd 8 - Compression | 66 | 67 | 68
lzbench: Zstd 8 - Decompression | 1342 | 1340 | 1341
lzbench: Crush 0 - Compression | 74 | 75 | 75
lzbench: Crush 0 - Decompression | 409 | 409 | 409
lzbench: Brotli 0 - Compression | 377 | 378 | 378
lzbench: Brotli 0 - Decompression | 494 | 495 | 495
lzbench: Brotli 2 - Compression | 148 | 148 | 148
lzbench: Brotli 2 - Decompression | 579 | 580 | 581
lzbench: Libdeflate 1 - Compression | 186 | 185 | 185
lzbench: Libdeflate 1 - Decompression | 852 | 851 | 851
quantlib | 1438.3 | 1429.2 | 1433.2
askap: tConvolve MT - Gridding | 719.162 | 715.145 | 719.645
askap: tConvolve MT - Degridding | 938.098 | 940.570 | 939.238
askap: tConvolve OpenMP - Gridding | 703.761 | 661.289 | 663.988
askap: tConvolve OpenMP - Degridding | 983.507 | 983.384 | 981.326
jpegxl: 5 | 37.90 | 37.63 | 37.49
jpegxl: 7 | 37.78 | 37.74 | 37.44
jpegxl: 8 | 18.02 | 17.81 | 17.90
askap: tConvolve MPI - Degridding | 680.010 | 690.666 | 677.687
askap: tConvolve MPI - Gridding | 870.816 | 870.791 | 840.475
gromacs: water_GMX50_bare | 0.252 | 0.252 | 0.252
lammps: Rhodopsin Protein | 2.314 | 2.315 | 2.323
npb: BT.C | 9596.20 | 9575.54 | 9603.66
npb: CG.C | 2333.10 | 2360.80 | 2364.66
npb: EP.C | 369.38 | 362.84 | 368.38
npb: LU.C | 9745.21 | 9738.48 | 9732.86
npb: MG.C | 5632.10 | 4843.62 | 4249.13
npb: SP.B | 3467.61 | 3467.07 | 3468.52
synthmark: VoiceMark_100 | 515.603 | 515.571 | 515.512
v-ray: CPU | 1878 | 1848 | 1854
lulesh | 931.91497 | 932.69923 | 930.64117
pennant: sedovbig | 255.0827 | 254.9023 | 254.2454
pennant: leblancbig | 156.9763 | 158.9725 | 156.9348
mnn: SqueezeNetV1.0 | 16.717 | 17.024 | 16.921
mnn: resnet-v2-50 | 85.191 | 84.464 | 84.271
mnn: MobileNetV2_224 | 9.952 | 10.214 | 10.038
mnn: mobilenet-v1-1.0 | 15.354 | 15.746 | 15.559
mnn: inception-v3 | 121.401 | 122.517 | 122.141
tnn: CPU - MobileNet v2 | 353.462 | 353.821 | 352.715
tnn: CPU - SqueezeNet v1.1 | 325.168 | 325.892 | 325.413
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 311.70 | 311.54 | 311.57
cp2k: Fayalite-FIST Data | 1785.885 | 1782.45 | 1795.445
build-godot: Time To Compile | 521.133 | 518.069 | 515.838
gcrypt | 294.478 | 294.955 | 294.814
ngspice: C2670 | 203.968 | 206.636 | 206.932
ngspice: C7552 | 160.837 | 163.389 | 162.547
webp2: Default | 13.040 | 12.974 | 12.967
webp2: Quality 75, Compression Effort 7 | 911.164 | 902.473 | 905.442
webp2: Quality 95, Compression Effort 7 | 1664.795 | 1650.573 | 1649.930
webp2: Quality 100, Compression Effort 5 | 49.990 | 49.591 | 49.460
gnupg: 2.7GB Sample File Encryption | 87.279 | 87.090 | 86.905
qmcpack: simple-H2O | 51.475 | 51.847 | 51.344
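Because the three runs use the same hardware and software, a quick way to read the table is to look at how far apart the runs are for each test. The sketch below computes the gap between the highest and lowest run as a percentage of the lowest, for a handful of rows copied from the table. The helper name `relative_spread` and the choice of rows are illustrative assumptions.

```python
def relative_spread(values):
    """Gap between the largest and smallest value, as a % of the smallest."""
    low, high = min(values), max(values)
    return 100.0 * (high - low) / low


# (run 1, run 2, run 3) values copied from the results table.
rows = {
    "dav1d: Chimera 1080p": (159.32, 159.46, 159.81),
    "askap: tConvolve OpenMP - Gridding": (703.761, 661.289, 663.988),
    "npb: MG.C": (5632.10, 4843.62, 4249.13),
    "gcrypt": (294.478, 294.955, 294.814),
}

spreads = {name: relative_spread(vals) for name, vals in rows.items()}

# Print the rows with the widest run-to-run gap first.
for name, pct in sorted(spreads.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {pct:.2f}% spread")
```

The measure ignores whether a metric is more-is-better or fewer-is-better; it only flags rows where the runs disagree noticeably.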
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
Run 3: 137385133 (SE +/- 144244.08, N = 3)
Run 2: 137311467 (SE +/- 109870.97, N = 3)
Run 1: 137441333 (SE +/- 56286.09, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
dav1d 0.8.1 - Video Input: Chimera 1080p (FPS, More Is Better)
Run 3: 159.81 (SE +/- 0.13, N = 3; MIN: 117.85 / MAX: 287.15)
Run 2: 159.46 (SE +/- 0.16, N = 3; MIN: 117.71 / MAX: 286.64)
Run 1: 159.32 (SE +/- 0.15, N = 3; MIN: 117.77 / MAX: 286.59)

dav1d 0.8.1 - Video Input: Summer Nature 4K (FPS, More Is Better)
Run 3: 41.87 (SE +/- 0.11, N = 3; MIN: 38.51 / MAX: 46.76)
Run 2: 41.84 (SE +/- 0.13, N = 3; MIN: 38.43 / MAX: 46.88)
Run 1: 41.68 (SE +/- 0.14, N = 3; MIN: 37.97 / MAX: 46.92)

dav1d 0.8.1 - Video Input: Summer Nature 1080p (FPS, More Is Better)
Run 3: 149.63 (SE +/- 0.26, N = 3; MIN: 136.65 / MAX: 163.18)
Run 2: 149.47 (SE +/- 0.27, N = 3; MIN: 135.64 / MAX: 163.7)
Run 1: 148.69 (SE +/- 0.31, N = 3; MIN: 135.56 / MAX: 162.29)

dav1d 0.8.1 - Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
Run 3: 47.00 (SE +/- 0.03, N = 3; MIN: 31.92 / MAX: 105.82)
Run 2: 47.00 (SE +/- 0.00, N = 3; MIN: 31.93 / MAX: 106.12)
Run 1: 46.95 (SE +/- 0.07, N = 3; MIN: 31.7 / MAX: 107.29)

1. (CC) gcc options: -pthread (all dav1d results)

rav1e 0.4 - Speed: 10 (Frames Per Second, More Is Better)
Run 3: 1.621 (SE +/- 0.003, N = 3)
Run 2: 1.613 (SE +/- 0.001, N = 3)
Run 1: 1.621 (SE +/- 0.003, N = 3)
ONNX Runtime ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.
ONNX Runtime 1.6 - Model: yolov4 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 3: 108 (SE +/- 0.88, N = 3)
Run 2: 108 (SE +/- 0.76, N = 3)
Run 1: 107 (SE +/- 1.01, N = 3)

ONNX Runtime 1.6 - Model: bertsquad-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 3: 148 (SE +/- 2.08, N = 3)
Run 2: 144 (SE +/- 0.17, N = 3)
Run 1: 142 (SE +/- 1.88, N = 3)

ONNX Runtime 1.6 - Model: fcn-resnet101-11 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 3: 16 (SE +/- 0.17, N = 3)
Run 2: 15 (SE +/- 0.11, N = 12)
Run 1: 15 (SE +/- 0.15, N = 6)

ONNX Runtime 1.6 - Model: shufflenet-v2-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 3: 8486 (SE +/- 7.40, N = 3)
Run 2: 8456 (SE +/- 15.69, N = 3)
Run 1: 8477 (SE +/- 12.23, N = 3)

ONNX Runtime 1.6 - Model: super-resolution-10 - Device: OpenMP CPU (Inferences Per Minute, More Is Better)
Run 3: 1039 (SE +/- 10.45, N = 5)
Run 2: 1033 (SE +/- 12.58, N = 3)
Run 1: 1027 (SE +/- 11.14, N = 4)

1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt (all ONNX Runtime results)
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
ASKAP 1.0 - Test: Hogbom Clean OpenMP (Iterations Per Second, More Is Better)
Run 3: 95.72 (SE +/- 0.03, N = 3)
Run 2: 95.69 (SE +/- 0.05, N = 3)
Run 1: 95.66 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
lzbench 1.8 (MB/s, More Is Better)

Test: XZ 0 - Process: Decompression
Run 3: 96
Run 2: 95
Run 1: 96
SE +/- 0.67, N = 3

Test: Zstd 1 - Process: Compression
Run 3: 376
Run 2: 376
Run 1: 375
SE +/- 0.33, N = 3

Test: Zstd 1 - Process: Decompression
Run 3: 1248
Run 2: 1247
Run 1: 1247
SE +/- 0.67, N = 3; SE +/- 1.00, N = 3

Test: Zstd 8 - Process: Compression
Run 3: 68 (SE +/- 0.58, N = 3)
Run 2: 67 (SE +/- 0.67, N = 3)
Run 1: 66 (SE +/- 0.67, N = 6)

Test: Zstd 8 - Process: Decompression
Run 3: 1341 (SE +/- 0.58, N = 3)
Run 2: 1340 (SE +/- 1.20, N = 3)
Run 1: 1342 (SE +/- 0.40, N = 6)

Test: Crush 0 - Process: Compression
Run 3: 75
Run 2: 75
Run 1: 74
SE +/- 0.67, N = 3

Test: Crush 0 - Process: Decompression
Run 3: 409
Run 2: 409
Run 1: 409
SE +/- 0.33, N = 3

Test: Brotli 0 - Process: Compression
Run 3: 378
Run 2: 378
Run 1: 377

Test: Brotli 0 - Process: Decompression
Run 3: 495
Run 2: 495
Run 1: 494
SE +/- 0.58, N = 3; SE +/- 1.33, N = 3

Test: Brotli 2 - Process: Compression
Run 3: 148
Run 2: 148
Run 1: 148

Test: Brotli 2 - Process: Decompression
Run 3: 581
Run 2: 580
Run 1: 579
SE +/- 1.20, N = 3

Test: Libdeflate 1 - Process: Compression
Run 3: 185
Run 2: 185
Run 1: 186
SE +/- 1.00, N = 3

Test: Libdeflate 1 - Process: Decompression
Run 3: 851 (SE +/- 0.33, N = 3)
Run 2: 851 (SE +/- 0.33, N = 3)
Run 1: 852 (SE +/- 0.33, N = 3)

1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3 (all lzbench results)
QuantLib QuantLib is an open-source library/framework for quantitative finance, covering modeling, trading, and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.
QuantLib 1.21 (MFLOPS, More Is Better)
Run 3: 1433.2 (SE +/- 15.12, N = 3)
Run 2: 1429.2 (SE +/- 12.95, N = 3)
Run 1: 1438.3 (SE +/- 12.77, N = 3)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lboost_timer -lboost_system -lboost_chrono
ASKAP 1.0 - Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
Run 3: 719.65 (SE +/- 3.49, N = 3)
Run 2: 715.15 (SE +/- 3.95, N = 3)
Run 1: 719.16 (SE +/- 3.65, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
Run 3: 939.24 (SE +/- 5.46, N = 3)
Run 2: 940.57 (SE +/- 2.46, N = 3)
Run 1: 938.10 (SE +/- 3.53, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve OpenMP - Gridding (Million Grid Points Per Second, More Is Better)
Run 3: 663.99 (SE +/- 1.65, N = 3)
Run 2: 661.29 (SE +/- 4.48, N = 13)
Run 1: 703.76 (SE +/- 5.49, N = 15)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
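The "SE +/-" figures become easier to compare across runs with different N once they are converted back to a sample standard deviation. The sketch below does that for the tConvolve OpenMP - Gridding result, assuming that SE denotes the standard error of the mean (standard deviation divided by the square root of N) and that SE entries are listed in the same 3, 2, 1 order as the results. Both points are assumptions about this export, and `std_dev_from_se` is a hypothetical helper name.

```python
from math import sqrt


def std_dev_from_se(se, n):
    """Recover the sample standard deviation from a standard error of the mean.

    ASSUMPTION: SE = standard deviation / sqrt(N).
    """
    return se * sqrt(n)


# (result, SE, N) for ASKAP tConvolve OpenMP - Gridding, keyed by run.
gridding = {
    "3": (663.99, 1.65, 3),
    "2": (661.29, 4.48, 13),
    "1": (703.76, 5.49, 15),
}

for run, (mean, se, n) in gridding.items():
    sd = std_dev_from_se(se, n)
    print(f"run {run}: sd ~ {sd:.2f} ({100.0 * sd / mean:.1f}% of the mean, N = {n})")
```

A larger N together with a larger SE, as in runs 1 and 2 here, suggests the individual trials varied more than in run 3.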
ASKAP 1.0 - Test: tConvolve OpenMP - Degridding (Million Grid Points Per Second, More Is Better)
Run 3: 981.33 (SE +/- 4.36, N = 3)
Run 2: 983.38 (SE +/- 2.06, N = 13)
Run 1: 983.51 (SE +/- 1.78, N = 15)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
JPEG XL

JPEG XL 0.3.1 - Encode Speed: 5 (MP/s, More Is Better)
Run 3: 37.49 (SE +/- 0.11, N = 3)
Run 2: 37.63 (SE +/- 0.10, N = 3)
Run 1: 37.90 (SE +/- 0.19, N = 3)

JPEG XL 0.3.1 - Encode Speed: 7 (MP/s, More Is Better)
Run 3: 37.44 (SE +/- 0.25, N = 3)
Run 2: 37.74 (SE +/- 0.14, N = 3)
Run 1: 37.78 (SE +/- 0.15, N = 3)

JPEG XL 0.3.1 - Encode Speed: 8 (MP/s, More Is Better)
Run 3: 17.90 (SE +/- 0.04, N = 3)
Run 2: 17.81 (SE +/- 0.03, N = 3)
Run 1: 18.02 (SE +/- 0.03, N = 3)

1. (CXX) g++ options: -funwind-tables -O3 -O2 -pthread -lbrotlicommon -lbrotlienc -lbrotlidec -ldl (all JPEG XL results)
ASKAP 1.0 - Test: tConvolve MPI - Degridding (Mpix/sec, More Is Better)
Run 3: 677.69 (SE +/- 9.14, N = 3)
Run 2: 690.67 (SE +/- 7.19, N = 3)
Run 1: 680.01 (SE +/- 8.76, N = 3)

ASKAP 1.0 - Test: tConvolve MPI - Gridding (Mpix/sec, More Is Better)
Run 3: 840.48 (SE +/- 22.67, N = 3)
Run 2: 870.79 (SE +/- 1.93, N = 3)
Run 1: 870.82 (SE +/- 3.84, N = 3)

1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp (both tConvolve MPI results)
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4 - Test / Class: BT.C (Total Mop/s, More Is Better)
Run 3: 9603.66 (SE +/- 83.58, N = 9)
Run 2: 9575.54 (SE +/- 77.59, N = 9)
Run 1: 9596.20 (SE +/- 109.35, N = 4)

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
Run 3: 2364.66 (SE +/- 11.55, N = 3)
Run 2: 2360.80 (SE +/- 6.77, N = 3)
Run 1: 2333.10 (SE +/- 3.50, N = 3)

NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
Run 3: 368.38 (SE +/- 3.61, N = 5)
Run 2: 362.84 (SE +/- 3.15, N = 8)
Run 1: 369.38 (SE +/- 3.84, N = 5)

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
Run 3: 9732.86 (SE +/- 6.28, N = 3)
Run 2: 9738.48 (SE +/- 4.42, N = 3)
Run 1: 9745.21 (SE +/- 10.55, N = 3)

NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better)
Run 3: 4249.13 (SE +/- 236.59, N = 15)
Run 2: 4843.62 (SE +/- 214.25, N = 15)
Run 1: 5632.10 (SE +/- 32.16, N = 3)

1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3 (all NAS Parallel Benchmarks results above)
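MG.C is the one NPB test where the runs differ widely, so it is worth checking whether the gap is larger than the reported measurement noise. The sketch below expresses the difference between two runs in units of their combined standard error, using the run 1 and run 3 figures for MG.C and, for contrast, LU.C. It assumes the SE entries are listed in the same 3, 2, 1 order as the results and treats the runs as independent; it is a rough screening heuristic, not a formal significance test, and `separation` is a hypothetical helper name.

```python
from math import sqrt


def separation(mean_a, se_a, mean_b, se_b):
    """Difference between two means in units of their combined standard error."""
    return abs(mean_a - mean_b) / sqrt(se_a ** 2 + se_b ** 2)


# Run 1 vs run 3, Total Mop/s, values and SE copied from the NPB results.
mg_c = separation(5632.10, 32.16, 4249.13, 236.59)
lu_c = separation(9745.21, 10.55, 9732.86, 6.28)

print(f"MG.C: runs 1 and 3 differ by {mg_c:.1f} combined standard errors")
print(f"LU.C: runs 1 and 3 differ by {lu_c:.1f} combined standard errors")
```

A value of several combined standard errors, as for MG.C, points to a real difference between runs, while a value near 1, as for LU.C, is consistent with ordinary trial-to-trial variation.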
NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s, More Is Better)
Run 3: 3468.52 (SE +/- 0.55, N = 3)
Run 2: 3467.07 (SE +/- 0.98, N = 3)
Run 1: 3467.61 (SE +/- 1.09, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi 2. Open MPI 3.1.3
Google SynthMark SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.
Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better)
Run 3: 515.51 (SE +/- 0.03, N = 3)
Run 2: 515.57 (SE +/- 0.10, N = 3)
Run 1: 515.60 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Chaos Group V-RAY This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.
Chaos Group V-RAY 5 - Mode: CPU (vsamples, More Is Better)
Run 3: 1854 (SE +/- 21.68, N = 4)
Run 2: 1848 (SE +/- 6.12, N = 3)
Run 1: 1878 (SE +/- 26.59, N = 3)
Pennant 1.0.1 - Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
Run 3: 156.93 (SE +/- 0.20, N = 3)
Run 2: 158.97 (SE +/- 1.61, N = 12)
Run 1: 156.98 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Mobile Neural Network MNN, the Mobile Neural Network, is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
Mobile Neural Network 1.1.1 - Model: SqueezeNetV1.0 (ms, Fewer Is Better)
Run 3: 16.92 (SE +/- 0.05, N = 3; MIN: 16.77 / MAX: 31.99)
Run 2: 17.02 (SE +/- 0.04, N = 3; MIN: 16.9 / MAX: 32.04)
Run 1: 16.72 (SE +/- 0.10, N = 3; MIN: 16.54 / MAX: 30.41)

Mobile Neural Network 1.1.1 - Model: resnet-v2-50 (ms, Fewer Is Better)
Run 3: 84.27 (SE +/- 0.24, N = 3; MIN: 74.33 / MAX: 125.18)
Run 2: 84.46 (SE +/- 0.09, N = 3; MIN: 74.6 / MAX: 99.56)
Run 1: 85.19 (SE +/- 0.15, N = 3; MIN: 75.21 / MAX: 129.47)

Mobile Neural Network 1.1.1 - Model: MobileNetV2_224 (ms, Fewer Is Better)
Run 3: 10.038 (SE +/- 0.089, N = 3; MIN: 9.88 / MAX: 23.88)
Run 2: 10.214 (SE +/- 0.032, N = 3; MIN: 10.09 / MAX: 11.26)
Run 1: 9.952 (SE +/- 0.006, N = 3; MIN: 9.88 / MAX: 24.01)

Mobile Neural Network 1.1.1 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better)
Run 3: 15.56 (SE +/- 0.04, N = 3; MIN: 15.43 / MAX: 29.63)
Run 2: 15.75 (SE +/- 0.04, N = 3; MIN: 15.62 / MAX: 55.65)
Run 1: 15.35 (SE +/- 0.01, N = 3; MIN: 15.27 / MAX: 29.56)

Mobile Neural Network 1.1.1 - Model: inception-v3 (ms, Fewer Is Better)
Run 3: 122.14 (SE +/- 0.40, N = 3; MIN: 121.15 / MAX: 136.58)
Run 2: 122.52 (SE +/- 0.36, N = 3; MIN: 121.56 / MAX: 169.78)
Run 1: 121.40 (SE +/- 0.16, N = 3; MIN: 120.8 / MAX: 164.42)

1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl (all Mobile Neural Network results)
TNN TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.
TNN 0.2.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better)
Run 3: 352.72 (SE +/- 0.31, N = 3; MIN: 350.01 / MAX: 363.02)
Run 2: 353.82 (SE +/- 1.21, N = 3; MIN: 351.96 / MAX: 360.16)
Run 1: 353.46 (SE +/- 0.48, N = 3; MIN: 352.48 / MAX: 359.71)

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better)
Run 3: 325.41 (SE +/- 0.79, N = 3; MIN: 322.96 / MAX: 329.42)
Run 2: 325.89 (SE +/- 0.36, N = 3; MIN: 324.53 / MAX: 328.35)
Run 1: 325.17 (SE +/- 0.29, N = 3; MIN: 323.52 / MAX: 326.8)

1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl (both TNN results)
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently uses CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
CloverLeaf - Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
Run 3: 311.57 (SE +/- 0.05, N = 3)
Run 2: 311.54 (SE +/- 0.05, N = 3)
Run 1: 311.70 (SE +/- 0.15, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Gcrypt Library Libgcrypt is a general-purpose cryptographic library developed as part of the GnuPG project. This test runs libgcrypt's integrated benchmark and measures the time to complete the benchmark command with the cipher/mac/hash repetition count set to 50, as a simple, high-level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.
Gcrypt Library 1.9 (Seconds, Fewer Is Better)
Run 3: 294.81 (SE +/- 0.37, N = 3)
Run 2: 294.96 (SE +/- 0.65, N = 3)
Run 1: 294.48 (SE +/- 0.54, N = 3)
1. (CC) gcc options: -O2 -fvisibility=hidden
Ngspice Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.
Ngspice 34 - Circuit: C2670 (Seconds, Fewer Is Better)
Run 3: 206.93 (SE +/- 0.17, N = 3)
Run 2: 206.64 (SE +/- 0.61, N = 3)
Run 1: 203.97 (SE +/- 0.88, N = 3)

Ngspice 34 - Circuit: C7552 (Seconds, Fewer Is Better)
Run 3: 162.55 (SE +/- 0.36, N = 3)
Run 2: 163.39 (SE +/- 0.86, N = 3)
Run 1: 160.84 (SE +/- 0.39, N = 3)

1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE (both Ngspice results)
WebP2 Image Encode This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation, and full multi-threading. Learn more via the OpenBenchmarking.org test page.
WebP2 Image Encode 20210126 - Encode Settings: Default (Seconds, Fewer Is Better)
Run 3: 12.97 (SE +/- 0.01, N = 3)
Run 2: 12.97 (SE +/- 0.02, N = 3)
Run 1: 13.04 (SE +/- 0.07, N = 3)

WebP2 Image Encode 20210126 - Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better)
Run 3: 905.44 (SE +/- 0.26, N = 3)
Run 2: 902.47 (SE +/- 2.54, N = 3)
Run 1: 911.16 (SE +/- 1.26, N = 3)

WebP2 Image Encode 20210126 - Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better)
Run 3: 1649.93 (SE +/- 3.32, N = 3)
Run 2: 1650.57 (SE +/- 0.75, N = 3)
Run 1: 1664.80 (SE +/- 3.91, N = 3)

WebP2 Image Encode 20210126 - Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better)
Run 3: 49.46 (SE +/- 0.24, N = 3)
Run 2: 49.59 (SE +/- 0.21, N = 3)
Run 1: 49.99 (SE +/- 0.24, N = 3)

1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lOpenGL -lGLX -lGLU -lglut -lXmu -lXi -lpthread -ljpeg -lwebp -lwebpdemux (all WebP2 results)
QMCPACK QMCPACK is a modern, high-performance, open-source Quantum Monte Carlo (QMC) simulation code; this benchmark runs the H2O example with MPI. QMCPACK is a production-level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids, and is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.10 - Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
Run 3: 51.34 (SE +/- 0.49, N = 3)
Run 2: 51.85 (SE +/- 0.29, N = 3)
Run 1: 51.48 (SE +/- 0.61, N = 3)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -lm -pthread
1: Testing initiated at 11 February 2021 13:09 by user phoronix.
2: Testing initiated at 12 February 2021 08:11 by user phoronix.
3: Testing initiated at 12 February 2021 20:02 by user phoronix.