990KS March

Intel Core i9-9900KS testing with an ASUS PRIME Z390-A (1502 BIOS) and ASUS Intel UHD 630 CFL GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104012-IB-990KSMARC41&sor&grt.

990KS March - System Details (result identifiers: 1, 2, 3)

Processor: Intel Core i9-9900KS @ 5.00GHz (8 Cores / 16 Threads)
Motherboard: ASUS PRIME Z390-A (1502 BIOS)
Chipset: Intel Cannon Lake PCH
Memory: 32GB
Disk: 240GB Corsair Force MP510
Graphics: ASUS Intel UHD 630 CFL GT2 3GB (1200MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc8daily20201005-generic (x86_64) 20201004
Desktop: GNOME Shell 3.36.2
Display Server: X Server 1.20.8
OpenGL: 4.6 Mesa 20.2.6
OpenCL: OpenCL 2.1
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xcc - Thermald 1.9.1

Python Details: Python 2.7.18rc1 + Python 3.8.2

Security Details: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

990KS Marchaom-av1: Speed 0 Two-Pass - Bosphorus 4Kaom-av1: Speed 4 Two-Pass - Bosphorus 4Kaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Kaom-av1: Speed 0 Two-Pass - Bosphorus 1080paom-av1: Speed 4 Two-Pass - Bosphorus 1080paom-av1: Speed 6 Realtime - Bosphorus 1080paom-av1: Speed 6 Two-Pass - Bosphorus 1080paom-av1: Speed 8 Realtime - Bosphorus 1080paom-av1: Speed 9 Realtime - Bosphorus 1080pastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3botan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptdav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitgmpbench: Total Timegnuradio: Five Back to Back FIR Filtersgnuradio: Signal Source (Cosine)gnuradio: FIR Filtergnuradio: IIR Filtergnuradio: FM Deemphasis Filtergnuradio: Hilbert Transformavifenc: 0avifenc: 2avifenc: 6avifenc: 10avifenc: 6, Losslessavifenc: 10, Losslessliquid-dsp: 1 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 8 - 256 - 57liquid-dsp: 16 - 256 - 57luaradio: Five Back to Back FIR Filtersluaradio: FM Deemphasis Filterluaradio: Hilbert Transformluaradio: Complex Phasemnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3onednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch 
shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUopenscad: Pistolopenscad: Retro Caropenscad: Mini-ITX Caseopenscad: Projector Mount Swivelopenscad: Leonardo Phone Case Slimshoc: OpenCL - S3Dshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Reductionshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDsrslte: OFDM_Testsrslte: PHY_DL_Testsrslte: PHY_DL_Teststockfish: Total Timesvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080psysbench: RAM / Memorysysbench: CPUsystemd-boot-total: Totalsystemd-boot-total: Kernelsystemd-boot-total: Loadersystemd-boot-total: Firmwaresystemd-boot-total: Userspacebuild-erlang: Time To Compilebuild-linux-kernel: Time To Compilebuild-mesa: Time To Compilebuild-nodejs: Time To Compiletoybrot: TBBtoybrot: OpenMPtoybrot: C++ Taskstoybrot: C++ Threadsviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - 
sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directioncompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 8 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speed1230.123.4213.166.3438.6847.780.386.7625.0619.68124.39141.485.794914.9207113.887323.4557.17233.46163.543112.980107.8614815.9724806.102436.078437.297543.597539.238171.954172.091916.108908.311733.64185.13648.26160.916250.31190.23003.2822.3666.1836.7618.072.07437.25312.9523.15462.6745.778676296671301833332508566674527133335003333331297.9486.887.2704.24.89925.3082.6722.32929.4473.577408.261031.667471.8321514.28308.838986.6399314.03352.144903.977063408.161852.133414.521852.263.154333413.971854.883.44003100.76717.25941.90892.93116.97520.993115.605315.36820.386136.7934245.9201760.7029.211929.049556.90993.11.074.384.53126700000293.2112.7237778786.85106.19230.27173.31176.29136.7628391.4719380.3424050194135551567922109113.00094.71456.415477.0974405843866447564485428.242.847.926.239.344.145.546.534.834.136.535.532.934.637.734.035.33837.934.818.916.515.815.734.3375969121.6911702379.44186.5332.64312.434.53969.01164.04433.4395.14609.831.93957.50.123.4113.126.3237.9448.720.376.7324.9519.11122.64140.395.775714.9059113.773423.3907.09133.37463.560112.943107.7644814.8284811.181436.124437
.288543.385539.017171.951172.076913.025907.952731.76184.25649.67164.406221.81195.53011.4820.0664.9837.7618.372.25837.30413.0053.16062.3895.771676323331300466672515766674520733335002633331292.4487.787.1700.24.88225.2542.6552.33329.6703.574558.146951.665791.8291014.30318.865016.6465913.99882.143853.931983405.321851.143413.021850.823.151573445.231851.223.43359100.97817.34642.05393.90816.94120.851815.233615.37570.386136.7270245.8851761.0729.164129.052056.89323.101.074.384.55128466667291.8112.2235015916.84105.90229.42172.24175.27136.2228491.7419382.7124050194135551567922109112.55894.56956.468476.8014413543854448424477428.242.947.926.239.344.145.646.534.834.236.435.533.334.637.634.135.33838.134.718.916.515.815.734.4397227121.8260732374.54164.5327.64308.934.63983.11156.94437.7378.44601.331.83963.90.123.3913.146.3037.1248.780.386.7525.0719.21121.96139.825.773714.9103113.893323.3677.08933.38763.593112.985107.8784815.6174812.421434.656436.744543.951539.337171.958172.150914.454908.474732.83184.16647.63163.766217.61188.73011.2818.3665.0837.3620.271.79937.32612.9643.17062.3965.787675996671306633332514733334530866675000166671291.8489.587.5698.14.94325.4142.6832.34629.7953.565778.167881.66551.8234014.29368.876186.6155714.01062.144523.932163405.991852.053413.881851.303.149433415.351854.403.43914100.81317.36742.08494.96617.05420.847915.251815.35730.38636.7633245.7831760.6129.176629.153156.91753.091.074.374.54128500000295.0113.4238322086.84105.78229.36172.10175.06136.0528338.5919367.3024050194135551567922109112.48494.76856.475477.2844408543838447294480528.342.847.926.239.344.145.546.534.834.136.535.533.334.637.634.235.43837.934.818.916.415.815.734.3846003121.7424722388.44180.3327.54307.734.63981.91177.14442.7387.24574.831.53939.4OpenBenchmarking.org

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 3: 0.12 (SE +/- 0.00, N = 3)
Run 2: 0.12 (SE +/- 0.00, N = 3)
Run 1: 0.12 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 3.42 (SE +/- 0.02, N = 3)
Run 2: 3.41 (SE +/- 0.02, N = 3)
Run 3: 3.39 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
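
Each result in this export carries an "SE +/- x, N = n" annotation, a standard error over N trial runs. The following is a minimal sketch of how such a figure can be derived, assuming the textbook definition (sample standard deviation divided by the square root of N). Whether the Phoronix Test Suite uses the sample or the population standard deviation is not stated in this export, and the per-trial values below are hypothetical because only the averages are reported here.

```python
import math
import statistics


def standard_error(samples):
    """Textbook standard error of the mean: sample std dev / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-trial frame rates (not taken from the export).
trials = [3.40, 3.42, 3.44]
mean = statistics.mean(trials)
se = standard_error(trials)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(trials)})")
```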

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 13.16 (SE +/- 0.02, N = 3)
Run 3: 13.14 (SE +/- 0.01, N = 3)
Run 2: 13.12 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 6.34 (SE +/- 0.01, N = 3)
Run 2: 6.32 (SE +/- 0.02, N = 3)
Run 3: 6.30 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 38.68 (SE +/- 0.05, N = 3)
Run 2: 37.94 (SE +/- 0.08, N = 3)
Run 3: 37.12 (SE +/- 0.51, N = 4)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
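
For the Speed 8 Realtime 4K result, run 1 (38.68, SE 0.05) and run 3 (37.12, SE 0.51) differ more than most other results in this set. One rough way to judge whether such a gap exceeds run-to-run noise is to divide the difference by the combined standard error. The sketch below assumes the two runs are independent and uses the hypothetical helper name `gap_in_standard_errors`; with only 3 and 4 trials it is a heuristic, not a formal significance test.

```python
import math


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Difference of two means expressed in combined standard errors."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined_se


# Values from the Speed 8 Realtime - Bosphorus 4K result above.
run1_mean, run1_se = 38.68, 0.05
run3_mean, run3_se = 37.12, 0.51

gap = gap_in_standard_errors(run1_mean, run1_se, run3_mean, run3_se)
print(f"difference = {run1_mean - run3_mean:.2f} FPS, "
      f"about {gap:.1f} combined standard errors")
```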

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 3: 48.78 (SE +/- 0.07, N = 3)
Run 2: 48.72 (SE +/- 0.10, N = 3)
Run 1: 47.78 (SE +/- 0.53, N = 15)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 3: 0.38 (SE +/- 0.00, N = 3)
Run 1: 0.38 (SE +/- 0.00, N = 3)
Run 2: 0.37 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 6.76 (SE +/- 0.01, N = 3)
Run 3: 6.75 (SE +/- 0.01, N = 3)
Run 2: 6.73 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 3: 25.07 (SE +/- 0.22, N = 3)
Run 1: 25.06 (SE +/- 0.24, N = 3)
Run 2: 24.95 (SE +/- 0.29, N = 5)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 19.68 (SE +/- 0.17, N = 3)
Run 3: 19.21 (SE +/- 0.01, N = 3)
Run 2: 19.11 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 124.39 (SE +/- 0.42, N = 3)
Run 2: 122.64 (SE +/- 1.89, N = 3)
Run 3: 121.96 (SE +/- 1.32, N = 12)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 - Frames Per Second, More Is Better
Run 1: 141.48 (SE +/- 1.63, N = 6)
Run 2: 140.39 (SE +/- 0.46, N = 3)
Run 3: 139.82 (SE +/- 2.02, N = 4)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

ASTC Encoder

Preset: Medium

ASTC Encoder 2.4 - Seconds, Fewer Is Better
Run 3: 5.7737 (SE +/- 0.0062, N = 3)
Run 2: 5.7757 (SE +/- 0.0051, N = 3)
Run 1: 5.7949 (SE +/- 0.0231, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.4 - Seconds, Fewer Is Better
Run 2: 14.91 (SE +/- 0.02, N = 3)
Run 3: 14.91 (SE +/- 0.02, N = 3)
Run 1: 14.92 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.4 - Seconds, Fewer Is Better
Run 2: 113.77 (SE +/- 0.14, N = 3)
Run 1: 113.89 (SE +/- 0.12, N = 3)
Run 3: 113.89 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Basis Universal

Settings: ETC1S

Basis Universal 1.13 - Seconds, Fewer Is Better
Run 3: 23.37 (SE +/- 0.05, N = 3)
Run 2: 23.39 (SE +/- 0.08, N = 3)
Run 1: 23.46 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.13 - Seconds, Fewer Is Better
Run 3: 7.089 (SE +/- 0.005, N = 3)
Run 2: 7.091 (SE +/- 0.003, N = 3)
Run 1: 7.172 (SE +/- 0.000, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13 - Seconds, Fewer Is Better
Run 2: 33.37 (SE +/- 0.10, N = 3)
Run 3: 33.39 (SE +/- 0.12, N = 3)
Run 1: 33.46 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13 - Seconds, Fewer Is Better
Run 1: 63.54 (SE +/- 0.03, N = 3)
Run 2: 63.56 (SE +/- 0.04, N = 3)
Run 3: 63.59 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Botan

Test: KASUMI

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 112.99 (SE +/- 0.05, N = 3)
Run 1: 112.98 (SE +/- 0.05, N = 3)
Run 2: 112.94 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 107.88 (SE +/- 0.04, N = 3)
Run 1: 107.86 (SE +/- 0.03, N = 3)
Run 2: 107.76 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3 - MiB/s, More Is Better
Run 1: 4815.97 (SE +/- 0.17, N = 3)
Run 3: 4815.62 (SE +/- 0.22, N = 3)
Run 2: 4814.83 (SE +/- 0.54, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 4812.42 (SE +/- 0.04, N = 3)
Run 2: 4811.18 (SE +/- 0.85, N = 3)
Run 1: 4806.10 (SE +/- 4.51, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

Botan 2.17.3 - MiB/s, More Is Better
Run 2: 436.12 (SE +/- 0.12, N = 3)
Run 1: 436.08 (SE +/- 0.16, N = 3)
Run 3: 434.66 (SE +/- 1.83, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3 - MiB/s, More Is Better
Run 1: 437.30 (SE +/- 0.18, N = 3)
Run 2: 437.29 (SE +/- 0.15, N = 3)
Run 3: 436.74 (SE +/- 0.95, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 543.95 (SE +/- 0.10, N = 3)
Run 1: 543.60 (SE +/- 0.05, N = 3)
Run 2: 543.39 (SE +/- 0.40, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 539.34 (SE +/- 0.03, N = 3)
Run 1: 539.24 (SE +/- 0.04, N = 3)
Run 2: 539.02 (SE +/- 0.37, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 171.96 (SE +/- 0.00, N = 3)
Run 1: 171.95 (SE +/- 0.01, N = 3)
Run 2: 171.95 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 172.15 (SE +/- 0.03, N = 3)
Run 1: 172.09 (SE +/- 0.01, N = 3)
Run 2: 172.08 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

Botan 2.17.3 - MiB/s, More Is Better
Run 1: 916.11 (SE +/- 0.12, N = 3)
Run 3: 914.45 (SE +/- 1.80, N = 3)
Run 2: 913.03 (SE +/- 3.02, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3 - MiB/s, More Is Better
Run 3: 908.47 (SE +/- 0.54, N = 3)
Run 1: 908.31 (SE +/- 0.56, N = 3)
Run 2: 907.95 (SE +/- 0.82, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

dav1d

Video Input: Chimera 1080p

dav1d 0.8.2 - FPS, More Is Better
Run 1: 733.64 (SE +/- 1.00, N = 3; MIN: 538.11 / MAX: 1144.18)
Run 3: 732.83 (SE +/- 0.90, N = 3; MIN: 537.85 / MAX: 1141.98)
Run 2: 731.76 (SE +/- 2.01, N = 3; MIN: 538.01 / MAX: 1138.02)
1. (CC) gcc options: -pthread -lm

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.2 - FPS, More Is Better
Run 1: 185.13 (SE +/- 0.14, N = 3; MIN: 174.56 / MAX: 210.52)
Run 2: 184.25 (SE +/- 0.13, N = 3; MIN: 173.91 / MAX: 209.75)
Run 3: 184.16 (SE +/- 0.19, N = 3; MIN: 173.76 / MAX: 209.16)
1. (CC) gcc options: -pthread -lm

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.2 - FPS, More Is Better
Run 2: 649.67 (SE +/- 0.67, N = 3; MIN: 588.3 / MAX: 717.46)
Run 1: 648.26 (SE +/- 1.42, N = 3; MIN: 589.55 / MAX: 721.08)
Run 3: 647.63 (SE +/- 0.54, N = 3; MIN: 582.84 / MAX: 717.06)
1. (CC) gcc options: -pthread -lm

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.2 - FPS, More Is Better
Run 2: 164.40 (SE +/- 1.75, N = 3; MIN: 103.73 / MAX: 392.07)
Run 3: 163.76 (SE +/- 1.42, N = 3; MIN: 104.03 / MAX: 384.96)
Run 1: 160.91 (SE +/- 0.13, N = 3; MIN: 103.41 / MAX: 410.16)
1. (CC) gcc options: -pthread -lm

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1 - GMPbench Score, More Is Better
Run 1: 6250.3
Run 2: 6221.8
Run 3: 6217.6
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

GNU Radio

Test: Five Back to Back FIR Filters

GNU Radio - MiB/s, More Is Better
Run 2: 1195.5 (SE +/- 9.72, N = 3)
Run 1: 1190.2 (SE +/- 12.45, N = 3)
Run 3: 1188.7 (SE +/- 15.25, N = 3)
1. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

GNU Radio - MiB/s, More Is Better
Run 2: 3011.4 (SE +/- 2.10, N = 3)
Run 3: 3011.2 (SE +/- 2.72, N = 3)
Run 1: 3003.2 (SE +/- 3.36, N = 3)
1. 3.8.1.0

GNU Radio

Test: FIR Filter

GNU Radio - MiB/s, More Is Better
Run 1: 822.3 (SE +/- 1.65, N = 3)
Run 2: 820.0 (SE +/- 3.57, N = 3)
Run 3: 818.3 (SE +/- 2.94, N = 3)
1. 3.8.1.0

GNU Radio

Test: IIR Filter

GNU Radio - MiB/s, More Is Better
Run 1: 666.1 (SE +/- 1.14, N = 3)
Run 3: 665.0 (SE +/- 0.81, N = 3)
Run 2: 664.9 (SE +/- 1.34, N = 3)
1. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

GNU Radio - MiB/s, More Is Better
Run 2: 837.7 (SE +/- 1.84, N = 3)
Run 3: 837.3 (SE +/- 1.55, N = 3)
Run 1: 836.7 (SE +/- 2.38, N = 3)
1. 3.8.1.0

GNU Radio

Test: Hilbert Transform

GNU Radio - MiB/s, More Is Better
Run 3: 620.2 (SE +/- 1.23, N = 3)
Run 2: 618.3 (SE +/- 2.62, N = 3)
Run 1: 618.0 (SE +/- 1.07, N = 3)
1. 3.8.1.0

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.9.0 - Seconds, Fewer Is Better
Run 3: 71.80 (SE +/- 0.26, N = 3)
Run 1: 72.07 (SE +/- 0.16, N = 3)
Run 2: 72.26 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.9.0 - Seconds, Fewer Is Better
Run 1: 37.25 (SE +/- 0.06, N = 3)
Run 2: 37.30 (SE +/- 0.11, N = 3)
Run 3: 37.33 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.9.0 - Seconds, Fewer Is Better
Run 1: 12.95 (SE +/- 0.03, N = 3)
Run 3: 12.96 (SE +/- 0.01, N = 3)
Run 2: 13.01 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10

libavif avifenc 0.9.0 - Seconds, Fewer Is Better
Run 1: 3.154 (SE +/- 0.004, N = 3)
Run 2: 3.160 (SE +/- 0.010, N = 3)
Run 3: 3.170 (SE +/- 0.012, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.9.0 - Seconds, Fewer Is Better
Run 2: 62.39 (SE +/- 0.11, N = 3)
Run 3: 62.40 (SE +/- 0.14, N = 3)
Run 1: 62.67 (SE +/- 0.15, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 0.9.0 - Seconds, Fewer Is Better
Run 2: 5.771 (SE +/- 0.013, N = 3)
Run 1: 5.778 (SE +/- 0.013, N = 3)
Run 3: 5.787 (SE +/- 0.012, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - samples/s, More Is Better
Run 2: 67632333 (SE +/- 1763.83, N = 3)
Run 1: 67629667 (SE +/- 881.92, N = 3)
Run 3: 67599667 (SE +/- 27834.83, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - samples/s, More Is Better
Run 3: 130663333 (SE +/- 35276.68, N = 3)
Run 1: 130183333 (SE +/- 40551.75, N = 3)
Run 2: 130046667 (SE +/- 367891.89, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - samples/s, More Is Better
Run 2: 251576667 (SE +/- 375470.08, N = 3)
Run 3: 251473333 (SE +/- 321679.62, N = 3)
Run 1: 250856667 (SE +/- 265476.51, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - samples/s, More Is Better
Run 3: 453086667 (SE +/- 2209451.92, N = 3)
Run 1: 452713333 (SE +/- 1523508.38, N = 3)
Run 2: 452073333 (SE +/- 514274.03, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - samples/s, More Is Better
Run 1: 500333333 (SE +/- 324054.18, N = 3)
Run 2: 500263333 (SE +/- 188355.46, N = 3)
Run 3: 500016667 (SE +/- 199360.09, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
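
The five Liquid-DSP results differ only in thread count, so they can be read as a scaling curve. As a rough sketch, the snippet below takes the run 1 throughput at each thread count from the results above and computes the speedup over one thread and the per-thread efficiency. The variable names are illustrative only. The processor is listed as 8 cores / 16 threads, so the 16-thread figure relies on SMT rather than additional physical cores.

```python
# Run 1 throughput in samples/s, keyed by thread count
# (Buffer Length: 256, Filter Length: 57).
run1_samples_per_sec = {
    1: 67629667,
    2: 130183333,
    4: 250856667,
    8: 452713333,
    16: 500333333,
}

baseline = run1_samples_per_sec[1]
speedups = {}
for threads, rate in run1_samples_per_sec.items():
    speedup = rate / baseline          # throughput relative to 1 thread
    efficiency = speedup / threads     # 1.0 would be perfect linear scaling
    speedups[threads] = speedup
    print(f"{threads:>2} threads: {speedup:5.2f}x speedup, "
          f"{efficiency:6.1%} efficiency")
```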

LuaRadio

Test: Five Back to Back FIR Filters

LuaRadio 0.9.1 - MiB/s, More Is Better
Run 1: 1297.9 (SE +/- 2.64, N = 3)
Run 2: 1292.4 (SE +/- 3.21, N = 3)
Run 3: 1291.8 (SE +/- 4.98, N = 3)

LuaRadio

Test: FM Deemphasis Filter

LuaRadio 0.9.1 - MiB/s, More Is Better
Run 3: 489.5 (SE +/- 0.27, N = 3)
Run 2: 487.7 (SE +/- 1.98, N = 3)
Run 1: 486.8 (SE +/- 1.86, N = 3)

LuaRadio

Test: Hilbert Transform

LuaRadio 0.9.1 - MiB/s, More Is Better
Run 3: 87.5 (SE +/- 0.32, N = 3)
Run 1: 87.2 (SE +/- 0.03, N = 3)
Run 2: 87.1 (SE +/- 0.06, N = 3)

LuaRadio

Test: Complex Phase

LuaRadio 0.9.1 - MiB/s, More Is Better
Run 1: 704.2 (SE +/- 5.92, N = 3)
Run 2: 700.2 (SE +/- 6.74, N = 3)
Run 3: 698.1 (SE +/- 11.13, N = 3)

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
Run 2: 4.882 (SE +/- 0.030, N = 3; MIN: 4.68 / MAX: 5.58)
Run 1: 4.899 (SE +/- 0.029, N = 3; MIN: 4.71 / MAX: 7.4)
Run 3: 4.943 (SE +/- 0.027, N = 3; MIN: 4.65 / MAX: 16.5)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
Run 2: 25.25 (SE +/- 0.02, N = 3; MIN: 25.09 / MAX: 35.04)
Run 1: 25.31 (SE +/- 0.04, N = 3; MIN: 25.08 / MAX: 37.21)
Run 3: 25.41 (SE +/- 0.05, N = 3; MIN: 25.17 / MAX: 37.2)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
Run 2: 2.655 (SE +/- 0.051, N = 3; MIN: 2.35 / MAX: 3.81)
Run 1: 2.672 (SE +/- 0.063, N = 3; MIN: 2.34 / MAX: 4.31)
Run 3: 2.683 (SE +/- 0.063, N = 3; MIN: 2.37 / MAX: 4.91)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
Run 1: 2.329 (SE +/- 0.007, N = 3; MIN: 2.26 / MAX: 4.96)
Run 2: 2.333 (SE +/- 0.006, N = 3; MIN: 2.27 / MAX: 3.55)
Run 3: 2.346 (SE +/- 0.010, N = 3; MIN: 2.28 / MAX: 3.63)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3 - ms, Fewer Is Better
Run 1: 29.45 (SE +/- 0.08, N = 3; MIN: 29.17 / MAX: 40.18)
Run 2: 29.67 (SE +/- 0.04, N = 3; MIN: 29.48 / MAX: 42.09)
Run 3: 29.80 (SE +/- 0.16, N = 3; MIN: 29.45 / MAX: 42.01)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 3: 3.56577 (SE +/- 0.00251, N = 3; MIN: 3.26)
Run 2: 3.57455 (SE +/- 0.00753, N = 3; MIN: 3.24)
Run 1: 3.57740 (SE +/- 0.01205, N = 3; MIN: 3.23)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 2: 8.14695 (SE +/- 0.00679, N = 3; MIN: 8)
Run 3: 8.16788 (SE +/- 0.01559, N = 3; MIN: 8.03)
Run 1: 8.26103 (SE +/- 0.00549, N = 3; MIN: 8.14)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 3: 1.66550 (SE +/- 0.00123, N = 3; MIN: 1.51)
Run 2: 1.66579 (SE +/- 0.00242, N = 3; MIN: 1.51)
Run 1: 1.66747 (SE +/- 0.00165, N = 3; MIN: 1.51)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 3: 1.82340 (SE +/- 0.00185, N = 3; MIN: 1.78)
Run 2: 1.82910 (SE +/- 0.00543, N = 3; MIN: 1.78)
Run 1: 1.83215 (SE +/- 0.00346, N = 3; MIN: 1.79)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 1: 14.28 (SE +/- 0.02, N = 3; MIN: 14.2)
Run 3: 14.29 (SE +/- 0.02, N = 3; MIN: 14.2)
Run 2: 14.30 (SE +/- 0.03, N = 3; MIN: 14.2)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 1: 8.83898 (SE +/- 0.00652, N = 3; MIN: 4.68)
Run 2: 8.86501 (SE +/- 0.01330, N = 3; MIN: 4.67)
Run 3: 8.87618 (SE +/- 0.00592, N = 3; MIN: 4.77)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 3: 6.61557 (SE +/- 0.00264, N = 3; MIN: 5.99)
Run 1: 6.63993 (SE +/- 0.00984, N = 3; MIN: 6)
Run 2: 6.64659 (SE +/- 0.01698, N = 3; MIN: 6)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 2: 14.00 (SE +/- 0.01, N = 3; MIN: 13.9)
Run 3: 14.01 (SE +/- 0.02, N = 3; MIN: 13.88)
Run 1: 14.03 (SE +/- 0.03, N = 3; MIN: 13.92)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - ms, Fewer Is Better
Run 2: 2.14385 (SE +/- 0.00224, N = 3; MIN: 1.97)
Run 3: 2.14452 (SE +/- 0.00152, N = 3; MIN: 1.97)
Run 1: 2.14490 (SE +/- 0.00353, N = 3; MIN: 1.94)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 2: 3.93198 (SE +/- 0.01212, N = 3; MIN: 3.64)
Run 3: 3.93216 (SE +/- 0.01771, N = 3; MIN: 3.62)
Run 1: 3.97706 (SE +/- 0.02149, N = 3; MIN: 3.65)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 2: 3405.32 (SE +/- 3.80, N = 3; MIN: 3391.39)
Run 3: 3405.99 (SE +/- 4.35, N = 3; MIN: 3393.27)
Run 1: 3408.16 (SE +/- 5.97, N = 3; MIN: 3391.46)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 2: 1851.14 (SE +/- 1.72, N = 3; MIN: 1843.95)
Run 3: 1852.05 (SE +/- 0.20, N = 3; MIN: 1846.88)
Run 1: 1852.13 (SE +/- 0.88, N = 3; MIN: 1846.26)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 2: 3413.02 (SE +/- 0.58, N = 3; MIN: 3406.86)
Run 3: 3413.88 (SE +/- 0.89, N = 3; MIN: 3407.32)
Run 1: 3414.52 (SE +/- 3.64, N = 3; MIN: 3404.15)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 2: 1850.82 (SE +/- 0.16, N = 3; MIN: 1844.82)
Run 3: 1851.30 (SE +/- 1.14, N = 3; MIN: 1843.57)
Run 1: 1852.26 (SE +/- 0.58, N = 3; MIN: 1846.85)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 3: 3.14943 (SE +/- 0.00951, N = 3; MIN: 3.09)
Run 2: 3.15157 (SE +/- 0.00442, N = 3; MIN: 3.1)
Run 1: 3.15433 (SE +/- 0.00701, N = 3; MIN: 3.1)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 1: 3413.97 (SE +/- 2.52, N = 3; MIN: 3402.03)
Run 3: 3415.35 (SE +/- 0.66, N = 3; MIN: 3409.68)
Run 2: 3445.23 (SE +/- 26.13, N = 3; MIN: 3408.23)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 2: 1851.22 (SE +/- 1.24, N = 3; MIN: 1841.63)
Run 3: 1854.40 (SE +/- 1.53, N = 3; MIN: 1846.77)
Run 1: 1854.88 (SE +/- 3.26, N = 3; MIN: 1847.25)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
Run 2: 3.43359 (SE +/- 0.00274, N = 3; MIN: 3.17)
Run 3: 3.43914 (SE +/- 0.00188, N = 3; MIN: 3.18)
Run 1: 3.44003 (SE +/- 0.00410, N = 3; MIN: 3.15)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenSCAD

Render: Pistol

OpenSCAD (Seconds, Fewer Is Better)
Run 1: 100.77 (SE +/- 0.07, N = 3)
Run 3: 100.81 (SE +/- 0.09, N = 3)
Run 2: 100.98 (SE +/- 0.21, N = 3)
1. OpenSCAD version 2019.05

OpenSCAD

Render: Retro Car

OpenSCAD (Seconds, Fewer Is Better)
Run 1: 17.26 (SE +/- 0.02, N = 3)
Run 2: 17.35 (SE +/- 0.10, N = 3)
Run 3: 17.37 (SE +/- 0.02, N = 3)
1. OpenSCAD version 2019.05

OpenSCAD

Render: Mini-ITX Case

OpenSCAD (Seconds, Fewer Is Better)
Run 1: 41.91 (SE +/- 0.07, N = 3)
Run 2: 42.05 (SE +/- 0.12, N = 3)
Run 3: 42.08 (SE +/- 0.02, N = 3)
1. OpenSCAD version 2019.05

OpenSCAD

Render: Projector Mount Swivel

OpenSCAD (Seconds, Fewer Is Better)
Run 1: 92.93 (SE +/- 0.21, N = 3)
Run 2: 93.91 (SE +/- 0.42, N = 3)
Run 3: 94.97 (SE +/- 0.08, N = 3)
1. OpenSCAD version 2019.05

OpenSCAD

Render: Leonardo Phone Case Slim

OpenSCAD (Seconds, Fewer Is Better)
Run 2: 16.94 (SE +/- 0.02, N = 3)
Run 1: 16.98 (SE +/- 0.01, N = 3)
Run 3: 17.05 (SE +/- 0.01, N = 3)
1. OpenSCAD version 2019.05

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
Run 1: 20.99 (SE +/- 0.04, N = 3)
Run 2: 20.85 (SE +/- 0.04, N = 3)
Run 3: 20.85 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
Run 1: 15.61 (SE +/- 0.03, N = 3)
Run 3: 15.25 (SE +/- 0.03, N = 3)
Run 2: 15.23 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
Run 2: 15.38 (SE +/- 0.01, N = 3)
Run 1: 15.37 (SE +/- 0.01, N = 3)
Run 3: 15.36 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GHash/s, More Is Better)
Run 2: 0.3861 (SE +/- 0.0000, N = 3)
Run 1: 0.3861 (SE +/- 0.0000, N = 3)
Run 3: 0.3860 (SE +/- 0.0000, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
Run 1: 36.79 (SE +/- 0.05, N = 3)
Run 3: 36.76 (SE +/- 0.03, N = 3)
Run 2: 36.73 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
Run 1: 245.92 (SE +/- 0.06, N = 3)
Run 2: 245.89 (SE +/- 0.06, N = 3)
Run 3: 245.78 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
Run 2: 1761.07 (SE +/- 0.55, N = 3)
Run 1: 1760.70 (SE +/- 1.38, N = 3)
Run 3: 1760.61 (SE +/- 0.83, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
Run 1: 29.21 (SE +/- 0.10, N = 3)
Run 3: 29.18 (SE +/- 0.16, N = 3)
Run 2: 29.16 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
Run 3: 29.15 (SE +/- 0.20, N = 3)
Run 2: 29.05 (SE +/- 0.10, N = 3)
Run 1: 29.05 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
Run 3: 56.92 (SE +/- 0.00, N = 3)
Run 1: 56.91 (SE +/- 0.02, N = 3)
Run 2: 56.89 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

simdjson

Throughput Test: Kostya

simdjson 0.8.2 (GB/s, More Is Better)
Run 2: 3.10 (SE +/- 0.00, N = 3)
Run 1: 3.10 (SE +/- 0.00, N = 3)
Run 3: 3.09 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

simdjson 0.8.2 (GB/s, More Is Better)
Run 3: 1.07 (SE +/- 0.00, N = 3)
Run 2: 1.07 (SE +/- 0.00, N = 3)
Run 1: 1.07 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.8.2 (GB/s, More Is Better)
Run 2: 4.38 (SE +/- 0.00, N = 3)
Run 1: 4.38 (SE +/- 0.00, N = 3)
Run 3: 4.37 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

simdjson 0.8.2 (GB/s, More Is Better)
Run 2: 4.55 (SE +/- 0.01, N = 3)
Run 3: 4.54 (SE +/- 0.01, N = 3)
Run 1: 4.53 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

srsLTE

Test: OFDM_Test

srsLTE 20.10.1 (Samples / Second, More Is Better)
Run 3: 128500000 (SE +/- 665832.81, N = 3)
Run 2: 128466667 (SE +/- 821245.67, N = 3)
Run 1: 126700000 (SE +/- 416333.20, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

srsLTE

Test: PHY_DL_Test

srsLTE 20.10.1 (eNb Mb/s, More Is Better)
Run 3: 295.0 (SE +/- 0.84, N = 3)
Run 1: 293.2 (SE +/- 0.19, N = 3)
Run 2: 291.8 (SE +/- 0.73, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

srsLTE

Test: PHY_DL_Test

srsLTE 20.10.1 (UE Mb/s, More Is Better)
Run 3: 113.4 (SE +/- 0.12, N = 3)
Run 1: 112.7 (SE +/- 0.09, N = 3)
Run 2: 112.2 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
Run 3: 23832208 (SE +/- 377829.22, N = 3)
Run 1: 23777878 (SE +/- 294389.71, N = 4)
Run 2: 23501591 (SE +/- 299604.32, N = 3)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Run 1: 6.85 (SE +/- 0.01, N = 3)
Run 3: 6.84 (SE +/- 0.00, N = 3)
Run 2: 6.84 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Run 1: 106.19 (SE +/- 0.13, N = 3)
Run 2: 105.90 (SE +/- 0.06, N = 3)
Run 3: 105.78 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Run 1: 230.27 (SE +/- 0.11, N = 3)
Run 2: 229.42 (SE +/- 0.19, N = 3)
Run 3: 229.36 (SE +/- 0.43, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
Run 1: 173.31 (SE +/- 0.19, N = 3)
Run 2: 172.24 (SE +/- 0.43, N = 3)
Run 3: 172.10 (SE +/- 0.26, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
Run 1: 176.29 (SE +/- 0.31, N = 3)
Run 2: 175.27 (SE +/- 0.47, N = 3)
Run 3: 175.06 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
Run 1: 136.76 (SE +/- 0.26, N = 3)
Run 2: 136.22 (SE +/- 0.13, N = 3)
Run 3: 136.05 (SE +/- 0.35, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Sysbench

Test: RAM / Memory

Sysbench 1.0.20 (MiB/sec, More Is Better)
Run 2: 28491.74 (SE +/- 138.44, N = 3)
Run 1: 28391.47 (SE +/- 98.12, N = 3)
Run 3: 28338.59 (SE +/- 41.95, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Sysbench

Test: CPU

Sysbench 1.0.20 (Events Per Second, More Is Better)
Run 2: 19382.71 (SE +/- 20.02, N = 3)
Run 1: 19380.34 (SE +/- 14.24, N = 3)
Run 3: 19367.30 (SE +/- 16.54, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Systemd Total Boot Time

Test: Total

Systemd Total Boot Time (ms, Fewer Is Better)
Run 1: 24050
Run 2: 24050
Run 3: 24050

Systemd Total Boot Time

Test: Kernel

Systemd Total Boot Time (ms, Fewer Is Better)
Run 1: 1941
Run 2: 1941
Run 3: 1941

Systemd Total Boot Time

Test: Loader

Systemd Total Boot Time (ms, Fewer Is Better)
Run 1: 3555
Run 2: 3555
Run 3: 3555

Systemd Total Boot Time

Test: Firmware

Systemd Total Boot Time (ms, Fewer Is Better)
Run 1: 15679
Run 2: 15679
Run 3: 15679

Systemd Total Boot Time

Test: Userspace

Systemd Total Boot Time (ms, Fewer Is Better)
Run 1: 22109
Run 2: 22109
Run 3: 22109

Timed Erlang/OTP Compilation

Time To Compile

Timed Erlang/OTP Compilation 23.2 (Seconds, Fewer Is Better)
Run 3: 112.48 (SE +/- 0.31, N = 3)
Run 2: 112.56 (SE +/- 0.40, N = 3)
Run 1: 113.00 (SE +/- 0.14, N = 3)

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.10.20 (Seconds, Fewer Is Better)
Run 2: 94.57 (SE +/- 0.60, N = 3)
Run 1: 94.71 (SE +/- 0.78, N = 3)
Run 3: 94.77 (SE +/- 0.47, N = 3)

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 21.0 (Seconds, Fewer Is Better)
Run 1: 56.42 (SE +/- 0.02, N = 3)
Run 2: 56.47 (SE +/- 0.04, N = 3)
Run 3: 56.48 (SE +/- 0.04, N = 3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11 (Seconds, Fewer Is Better)
Run 2: 476.80 (SE +/- 0.34, N = 3)
Run 1: 477.10 (SE +/- 0.32, N = 3)
Run 3: 477.28 (SE +/- 0.19, N = 3)

toyBrot Fractal Generator

Implementation: TBB

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 1: 44058 (SE +/- 59.38, N = 3)
Run 3: 44085 (SE +/- 82.93, N = 3)
Run 2: 44135 (SE +/- 72.84, N = 3)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 3: 43838 (SE +/- 19.34, N = 3)
Run 2: 43854 (SE +/- 16.82, N = 3)
Run 1: 43866 (SE +/- 13.86, N = 3)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: C++ Tasks

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 3: 44729 (SE +/- 38.73, N = 3)
Run 1: 44756 (SE +/- 16.42, N = 3)
Run 2: 44842 (SE +/- 118.82, N = 3)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

toyBrot Fractal Generator 2020-11-18 (ms, Fewer Is Better)
Run 2: 44774 (SE +/- 25.27, N = 3)
Run 3: 44805 (SE +/- 36.98, N = 3)
Run 1: 44854 (SE +/- 27.00, N = 3)
1. (CXX) g++ options: -O3 -lpthread

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 28.3 (SE +/- 0.03, N = 3)
Run 2: 28.2 (SE +/- 0.03, N = 3)
Run 1: 28.2 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 2: 42.9 (SE +/- 0.07, N = 3)
Run 3: 42.8 (SE +/- 0.09, N = 3)
Run 1: 42.8 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 47.9 (SE +/- 0.03, N = 3)
Run 2: 47.9 (SE +/- 0.03, N = 3)
Run 1: 47.9 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 26.2 (SE +/- 0.00, N = 3)
Run 2: 26.2 (SE +/- 0.00, N = 3)
Run 1: 26.2 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 39.3 (SE +/- 0.00, N = 3)
Run 2: 39.3 (SE +/- 0.00, N = 3)
Run 1: 39.3 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 44.1 (SE +/- 0.00, N = 3)
Run 2: 44.1 (SE +/- 0.00, N = 3)
Run 1: 44.1 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 2: 45.6 (SE +/- 0.03, N = 3)
Run 3: 45.5 (SE +/- 0.03, N = 3)
Run 1: 45.5 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 46.5 (SE +/- 0.00, N = 3)
Run 2: 46.5 (SE +/- 0.00, N = 3)
Run 1: 46.5 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 3: 34.8 (SE +/- 0.03, N = 3)
Run 2: 34.8 (SE +/- 0.00, N = 3)
Run 1: 34.8 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 2: 34.2 (SE +/- 0.07, N = 3)
Run 3: 34.1 (SE +/- 0.03, N = 3)
Run 1: 34.1 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 3: 36.5 (SE +/- 0.00, N = 3)
Run 1: 36.5 (SE +/- 0.03, N = 3)
Run 2: 36.4 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 3: 35.5 (SE +/- 0.03, N = 3)
Run 2: 35.5 (SE +/- 0.03, N = 3)
Run 1: 35.5 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 33.3 (SE +/- 0.06, N = 3)
Run 2: 33.3 (SE +/- 0.10, N = 3)
Run 1: 32.9 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 34.6 (SE +/- 0.00, N = 3)
Run 2: 34.6 (SE +/- 0.00, N = 3)
Run 1: 34.6 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 1: 37.7 (SE +/- 0.00, N = 3)
Run 3: 37.6 (SE +/- 0.03, N = 3)
Run 2: 37.6 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 34.2 (SE +/- 0.03, N = 3)
Run 2: 34.1 (SE +/- 0.03, N = 3)
Run 1: 34.0 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 35.4 (SE +/- 0.03, N = 3)
Run 2: 35.3 (SE +/- 0.00, N = 3)
Run 1: 35.3 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 38
Run 2: 38
Run 1: 38
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 2: 38.1
Run 3: 37.9
Run 1: 37.9
SE +/- 0.00, N = 3 (reported for one run only; the export does not identify which)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
Run 3: 34.8 (SE +/- 0.03, N = 3)
Run 1: 34.8 (SE +/- 0.00, N = 3)
Run 2: 34.7 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 3: 18.9 (SE +/- 0.00, N = 3)
Run 2: 18.9 (SE +/- 0.00, N = 3)
Run 1: 18.9 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 2: 16.5 (SE +/- 0.03, N = 3)
Run 1: 16.5 (SE +/- 0.00, N = 3)
Run 3: 16.4 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 3: 15.8 (SE +/- 0.03, N = 3)
Run 2: 15.8 (SE +/- 0.03, N = 3)
Run 1: 15.8 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
Run 3: 15.7 (SE +/- 0.00, N = 2)
Run 2: 15.7 (SE +/- 0.00, N = 2)
Run 1: 15.7 (SE +/- 0.00, N = 2)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
Run 1: 34.34 (SE +/- 0.35, N = 3)
Run 3: 34.38 (SE +/- 0.31, N = 3)
Run 2: 34.44 (SE +/- 0.34, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
Run 1: 121.69 (SE +/- 0.01, N = 3)
Run 3: 121.74 (SE +/- 0.05, N = 3)
Run 2: 121.83 (SE +/- 0.03, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 3: 2388.4 (SE +/- 12.46, N = 3)
Run 1: 2379.4 (SE +/- 12.12, N = 3)
Run 2: 2374.5 (SE +/- 16.90, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 4186.5 (SE +/- 4.56, N = 3)
Run 3: 4180.3 (SE +/- 1.88, N = 3)
Run 2: 4164.5 (SE +/- 19.69, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 332.6 (SE +/- 1.66, N = 3)
Run 2: 327.6 (SE +/- 2.23, N = 3)
Run 3: 327.5 (SE +/- 0.88, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 4312.4 (SE +/- 11.68, N = 3)
Run 2: 4308.9 (SE +/- 5.34, N = 3)
Run 3: 4307.7 (SE +/- 5.58, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 3: 34.6 (SE +/- 0.06, N = 3)
Run 2: 34.6 (SE +/- 0.07, N = 3)
Run 1: 34.5 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 2: 3983.1 (SE +/- 1.35, N = 3)
Run 3: 3981.9 (SE +/- 4.27, N = 3)
Run 1: 3969.0 (SE +/- 10.42, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 3: 1177.1 (SE +/- 6.67, N = 3)
Run 1: 1164.0 (SE +/- 5.14, N = 3)
Run 2: 1156.9 (SE +/- 10.72, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 3: 4442.7 (SE +/- 1.29, N = 3)
Run 2: 4437.7 (SE +/- 5.89, N = 3)
Run 1: 4433.4 (SE +/- 7.93, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 395.1 (SE +/- 4.78, N = 3)
Run 3: 387.2 (SE +/- 2.94, N = 3)
Run 2: 378.4 (SE +/- 2.00, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 4609.8 (SE +/- 2.70, N = 3)
Run 2: 4601.3 (SE +/- 1.73, N = 3)
Run 3: 4574.8 (SE +/- 22.35, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 1: 31.9 (SE +/- 0.12, N = 3)
Run 2: 31.8 (SE +/- 0.06, N = 3)
Run 3: 31.5 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
Run 2: 3963.9 (SE +/- 0.96, N = 3)
Run 1: 3957.5 (SE +/- 9.00, N = 3)
Run 3: 3939.4 (SE +/- 5.83, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma


Phoronix Test Suite v10.8.5