990KS March

Intel Core i9-9900KS testing with an ASUS PRIME Z390-A (1502 BIOS) and ASUS Intel UHD 630 CFL GT2 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104012-IB-990KSMARC41&sor&grr.

990KS March system details (runs 1, 2, 3)

Processor: Intel Core i9-9900KS @ 5.00GHz (8 Cores / 16 Threads)
Motherboard: ASUS PRIME Z390-A (1502 BIOS)
Chipset: Intel Cannon Lake PCH
Memory: 32GB
Disk: 240GB Corsair Force MP510
Graphics: ASUS Intel UHD 630 CFL GT2 3GB (1200MHz)
Audio: Realtek ALC1220
Monitor: G237HL
Network: Intel I219-V
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc8daily20201005-generic (x86_64) 20201004
Desktop: GNOME Shell 3.36.2
Display Server: X Server 1.20.8
OpenGL: 4.6 Mesa 20.2.6
OpenCL: OpenCL 2.1
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xcc - Thermald 1.9.1
Python Details: Python 2.7.18rc1 + Python 3.8.2
Security Details: itlb_multihit: KVM: Mitigation of VMX unsupported + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

990KS March results overview: one row per test (build-nodejs through systemd-boot-total) with one column for each of runs 1, 2 and 3. The per-test results are listed individually below.

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11: Seconds, Fewer Is Better
Run 2: 476.80 (SE +/- 0.34, N = 3)
Run 1: 477.10 (SE +/- 0.32, N = 3)
Run 3: 477.28 (SE +/- 0.19, N = 3)
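Each result in this export carries an "SE +/- x, N = n" annotation. As a rough illustration only, the sketch below assumes SE is the standard error of the mean over N trial runs, computed as the sample standard deviation divided by the square root of N. The exact formula used by the Phoronix Test Suite is not stated in this export, and the helper name `standard_error` and the trial values are hypothetical, not taken from the result file.

```python
from math import sqrt
from statistics import mean, stdev


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumption: this mirrors the "SE +/- x, N = n" annotation; the export
    itself does not state the formula.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return stdev(samples) / sqrt(len(samples))


# Hypothetical trial times in seconds (NOT taken from this result file).
trials = [10.0, 10.4, 10.9]
print(f"mean = {mean(trials):.2f}, "
      f"SE +/- {standard_error(trials):.2f}, N = {len(trials)}")
```

A small SE relative to the mean suggests the N trials of one run agreed closely with each other.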

GNU Radio

Test: Hilbert Transform

GNU Radio: MiB/s, More Is Better
Run 3: 620.2 (SE +/- 1.23, N = 3)
Run 2: 618.3 (SE +/- 2.62, N = 3)
Run 1: 618.0 (SE +/- 1.07, N = 3)
1. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

GNU Radio: MiB/s, More Is Better
Run 2: 837.7 (SE +/- 1.84, N = 3)
Run 3: 837.3 (SE +/- 1.55, N = 3)
Run 1: 836.7 (SE +/- 2.38, N = 3)
1. 3.8.1.0

GNU Radio

Test: IIR Filter

GNU Radio: MiB/s, More Is Better
Run 1: 666.1 (SE +/- 1.14, N = 3)
Run 3: 665.0 (SE +/- 0.81, N = 3)
Run 2: 664.9 (SE +/- 1.34, N = 3)
1. 3.8.1.0

GNU Radio

Test: FIR Filter

GNU Radio: MiB/s, More Is Better
Run 1: 822.3 (SE +/- 1.65, N = 3)
Run 2: 820.0 (SE +/- 3.57, N = 3)
Run 3: 818.3 (SE +/- 2.94, N = 3)
1. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

GNU Radio: MiB/s, More Is Better
Run 2: 3011.4 (SE +/- 2.10, N = 3)
Run 3: 3011.2 (SE +/- 2.72, N = 3)
Run 1: 3003.2 (SE +/- 3.36, N = 3)
1. 3.8.1.0

GNU Radio

Test: Five Back to Back FIR Filters

GNU Radio: MiB/s, More Is Better
Run 2: 1195.5 (SE +/- 9.72, N = 3)
Run 1: 1190.2 (SE +/- 12.45, N = 3)
Run 3: 1188.7 (SE +/- 15.25, N = 3)
1. 3.8.1.0

LuaRadio

Test: Complex Phase

LuaRadio 0.9.1: MiB/s, More Is Better
Run 1: 704.2 (SE +/- 5.92, N = 3)
Run 2: 700.2 (SE +/- 6.74, N = 3)
Run 3: 698.1 (SE +/- 11.13, N = 3)

LuaRadio

Test: Hilbert Transform

LuaRadio 0.9.1: MiB/s, More Is Better
Run 3: 87.5 (SE +/- 0.32, N = 3)
Run 1: 87.2 (SE +/- 0.03, N = 3)
Run 2: 87.1 (SE +/- 0.06, N = 3)

LuaRadio

Test: FM Deemphasis Filter

LuaRadio 0.9.1: MiB/s, More Is Better
Run 3: 489.5 (SE +/- 0.27, N = 3)
Run 2: 487.7 (SE +/- 1.98, N = 3)
Run 1: 486.8 (SE +/- 1.86, N = 3)

LuaRadio

Test: Five Back to Back FIR Filters

LuaRadio 0.9.1: MiB/s, More Is Better
Run 1: 1297.9 (SE +/- 2.64, N = 3)
Run 2: 1292.4 (SE +/- 3.21, N = 3)
Run 3: 1291.8 (SE +/- 4.98, N = 3)

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Run 1: 3.42 (SE +/- 0.02, N = 3)
Run 2: 3.41 (SE +/- 0.02, N = 3)
Run 3: 3.39 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17: GFLOPS, More Is Better
Run 1: 20.99 (SE +/- 0.04, N = 3)
Run 2: 20.85 (SE +/- 0.04, N = 3)
Run 3: 20.85 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Run 3: 0.12 (SE +/- 0.00, N = 3)
Run 2: 0.12 (SE +/- 0.00, N = 3)
Run 1: 0.12 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1: GMPbench Score, More Is Better
Run 1: 6250.3
Run 2: 6221.8
Run 3: 6217.6
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Run 1: 6.34 (SE +/- 0.01, N = 3)
Run 2: 6.32 (SE +/- 0.02, N = 3)
Run 3: 6.30 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11: Seconds, Fewer Is Better
Run 1: 121.69 (SE +/- 0.01, N = 3)
Run 3: 121.74 (SE +/- 0.05, N = 3)
Run 2: 121.83 (SE +/- 0.03, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.4: Seconds, Fewer Is Better
Run 2: 113.77 (SE +/- 0.14, N = 3)
Run 1: 113.89 (SE +/- 0.12, N = 3)
Run 3: 113.89 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Timed Erlang/OTP Compilation

Time To Compile

Timed Erlang/OTP Compilation 23.2: Seconds, Fewer Is Better
Run 3: 112.48 (SE +/- 0.31, N = 3)
Run 2: 112.56 (SE +/- 0.40, N = 3)
Run 1: 113.00 (SE +/- 0.14, N = 3)

OpenSCAD

Render: Pistol

OpenSCAD: Seconds, Fewer Is Better
Run 1: 100.77 (SE +/- 0.07, N = 3)
Run 3: 100.81 (SE +/- 0.09, N = 3)
Run 2: 100.98 (SE +/- 0.21, N = 3)
1. OpenSCAD version 2019.05

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Run 1: 6.76 (SE +/- 0.01, N = 3)
Run 3: 6.75 (SE +/- 0.01, N = 3)
Run 2: 6.73 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.10.20: Seconds, Fewer Is Better
Run 2: 94.57 (SE +/- 0.60, N = 3)
Run 1: 94.71 (SE +/- 0.78, N = 3)
Run 3: 94.77 (SE +/- 0.47, N = 3)

OpenSCAD

Render: Projector Mount Swivel

OpenSCAD: Seconds, Fewer Is Better
Run 1: 92.93 (SE +/- 0.21, N = 3)
Run 2: 93.91 (SE +/- 0.42, N = 3)
Run 3: 94.97 (SE +/- 0.08, N = 3)
1. OpenSCAD version 2019.05
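Because all three runs use the same system, a natural question is whether a gap between runs is larger than the reported run-to-run noise. The sketch below is one hedged way to check this, using the OpenSCAD "Projector Mount Swivel" and Timed Node.js Compilation figures from this result file. It rests on several assumptions that the export does not confirm: that each SE belongs to the run listed in the same position of the chart, that the runs are independent, and that a gap of more than two combined standard errors (the hypothetical `k=2.0`) counts as meaningful. The function names are illustrative only.

```python
from math import sqrt


def percent_spread(values):
    """Gap between the largest and smallest value, as a percent of the smallest."""
    lo, hi = min(values), max(values)
    return (hi - lo) / lo * 100.0


def differs_beyond_noise(a, se_a, b, se_b, k=2.0):
    """True when |a - b| exceeds k combined standard errors (k is an assumption)."""
    return abs(a - b) > k * sqrt(se_a ** 2 + se_b ** 2)


# run id -> (mean seconds, SE); the SE-to-run pairing is assumed, see the lead-in.
openscad = {1: (92.93, 0.21), 2: (93.91, 0.42), 3: (94.97, 0.08)}
nodejs = {2: (476.80, 0.34), 1: (477.10, 0.32), 3: (477.28, 0.19)}

for name, runs in (("OpenSCAD", openscad), ("Node.js", nodejs)):
    means = [m for m, _ in runs.values()]
    fastest, slowest = min(runs.values()), max(runs.values())
    print(name,
          f"spread {percent_spread(means):.2f}%",
          "beyond noise:", differs_beyond_noise(*fastest, *slowest))
```

Under these assumptions the OpenSCAD gap (about 2.2%) exceeds the threshold, while the Node.js gap (about 0.1%) does not.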

Sysbench

Test: CPU

Sysbench 1.0.20: Events Per Second, More Is Better
Run 2: 19382.71 (SE +/- 20.02, N = 3)
Run 1: 19380.34 (SE +/- 14.24, N = 3)
Run 3: 19367.30 (SE +/- 16.54, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0: Frames Per Second, More Is Better
Run 1: 6.85 (SE +/- 0.01, N = 3)
Run 3: 6.84 (SE +/- 0.00, N = 3)
Run 2: 6.84 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.2: FPS, More Is Better
Run 2: 164.40 (SE +/- 1.75, N = 3; MIN: 103.73 / MAX: 392.07)
Run 3: 163.76 (SE +/- 1.42, N = 3; MIN: 104.03 / MAX: 384.96)
Run 1: 160.91 (SE +/- 0.13, N = 3; MIN: 103.41 / MAX: 410.16)
1. (CC) gcc options: -pthread -lm

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17: GB/s, More Is Better
Run 3: 56.92 (SE +/- 0.00, N = 3)
Run 1: 56.91 (SE +/- 0.02, N = 3)
Run 2: 56.89 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Run 1: 3413.97 (SE +/- 2.52, N = 3; MIN: 3402.03)
Run 3: 3415.35 (SE +/- 0.66, N = 3; MIN: 3409.68)
Run 2: 3445.23 (SE +/- 26.13, N = 3; MIN: 3408.23)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Run 2: 3413.02 (SE +/- 0.58, N = 3; MIN: 3406.86)
Run 3: 3413.88 (SE +/- 0.89, N = 3; MIN: 3407.32)
Run 1: 3414.52 (SE +/- 3.64, N = 3; MIN: 3404.15)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Run 2: 3405.32 (SE +/- 3.80, N = 3; MIN: 3391.39)
Run 3: 3405.99 (SE +/- 4.35, N = 3; MIN: 3393.27)
Run 1: 3408.16 (SE +/- 5.97, N = 3; MIN: 3391.46)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17: GFLOPS, More Is Better
Run 2: 1761.07 (SE +/- 0.55, N = 3)
Run 1: 1760.70 (SE +/- 1.38, N = 3)
Run 3: 1760.61 (SE +/- 0.83, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Run 2: 1851.22 (SE +/- 1.24, N = 3; MIN: 1841.63)
Run 3: 1854.40 (SE +/- 1.53, N = 3; MIN: 1846.77)
Run 1: 1854.88 (SE +/- 3.26, N = 3; MIN: 1847.25)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Run 2: 1851.14 (SE +/- 1.72, N = 3; MIN: 1843.95)
Run 3: 1852.05 (SE +/- 0.20, N = 3; MIN: 1846.88)
Run 1: 1852.13 (SE +/- 0.88, N = 3; MIN: 1846.26)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Run 2: 1850.82 (SE +/- 0.16, N = 3; MIN: 1844.82)
Run 3: 1851.30 (SE +/- 1.14, N = 3; MIN: 1843.57)
Run 1: 1852.26 (SE +/- 0.58, N = 3; MIN: 1846.85)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3: ms, Fewer Is Better
Run 1: 29.45 (SE +/- 0.08, N = 3; MIN: 29.17 / MAX: 40.18)
Run 2: 29.67 (SE +/- 0.04, N = 3; MIN: 29.48 / MAX: 42.09)
Run 3: 29.80 (SE +/- 0.16, N = 3; MIN: 29.45 / MAX: 42.01)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.3: ms, Fewer Is Better
Run 1: 2.329 (SE +/- 0.007, N = 3; MIN: 2.26 / MAX: 4.96)
Run 2: 2.333 (SE +/- 0.006, N = 3; MIN: 2.27 / MAX: 3.55)
Run 3: 2.346 (SE +/- 0.010, N = 3; MIN: 2.28 / MAX: 3.63)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.3: ms, Fewer Is Better
Run 2: 2.655 (SE +/- 0.051, N = 3; MIN: 2.35 / MAX: 3.81)
Run 1: 2.672 (SE +/- 0.063, N = 3; MIN: 2.34 / MAX: 4.31)
Run 3: 2.683 (SE +/- 0.063, N = 3; MIN: 2.37 / MAX: 4.91)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3: ms, Fewer Is Better
Run 2: 25.25 (SE +/- 0.02, N = 3; MIN: 25.09 / MAX: 35.04)
Run 1: 25.31 (SE +/- 0.04, N = 3; MIN: 25.08 / MAX: 37.21)
Run 3: 25.41 (SE +/- 0.05, N = 3; MIN: 25.17 / MAX: 37.2)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3: ms, Fewer Is Better
Run 2: 4.882 (SE +/- 0.030, N = 3; MIN: 4.68 / MAX: 5.58)
Run 1: 4.899 (SE +/- 0.029, N = 3; MIN: 4.71 / MAX: 7.4)
Run 3: 4.943 (SE +/- 0.027, N = 3; MIN: 4.65 / MAX: 16.5)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.9.0: Seconds, Fewer Is Better
Run 3: 71.80 (SE +/- 0.26, N = 3)
Run 1: 72.07 (SE +/- 0.16, N = 3)
Run 2: 72.26 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

simdjson

Throughput Test: DistinctUserID

simdjson 0.8.2: GB/s, More Is Better
Run 2: 4.55 (SE +/- 0.01, N = 3)
Run 3: 4.54 (SE +/- 0.01, N = 3)
Run 1: 4.53 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.8.2: GB/s, More Is Better
Run 2: 4.38 (SE +/- 0.00, N = 3)
Run 1: 4.38 (SE +/- 0.00, N = 3)
Run 3: 4.37 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13: Seconds, Fewer Is Better
Run 1: 63.54 (SE +/- 0.03, N = 3)
Run 2: 63.56 (SE +/- 0.04, N = 3)
Run 3: 63.59 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.9.0: Seconds, Fewer Is Better
Run 2: 62.39 (SE +/- 0.11, N = 3)
Run 3: 62.40 (SE +/- 0.14, N = 3)
Run 1: 62.67 (SE +/- 0.15, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

simdjson

Throughput Test: Kostya

simdjson 0.8.2: GB/s, More Is Better
Run 2: 3.10 (SE +/- 0.00, N = 3)
Run 1: 3.10 (SE +/- 0.00, N = 3)
Run 3: 3.09 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -pthread

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 21.0: Seconds, Fewer Is Better
Run 1: 56.42 (SE +/- 0.02, N = 3)
Run 2: 56.47 (SE +/- 0.04, N = 3)
Run 3: 56.48 (SE +/- 0.04, N = 3)

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Run 3: 0.38 (SE +/- 0.00, N = 3)
Run 1: 0.38 (SE +/- 0.00, N = 3)
Run 2: 0.37 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 2: 3963.9 (SE +/- 0.96, N = 3)
Run 1: 3957.5 (SE +/- 9.00, N = 3)
Run 3: 3939.4 (SE +/- 5.83, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 1: 31.9 (SE +/- 0.12, N = 3)
Run 2: 31.8 (SE +/- 0.06, N = 3)
Run 3: 31.5 (SE +/- 0.07, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

simdjson

Throughput Test: LargeRandom

simdjson 0.8.2: GB/s, More Is Better
Run 3: 1.07 (SE +/- 0.00, N = 3)
Run 2: 1.07 (SE +/- 0.00, N = 3)
Run 1: 1.07 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 2: 3983.1 (SE +/- 1.35, N = 3)
Run 3: 3981.9 (SE +/- 4.27, N = 3)
Run 1: 3969.0 (SE +/- 10.42, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 3: 34.6 (SE +/- 0.06, N = 3)
Run 2: 34.6 (SE +/- 0.07, N = 3)
Run 1: 34.5 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Run 1: 13.16 (SE +/- 0.02, N = 3)
Run 3: 13.14 (SE +/- 0.01, N = 3)
Run 2: 13.12 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Run 2: 44774 (SE +/- 25.27, N = 3)
Run 3: 44805 (SE +/- 36.98, N = 3)
Run 1: 44854 (SE +/- 27.00, N = 3)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: C++ Tasks

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Run 3: 44729 (SE +/- 38.73, N = 3)
Run 1: 44756 (SE +/- 16.42, N = 3)
Run 2: 44842 (SE +/- 118.82, N = 3)
1. (CXX) g++ options: -O3 -lpthread

toyBrot Fractal Generator

Implementation: TBB

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Run 1: 44058 (SE +/- 59.38, N = 3)
Run 3: 44085 (SE +/- 82.93, N = 3)
Run 2: 44135 (SE +/- 72.84, N = 3)
1. (CXX) g++ options: -O3 -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17: GHash/s, More Is Better
Run 2: 0.3861 (SE +/- 0.0000, N = 3)
Run 1: 0.3861 (SE +/- 0.0000, N = 3)
Run 3: 0.3860 (SE +/- 0.0000, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

toyBrot Fractal Generator

Implementation: OpenMP

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Run 3: 43838 (SE +/- 19.34, N = 3)
Run 2: 43854 (SE +/- 16.82, N = 3)
Run 1: 43866 (SE +/- 13.86, N = 3)
1. (CXX) g++ options: -O3 -lpthread

OpenSCAD

Render: Mini-ITX Case

OpenSCAD: Seconds, Fewer Is Better
Run 1: 41.91 (SE +/- 0.07, N = 3)
Run 2: 42.05 (SE +/- 0.12, N = 3)
Run 3: 42.08 (SE +/- 0.02, N = 3)
1. OpenSCAD version 2019.05

Stockfish

Total Time

Stockfish 13: Nodes Per Second, More Is Better
Run 3: 23832208 (SE +/- 377829.22, N = 3)
Run 1: 23777878 (SE +/- 294389.71, N = 4)
Run 2: 23501591 (SE +/- 299604.32, N = 3)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

srsLTE

Test: PHY_DL_Test

srsLTE 20.10.1: UE Mb/s, More Is Better
Run 3: 113.4 (SE +/- 0.12, N = 3)
Run 1: 112.7 (SE +/- 0.09, N = 3)
Run 2: 112.2 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

srsLTE

Test: PHY_DL_Test

srsLTE 20.10.1: eNb Mb/s, More Is Better
Run 3: 295.0 (SE +/- 0.84, N = 3)
Run 1: 293.2 (SE +/- 0.19, N = 3)
Run 2: 291.8 (SE +/- 0.73, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Run 1: 19.68 (SE +/- 0.17, N = 3)
Run 3: 19.21 (SE +/- 0.01, N = 3)
Run 2: 19.11 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.9.0: Seconds, Fewer Is Better
Run 1: 37.25 (SE +/- 0.06, N = 3)
Run 2: 37.30 (SE +/- 0.11, N = 3)
Run 3: 37.33 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

srsLTE

Test: OFDM_Test

srsLTE 20.10.1: Samples / Second, More Is Better
Run 3: 128500000 (SE +/- 665832.81, N = 3)
Run 2: 128466667 (SE +/- 821245.67, N = 3)
Run 1: 126700000 (SE +/- 416333.20, N = 3)
1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 1: 4312.4 (SE +/- 11.68, N = 3)
Run 2: 4308.9 (SE +/- 5.34, N = 3)
Run 3: 4307.7 (SE +/- 5.58, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 1: 332.6 (SE +/- 1.66, N = 3)
Run 2: 327.6 (SE +/- 2.23, N = 3)
Run 3: 327.5 (SE +/- 0.88, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 1: 4609.8 (SE +/- 2.70, N = 3)
Run 2: 4601.3 (SE +/- 1.73, N = 3)
Run 3: 4574.8 (SE +/- 22.35, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Run 1: 395.1 (SE +/- 4.78, N = 3)
Run 3: 387.2 (SE +/- 2.94, N = 3)
Run 2: 378.4 (SE +/- 2.00, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11: Seconds, Fewer Is Better
Run 1: 34.34 (SE +/- 0.35, N = 3)
Run 3: 34.38 (SE +/- 0.31, N = 3)
Run 2: 34.44 (SE +/- 0.34, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
3: 4442.7 (SE +/- 1.29, N = 3)
2: 4437.7 (SE +/- 5.89, N = 3)
1: 4433.4 (SE +/- 7.93, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
3: 1177.1 (SE +/- 6.67, N = 3)
1: 1164.0 (SE +/- 5.14, N = 3)
2: 1156.9 (SE +/- 10.72, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
3: 15.7 (SE +/- 0.00, N = 2)
2: 15.7 (SE +/- 0.00, N = 2)
1: 15.7 (SE +/- 0.00, N = 2)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
3: 15.8 (SE +/- 0.03, N = 3)
2: 15.8 (SE +/- 0.03, N = 3)
1: 15.8 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
2: 16.5 (SE +/- 0.03, N = 3)
1: 16.5 (SE +/- 0.00, N = 3)
3: 16.4 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
3: 18.9 (SE +/- 0.00, N = 3)
2: 18.9 (SE +/- 0.00, N = 3)
1: 18.9 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 34.8 (SE +/- 0.03, N = 3)
1: 34.8 (SE +/- 0.00, N = 3)
2: 34.7 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

ViennaCL 1.7.1 (GB/s, More Is Better)
2: 38.1
3: 37.9
1: 37.9
SE +/- 0.00, N = 3 (the export lists a single SE value for this test)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 38
2: 38
1: 38
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 35.4 (SE +/- 0.03, N = 3)
2: 35.3 (SE +/- 0.00, N = 3)
1: 35.3 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 34.2 (SE +/- 0.03, N = 3)
2: 34.1 (SE +/- 0.03, N = 3)
1: 34.0 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
1: 37.7 (SE +/- 0.00, N = 3)
3: 37.6 (SE +/- 0.03, N = 3)
2: 37.6 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 34.6 (SE +/- 0.00, N = 3)
2: 34.6 (SE +/- 0.00, N = 3)
1: 34.6 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 33.3 (SE +/- 0.06, N = 3)
2: 33.3 (SE +/- 0.10, N = 3)
1: 32.9 (SE +/- 0.06, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 3 - Decompression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
1: 4186.5 (SE +/- 4.56, N = 3)
3: 4180.3 (SE +/- 1.88, N = 3)
2: 4164.5 (SE +/- 19.69, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.4.9 (MB/s, More Is Better)
3: 2388.4 (SE +/- 12.46, N = 3)
1: 2379.4 (SE +/- 12.12, N = 3)
2: 2374.5 (SE +/- 16.90, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13 (Seconds, Fewer Is Better)
2: 33.37 (SE +/- 0.10, N = 3)
3: 33.39 (SE +/- 0.12, N = 3)
1: 33.46 (SE +/- 0.02, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Botan

Test: AES-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
3: 4812.42 (SE +/- 0.04, N = 3)
2: 4811.18 (SE +/- 0.85, N = 3)
1: 4806.10 (SE +/- 4.51, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3 (MiB/s, More Is Better)
1: 4815.97 (SE +/- 0.17, N = 3)
3: 4815.62 (SE +/- 0.22, N = 3)
2: 4814.83 (SE +/- 0.54, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
3: 48.78 (SE +/- 0.07, N = 3)
2: 48.72 (SE +/- 0.10, N = 3)
1: 47.78 (SE +/- 0.53, N = 15)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
3: 908.47 (SE +/- 0.54, N = 3)
1: 908.31 (SE +/- 0.56, N = 3)
2: 907.95 (SE +/- 0.82, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

Botan 2.17.3 (MiB/s, More Is Better)
1: 916.11 (SE +/- 0.12, N = 3)
3: 914.45 (SE +/- 1.80, N = 3)
2: 913.03 (SE +/- 3.02, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
3: 539.34 (SE +/- 0.03, N = 3)
1: 539.24 (SE +/- 0.04, N = 3)
2: 539.02 (SE +/- 0.37, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3 (MiB/s, More Is Better)
3: 543.95 (SE +/- 0.10, N = 3)
1: 543.60 (SE +/- 0.05, N = 3)
2: 543.39 (SE +/- 0.40, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
1: 437.30 (SE +/- 0.18, N = 3)
2: 437.29 (SE +/- 0.15, N = 3)
3: 436.74 (SE +/- 0.95, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

Botan 2.17.3 (MiB/s, More Is Better)
2: 436.12 (SE +/- 0.12, N = 3)
1: 436.08 (SE +/- 0.16, N = 3)
3: 434.66 (SE +/- 1.83, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
3: 172.15 (SE +/- 0.03, N = 3)
1: 172.09 (SE +/- 0.01, N = 3)
2: 172.08 (SE +/- 0.01, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3 (MiB/s, More Is Better)
3: 171.96 (SE +/- 0.00, N = 3)
1: 171.95 (SE +/- 0.01, N = 3)
2: 171.95 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
3: 107.88 (SE +/- 0.04, N = 3)
1: 107.86 (SE +/- 0.03, N = 3)
2: 107.76 (SE +/- 0.07, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

Botan 2.17.3 (MiB/s, More Is Better)
3: 112.99 (SE +/- 0.05, N = 3)
1: 112.98 (SE +/- 0.05, N = 3)
2: 112.94 (SE +/- 0.05, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
3: 25.07 (SE +/- 0.22, N = 3)
1: 25.06 (SE +/- 0.24, N = 3)
2: 24.95 (SE +/- 0.29, N = 5)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
3: 35.5 (SE +/- 0.03, N = 3)
2: 35.5 (SE +/- 0.03, N = 3)
1: 35.5 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
3: 36.5 (SE +/- 0.00, N = 3)
1: 36.5 (SE +/- 0.03, N = 3)
2: 36.4 (SE +/- 0.07, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
2: 34.2 (SE +/- 0.07, N = 3)
3: 34.1 (SE +/- 0.03, N = 3)
1: 34.1 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1 (GFLOPs/s, More Is Better)
3: 34.8 (SE +/- 0.03, N = 3)
2: 34.8 (SE +/- 0.00, N = 3)
1: 34.8 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 46.5 (SE +/- 0.00, N = 3)
2: 46.5 (SE +/- 0.00, N = 3)
1: 46.5 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1 (GB/s, More Is Better)
2: 45.6 (SE +/- 0.03, N = 3)
3: 45.5 (SE +/- 0.03, N = 3)
1: 45.5 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 44.1 (SE +/- 0.00, N = 3)
2: 44.1 (SE +/- 0.00, N = 3)
1: 44.1 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 39.3 (SE +/- 0.00, N = 3)
2: 39.3 (SE +/- 0.00, N = 3)
1: 39.3 (SE +/- 0.00, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 26.2 (SE +/- 0.00, N = 3)
2: 26.2 (SE +/- 0.00, N = 3)
1: 26.2 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 47.9 (SE +/- 0.03, N = 3)
2: 47.9 (SE +/- 0.03, N = 3)
1: 47.9 (SE +/- 0.06, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
2: 42.9 (SE +/- 0.07, N = 3)
3: 42.8 (SE +/- 0.09, N = 3)
1: 42.8 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
3: 28.3 (SE +/- 0.03, N = 3)
2: 28.2 (SE +/- 0.03, N = 3)
1: 28.2 (SE +/- 0.03, N = 3)
(CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Basis Universal

Settings: ETC1S

Basis Universal 1.13 (Seconds, Fewer Is Better)
3: 23.37 (SE +/- 0.05, N = 3)
2: 23.39 (SE +/- 0.08, N = 3)
1: 23.46 (SE +/- 0.03, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
1: 245.92 (SE +/- 0.06, N = 3)
2: 245.89 (SE +/- 0.06, N = 3)
3: 245.78 (SE +/- 0.06, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
1: 8.83898 (SE +/- 0.00652, N = 3; MIN: 4.68)
2: 8.86501 (SE +/- 0.01330, N = 3; MIN: 4.67)
3: 8.87618 (SE +/- 0.00592, N = 3; MIN: 4.77)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
2: 2.14385 (SE +/- 0.00224, N = 3; MIN: 1.97)
3: 2.14452 (SE +/- 0.00152, N = 3; MIN: 1.97)
1: 2.14490 (SE +/- 0.00353, N = 3; MIN: 1.94)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.2 (FPS, More Is Better)
1: 185.13 (SE +/- 0.14, N = 3; MIN: 174.56 / MAX: 210.52)
2: 184.25 (SE +/- 0.13, N = 3; MIN: 173.91 / MAX: 209.75)
3: 184.16 (SE +/- 0.19, N = 3; MIN: 173.76 / MAX: 209.16)
(CC) gcc options: -pthread -lm

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
2: 251576667 (SE +/- 375470.08, N = 3)
3: 251473333 (SE +/- 321679.62, N = 3)
1: 250856667 (SE +/- 265476.51, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
3: 130663333 (SE +/- 35276.68, N = 3)
1: 130183333 (SE +/- 40551.75, N = 3)
2: 130046667 (SE +/- 367891.89, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
2: 67632333 (SE +/- 1763.83, N = 3)
1: 67629667 (SE +/- 881.92, N = 3)
3: 67599667 (SE +/- 27834.83, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
1: 500333333 (SE +/- 324054.18, N = 3)
2: 500263333 (SE +/- 188355.46, N = 3)
3: 500016667 (SE +/- 199360.09, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
3: 453086667 (SE +/- 2209451.92, N = 3)
1: 452713333 (SE +/- 1523508.38, N = 3)
2: 452073333 (SE +/- 514274.03, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.4 (Seconds, Fewer Is Better)
2: 14.91 (SE +/- 0.02, N = 3)
3: 14.91 (SE +/- 0.02, N = 3)
1: 14.92 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -flto -pthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
1: 38.68 (SE +/- 0.05, N = 3)
2: 37.94 (SE +/- 0.08, N = 3)
3: 37.12 (SE +/- 0.51, N = 4)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenSCAD

Render: Retro Car

OpenSCAD (Seconds, Fewer Is Better)
1: 17.26 (SE +/- 0.02, N = 3)
2: 17.35 (SE +/- 0.10, N = 3)
3: 17.37 (SE +/- 0.02, N = 3)
OpenSCAD version 2019.05

OpenSCAD

Render: Leonardo Phone Case Slim

OpenSCAD (Seconds, Fewer Is Better)
2: 16.94 (SE +/- 0.02, N = 3)
1: 16.98 (SE +/- 0.01, N = 3)
3: 17.05 (SE +/- 0.01, N = 3)
OpenSCAD version 2019.05

dav1d

Video Input: Chimera 1080p

dav1d 0.8.2 (FPS, More Is Better)
1: 733.64 (SE +/- 1.00, N = 3; MIN: 538.11 / MAX: 1144.18)
3: 732.83 (SE +/- 0.90, N = 3; MIN: 537.85 / MAX: 1141.98)
2: 731.76 (SE +/- 2.01, N = 3; MIN: 538.01 / MAX: 1138.02)
(CC) gcc options: -pthread -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
3: 3.56577 (SE +/- 0.00251, N = 3; MIN: 3.26)
2: 3.57455 (SE +/- 0.00753, N = 3; MIN: 3.24)
1: 3.57740 (SE +/- 0.01205, N = 3; MIN: 3.23)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
3: 1.66550 (SE +/- 0.00123, N = 3; MIN: 1.51)
2: 1.66579 (SE +/- 0.00242, N = 3; MIN: 1.51)
1: 1.66747 (SE +/- 0.00165, N = 3; MIN: 1.51)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
1: 12.95 (SE +/- 0.03, N = 3)
3: 12.96 (SE +/- 0.01, N = 3)
2: 13.01 (SE +/- 0.04, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
3: 3.14943 (SE +/- 0.00951, N = 3; MIN: 3.09)
2: 3.15157 (SE +/- 0.00442, N = 3; MIN: 3.1)
1: 3.15433 (SE +/- 0.00701, N = 3; MIN: 3.1)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
2: 3.43359 (SE +/- 0.00274, N = 3; MIN: 3.17)
3: 3.43914 (SE +/- 0.00188, N = 3; MIN: 3.18)
1: 3.44003 (SE +/- 0.00410, N = 3; MIN: 3.15)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
1: 36.79 (SE +/- 0.05, N = 3)
3: 36.76 (SE +/- 0.03, N = 3)
2: 36.73 (SE +/- 0.07, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
1: 124.39 (SE +/- 0.42, N = 3)
2: 122.64 (SE +/- 1.89, N = 3)
3: 121.96 (SE +/- 1.32, N = 12)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
2: 8.14695 (SE +/- 0.00679, N = 3; MIN: 8)
3: 8.16788 (SE +/- 0.01559, N = 3; MIN: 8.03)
1: 8.26103 (SE +/- 0.00549, N = 3; MIN: 8.14)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
3: 1.82340 (SE +/- 0.00185, N = 3; MIN: 1.78)
2: 1.82910 (SE +/- 0.00543, N = 3; MIN: 1.78)
1: 1.83215 (SE +/- 0.00346, N = 3; MIN: 1.79)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.13 (Seconds, Fewer Is Better)
3: 7.089 (SE +/- 0.005, N = 3)
2: 7.091 (SE +/- 0.003, N = 3)
1: 7.172 (SE +/- 0.000, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
1: 141.48 (SE +/- 1.63, N = 6)
2: 140.39 (SE +/- 0.46, N = 3)
3: 139.82 (SE +/- 2.02, N = 4)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
1: 14.28 (SE +/- 0.02, N = 3; MIN: 14.2)
3: 14.29 (SE +/- 0.02, N = 3; MIN: 14.2)
2: 14.30 (SE +/- 0.03, N = 3; MIN: 14.2)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
2: 14.00 (SE +/- 0.01, N = 3; MIN: 13.9)
3: 14.01 (SE +/- 0.02, N = 3; MIN: 13.88)
1: 14.03 (SE +/- 0.03, N = 3; MIN: 13.92)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GFLOPS, More Is Better)
2: 15.38 (SE +/- 0.01, N = 3)
1: 15.37 (SE +/- 0.01, N = 3)
3: 15.36 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
1: 106.19 (SE +/- 0.13, N = 3)
2: 105.90 (SE +/- 0.06, N = 3)
3: 105.78 (SE +/- 0.10, N = 3)
(CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.2 (FPS, More Is Better)
2: 649.67 (SE +/- 0.67, N = 3; MIN: 588.3 / MAX: 717.46)
1: 648.26 (SE +/- 1.42, N = 3; MIN: 589.55 / MAX: 721.08)
3: 647.63 (SE +/- 0.54, N = 3; MIN: 582.84 / MAX: 717.06)
(CC) gcc options: -pthread -lm

ASTC Encoder

Preset: Medium

ASTC Encoder 2.4 (Seconds, Fewer Is Better)
3: 5.7737 (SE +/- 0.0062, N = 3)
2: 5.7757 (SE +/- 0.0051, N = 3)
1: 5.7949 (SE +/- 0.0231, N = 3)
(CXX) g++ options: -O3 -flto -pthread

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
2: 5.771 (SE +/- 0.013, N = 3)
1: 5.778 (SE +/- 0.013, N = 3)
3: 5.787 (SE +/- 0.012, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
1: 136.76 (SE +/- 0.26, N = 3)
2: 136.22 (SE +/- 0.13, N = 3)
3: 136.05 (SE +/- 0.35, N = 3)
(CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
1: 173.31 (SE +/- 0.19, N = 3)
2: 172.24 (SE +/- 0.43, N = 3)
3: 172.10 (SE +/- 0.26, N = 3)
(CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
1: 176.29 (SE +/- 0.31, N = 3)
2: 175.27 (SE +/- 0.47, N = 3)
3: 175.06 (SE +/- 0.09, N = 3)
(CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Sysbench

Test: RAM / Memory

Sysbench 1.0.20 (MiB/sec, More Is Better)
2: 28491.74 (SE +/- 138.44, N = 3)
1: 28391.47 (SE +/- 98.12, N = 3)
3: 28338.59 (SE +/- 41.95, N = 3)
(CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

libavif avifenc

Encoder Speed: 10

libavif avifenc 0.9.0 (Seconds, Fewer Is Better)
1: 3.154 (SE +/- 0.004, N = 3)
2: 3.160 (SE +/- 0.010, N = 3)
3: 3.170 (SE +/- 0.012, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
3: 6.61557 (SE +/- 0.00264, N = 3; MIN: 5.99)
1: 6.63993 (SE +/- 0.00984, N = 3; MIN: 6)
2: 6.64659 (SE +/- 0.01698, N = 3; MIN: 6)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
2: 3.93198 (SE +/- 0.01212, N = 3; MIN: 3.64)
3: 3.93216 (SE +/- 0.01771, N = 3; MIN: 3.62)
1: 3.97706 (SE +/- 0.02149, N = 3; MIN: 3.65)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
1: 230.27 (SE +/- 0.11, N = 3)
2: 229.42 (SE +/- 0.19, N = 3)
3: 229.36 (SE +/- 0.43, N = 3)
(CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
1: 15.61 (SE +/- 0.03, N = 3)
3: 15.25 (SE +/- 0.03, N = 3)
2: 15.23 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
3: 29.15 (SE +/- 0.20, N = 3)
2: 29.05 (SE +/- 0.10, N = 3)
1: 29.05 (SE +/- 0.22, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17 (GB/s, More Is Better)
1: 29.21 (SE +/- 0.10, N = 3)
3: 29.18 (SE +/- 0.16, N = 3)
2: 29.16 (SE +/- 0.05, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Systemd Total Boot Time

Test: Userspace

Systemd Total Boot Time (ms, Fewer Is Better)
1: 22109
2: 22109
3: 22109

Systemd Total Boot Time

Test: Firmware

Systemd Total Boot Time (ms, Fewer Is Better)
1: 15679
2: 15679
3: 15679

Systemd Total Boot Time

Test: Loader

Systemd Total Boot Time (ms, Fewer Is Better)
1: 3555
2: 3555
3: 3555

Systemd Total Boot Time

Test: Kernel

Systemd Total Boot Time (ms, Fewer Is Better)
1: 1941
2: 1941
3: 1941

Systemd Total Boot Time

Test: Total

Systemd Total Boot Time (ms, Fewer Is Better)
1: 24050
2: 24050
3: 24050


Phoronix Test Suite v10.8.5