EPYC 7702 April 2021

AMD EPYC 7702 64-Core testing with a ASRockRack EPYCD8 (P2.40 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104043-IB-EPYC7702A33&grs&sor.

EPYC 7702 April 2021ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen Resolution123AMD EPYC 7702 64-Core @ 2.00GHz (64 Cores / 128 Threads)ASRockRack EPYCD8 (P2.40 BIOS)AMD Starship/Matisse126GB3841GB Micron_9300_MTFDHAL3T8TDPASPEEDVE2282 x Intel I350Ubuntu 20.045.9.0-050900rc6daily20200921-generic (x86_64) 20200920GNOME Shell 3.36.4X Server 1.20.8GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034 Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7702 April 2021svt-vp9: VMAF Optimized - Bosphorus 1080pbuild-linux-kernel: Time To Compileaom-av1: Speed 8 Realtime - Bosphorus 1080pincompact3d: input.i3d 129 Cells Per Directionviennacl: CPU BLAS - dGEMV-Tcompress-zstd: 8 - Compression Speedtoybrot: TBBonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUaom-av1: Speed 6 Two-Pass - Bosphorus 4Konednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUviennacl: CPU BLAS - dAXPYgnuradio: Five Back to Back FIR Filtersaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080ponednn: Recurrent Neural Network Training - u8s8f32 - CPUgnuradio: FM Deemphasis Filteronednn: IP Shapes 3D - u8s8f32 - CPUviennacl: CPU BLAS - dGEMM-NNcompress-zstd: 19 - Compression Speedviennacl: CPU BLAS - dCOPYavifenc: 10onednn: Deconvolution Batch shapes_1d - f32 - CPUgnuradio: Signal Source (Cosine)stockfish: Total Timecompress-zstd: 3, Long Mode - Compression Speedviennacl: CPU BLAS - sCOPYsvt-hevc: 7 - Bosphorus 1080pviennacl: CPU BLAS - sAXPYincompact3d: input.i3d 193 Cells Per Directionaom-av1: Speed 6 Realtime - Bosphorus 4Ktoybrot: OpenMPaom-av1: Speed 6 Two-Pass - Bosphorus 1080ponednn: IP Shapes 1D - u8s8f32 - CPUaom-av1: Speed 9 Realtime - Bosphorus 4Ktoybrot: C++ Threadscompress-zstd: 19, Long Mode - Compression Speedonednn: IP Shapes 1D - f32 - CPUviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMM-NTblender: BMW27 - CPU-Onlycompress-zstd: 3 - Compression Speedavifenc: 10, Losslessgnuradio: IIR Filtersvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psimdjson: PartialTweetssimdjson: DistinctUserIDblender: Classroom - CPU-Onlyonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUblender: Pabellon Barcelona - CPU-Onlyviennacl: CPU BLAS - dGEMM-TNgnuradio: FIR Filtersvt-hevc: 1 - Bosphorus 1080paom-av1: Speed 4 Two-Pass - Bosphorus 1080psimdjson: Kostyasvt-hevc: 10 - Bosphorus 1080pavifenc: 6viennacl: CPU BLAS - dGEMM-TTonednn: Recurrent Neural Network Training - f32 - CPUliquid-dsp: 64 - 256 - 57toybrot: C++ Tasksblender: Fishy Cat - CPU-Onlyavifenc: 6, Losslessaom-av1: Speed 6 Realtime - Bosphorus 1080psysbench: RAM / Memoryonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUcompress-zstd: 19 - Decompression Speedbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptblender: Barbershop - CPU-Onlyluaradio: Complex Phaseliquid-dsp: 128 - 256 - 57build-mesa: Time To Compilegnuradio: Hilbert Transformcompress-zstd: 3, Long Mode - Decompression Speedaom-av1: Speed 4 Two-Pass - Bosphorus 4Konednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUcompress-zstd: 8 - Decompression Speedbuild-erlang: Time To Compileonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUliquid-dsp: 1 - 256 - 57luaradio: Five Back to Back FIR Filtersbuild-nodejs: Time To Compilecompress-zstd: 8, Long Mode - Decompression Speedincompact3d: X3D-benchmarking input.i3dluaradio: FM Deemphasis Filterliquid-dsp: 16 - 256 - 57avifenc: 0liquid-dsp: 8 - 256 - 57onednn: IP Shapes 3D - f32 - CPUliquid-dsp: 32 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 4 - 256 - 57botan: AES-256 - Decryptbotan: Blowfish - Decryptbotan: AES-256gmpbench: Total Timebotan: Blowfishavifenc: 2onednn: Recurrent Neural Network Inference - f32 - CPUbotan: Twofishonednn: Deconvolution Batch shapes_3d - f32 - CPUcompress-zstd: 19, Long Mode - Decompression Speedbotan: Twofish - Decryptsysbench: CPUbotan: CAST-256 - Decryptbotan: KASUMI - Decryptbotan: KASUMIbotan: CAST-256onednn: Convolution Batch Shapes Auto - f32 - CPUluaradio: Hilbert Transformsimdjson: LargeRandviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - sDOTcompress-zstd: 8, Long Mode - Compression Speed123238.1433.5558.375.54909182636251074710.9118026.720.4338381320331.722.2267.34279.12081.537440.9988019383.513903.7121.719932913.8134641813410.5757277.9168725.883249311.89784816.221.4831425.94720741.51.2798589690.538.745279.46.767486.7362.923.553.59100.031.224942082.99116.5295.1534.534.415.792.12462.6110.44292.92089.27271570000075995529.43818.246128.19730.7841.149652559.5633.174632.656142.58515.5312970000023.304346.52892.43.82734.1732763.5157.6793.45459654000451.2128.1842986.2689.604065339.293775000053.3064756500005.0430716906000001189200002379700004560.395367.424560.3764542369.32128.412732.05302.4382.670162552.2302.23496671.09120.06675.09177.653120.0551.0090182.70.6630.9461444.1361.0031.15554.475.961010776202441.172440.8727696.980.4461411283323.022.1865.79273.592085.38734.10.98032391.482.113673.7541.747742868.2133169802404.5764276.0767925.552420911.74794116.091.4704825.71727841.11.2754490489.738.885243.66.796489.7361.433.533.61100.581.222242085.08117.1494.6537.334.595.772.13460.8610.46892.92098.232718266667757954.8029.50018.226104.81732.8101.151662564.8635.007631.089142.94513.8312423333323.232346.12888.03.82732.2942770.2157.4793.4614159569333452.0128.4682992.6690.480672339.893677666753.2164761166675.0432016930666671190366672382366674559.088367.7974558.6264546.4369.21928.391732.625302.4922.670362553.7302.33796717.84120.05875.10177.661120.0651.0089982.70.6628.5507444.0360.7431.16254.325.685589636532386.171190.8938306.790.4503101327322.621.6766.10277.872122.66730.10.98680692.582.313833.7731.735682879.4132630158408.0753279.7468825.611750311.74792116.041.4670725.99725841.21.2876490489.839.045250.36.809488.3363.643.533.59100.081.218412094.12116.6694.7535.134.455.762.13462.9110.48892.52095.262727266667761155.0229.55618.176121.17733.5121.153912555.8635.338630.505142.46514.5311940000023.232347.12884.63.83732.4832768.5157.3023.4531459520333452.2128.3782989.6691.061747339.993614666753.2874764200005.0510316912666671190866672380533334555.311367.8044555.6254542.4368.99328.414732.382302.2852.668772553.0302.18096694.41120.09675.10377.663120.0551.0089482.70.6627.3479473.1OpenBenchmarking.org

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p23180160240320400SE +/- 0.55, N = 3SE +/- 0.35, N = 3361.00360.74238.141. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To Compile231816243240SE +/- 0.28, N = 10SE +/- 0.28, N = 1031.1631.1633.55

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p1231326395265SE +/- 0.25, N = 3SE +/- 0.38, N = 358.3754.4754.321. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction1321.34122.68244.02365.36486.706SE +/- 0.08476458, N = 3SE +/- 0.08892473, N = 35.549091825.685589635.961010771. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-T312140280420560700SE +/- 7.86, N = 3SE +/- 14.00, N = 36536366201. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression Speed1235001000150020002500SE +/- 26.18, N = 3SE +/- 11.70, N = 32510.02441.12386.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBB32116003200480064008000SE +/- 12.41, N = 3SE +/- 35.23, N = 37119724474711. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU2310.20520.41040.61560.82081.026SE +/- 0.011992, N = 4SE +/- 0.007517, N = 30.8727690.8938300.911802MIN: 0.8MIN: 0.81MIN: 0.811. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K231246810SE +/- 0.01, N = 3SE +/- 0.04, N = 36.986.796.721. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.10130.20260.30390.40520.5065SE +/- 0.001158, N = 3SE +/- 0.001517, N = 30.4338380.4461410.450310MIN: 0.39MIN: 0.38MIN: 0.391. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPY31230060090012001500SE +/- 3.33, N = 3SE +/- 41.77, N = 31327132012831. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filters12370140210280350SE +/- 3.91, N = 3SE +/- 3.89, N = 3331.7323.0322.61. 3.8.1.0

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K123510152025SE +/- 0.20, N = 3SE +/- 0.30, N = 322.2222.1821.671. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p1321530456075SE +/- 0.03, N = 3SE +/- 0.53, N = 367.3466.1065.791. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p13260120180240300SE +/- 3.78, N = 3SE +/- 2.47, N = 3279.10277.87273.591. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1235001000150020002500SE +/- 3.09, N = 3SE +/- 10.59, N = 32081.532085.382122.66MIN: 2065.05MIN: 2066.13MIN: 2083.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filter123160320480640800SE +/- 2.41, N = 3SE +/- 5.82, N = 3744.0734.1730.11. 3.8.1.0

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU2310.22470.44940.67410.89881.1235SE +/- 0.001130, N = 3SE +/- 0.001469, N = 30.9803230.9868060.998801MIN: 0.94MIN: 0.9MIN: 0.941. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NN13220406080100SE +/- 0.17, N = 3SE +/- 0.35, N = 393.092.591.41. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression Speed13220406080100SE +/- 1.26, N = 3SE +/- 0.44, N = 383.582.382.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPY13230060090012001500SE +/- 8.82, N = 3SE +/- 14.53, N = 31390138313671. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 101230.84891.69782.54673.39564.2445SE +/- 0.007, N = 3SE +/- 0.001, N = 33.7123.7543.7731. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1320.39320.78641.17961.57281.966SE +/- 0.00100, N = 3SE +/- 0.00765, N = 31.719931.735681.74774MIN: 1.64MIN: 1.65MIN: 1.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)1326001200180024003000SE +/- 27.86, N = 3SE +/- 4.36, N = 32913.82879.42868.21. 3.8.1.0

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time12330M60M90M120M150MSE +/- 1363379.57, N = 8SE +/- 1858448.97, N = 41346418131331698021326301581. (CXX) g++ options: -fprofile-use -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression Speed13290180270360450SE +/- 3.33, N = 3SE +/- 6.05, N = 3410.5408.0404.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPY213160320480640800SE +/- 11.85, N = 3SE +/- 5.84, N = 37647577531. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p31260120180240300SE +/- 1.54, N = 3SE +/- 0.37, N = 3279.74277.91276.071. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPY312150300450600750SE +/- 0.58, N = 3SE +/- 6.89, N = 36886876791. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction231612182430SE +/- 0.22, N = 3SE +/- 0.28, N = 325.5525.6125.881. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K1323691215SE +/- 0.03, N = 3SE +/- 0.04, N = 311.8911.7411.741. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP1322K4K6K8K10KSE +/- 13.93, N = 3SE +/- 18.59, N = 37848792179411. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p12348121620SE +/- 0.03, N = 3SE +/- 0.03, N = 316.2216.0916.041. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3210.33370.66741.00111.33481.6685SE +/- 0.00090, N = 3SE +/- 0.00614, N = 31.467071.470481.48314MIN: 1.21MIN: 1.21MIN: 1.241. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K312612182430SE +/- 0.12, N = 3SE +/- 0.21, N = 325.9925.9425.711. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads13216003200480064008000SE +/- 30.64, N = 3SE +/- 11.93, N = 37207725872781. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression Speed132918273645SE +/- 0.23, N = 3SE +/- 0.09, N = 341.541.241.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2130.28970.57940.86911.15881.4485SE +/- 0.00139, N = 3SE +/- 0.01102, N = 31.275441.279851.28764MIN: 1.22MIN: 1.22MIN: 1.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOT3212004006008001000SE +/- 6.84, N = 39049048961. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NT13220406080100SE +/- 0.15, N = 3SE +/- 0.15, N = 390.589.889.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CPU-Only123918273645SE +/- 0.11, N = 3SE +/- 0.24, N = 338.7438.8839.04

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Compression Speed13211002200330044005500SE +/- 3.83, N = 3SE +/- 2.38, N = 35279.45250.35243.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, Lossless123246810SE +/- 0.010, N = 3SE +/- 0.017, N = 36.7676.7966.8091. (CXX) g++ options: -O3 -fPIC -lm

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filter231110220330440550SE +/- 1.77, N = 3SE +/- 0.90, N = 3489.7488.3486.71. 3.8.1.0

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p31280160240320400SE +/- 3.53, N = 3SE +/- 1.33, N = 3363.64362.92361.431. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets1320.79881.59762.39643.19523.994SE +/- 0.00, N = 3SE +/- 0.01, N = 33.553.533.531. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID2310.81231.62462.43693.24924.0615SE +/- 0.00, N = 3SE +/- 0.01, N = 33.613.593.591. (CXX) g++ options: -O3 -march=native -pthread

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: CPU-Only13220406080100SE +/- 0.04, N = 3SE +/- 0.60, N = 3100.03100.08100.58

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3210.27560.55120.82681.10241.378SE +/- 0.00239, N = 3SE +/- 0.00091, N = 31.218411.222241.22494MIN: 1.12MIN: 1.13MIN: 1.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 1.02, N = 3SE +/- 6.39, N = 32082.992085.082094.12MIN: 2067.76MIN: 2067.79MIN: 2069.051. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: CPU-Only132306090120150SE +/- 0.22, N = 3SE +/- 0.36, N = 3116.52116.66117.14

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TN13220406080100SE +/- 0.15, N = 3SE +/- 0.03, N = 395.194.794.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filter231120240360480600SE +/- 0.95, N = 3SE +/- 0.73, N = 3537.3535.1534.51. 3.8.1.0

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p231816243240SE +/- 0.08, N = 3SE +/- 0.05, N = 334.5934.4534.411. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p1231.30282.60563.90845.21126.514SE +/- 0.01, N = 3SE +/- 0.01, N = 35.795.775.761. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya3210.47930.95861.43791.91722.3965SE +/- 0.00, N = 3SE +/- 0.00, N = 32.132.132.121. (CXX) g++ options: -O3 -march=native -pthread

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p312100200300400500SE +/- 3.83, N = 3SE +/- 2.49, N = 3462.91462.61460.861. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 61233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 310.4410.4710.491. (CXX) g++ options: -O3 -fPIC -lm

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TT21320406080100SE +/- 0.20, N = 2SE +/- 0.30, N = 392.992.992.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1325001000150020002500SE +/- 3.90, N = 3SE +/- 11.83, N = 32089.272095.262098.23MIN: 2067.67MIN: 2072.7MIN: 2065.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57321600M1200M1800M2400M3000MSE +/- 1003881.36, N = 3SE +/- 4836091.17, N = 32727266667271826666727157000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks21316003200480064008000SE +/- 110.25, N = 3SE +/- 31.63, N = 37579759976111. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CPU-Only2131224364860SE +/- 0.05, N = 3SE +/- 0.03, N = 354.8055.0055.02

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, Lossless123714212835SE +/- 0.01, N = 3SE +/- 0.03, N = 329.4429.5029.561. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p12348121620SE +/- 0.01, N = 3SE +/- 0.05, N = 318.2418.2218.171. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory13213002600390052006500SE +/- 9.05, N = 3SE +/- 9.71, N = 36128.196121.176104.811. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU123160320480640800SE +/- 1.60, N = 3SE +/- 1.21, N = 3730.78732.81733.51MIN: 719.75MIN: 720.04MIN: 720.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.25960.51920.77881.03841.298SE +/- 0.00070, N = 3SE +/- 0.00189, N = 31.149651.151661.15391MIN: 1.09MIN: 1.08MIN: 1.091. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression Speed2136001200180024003000SE +/- 5.02, N = 3SE +/- 7.03, N = 32564.82559.52555.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305321140280420560700SE +/- 0.61, N = 3SE +/- 0.48, N = 3635.34635.01633.171. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt123140280420560700SE +/- 0.23, N = 3SE +/- 0.53, N = 3632.66631.09630.511. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: CPU-Only312306090120150SE +/- 0.10, N = 3SE +/- 0.05, N = 3142.46142.58142.94

LuaRadio

Test: Complex Phase

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phase132110220330440550SE +/- 0.64, N = 3SE +/- 0.22, N = 3515.5514.5513.8

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57123700M1400M2100M2800M3500MSE +/- 2649108.86, N = 3SE +/- 503322.30, N = 33129700000312423333331194000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile231612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 323.2323.2323.30

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transform31280160240320400SE +/- 2.03, N = 3SE +/- 0.32, N = 3347.1346.5346.11. 3.8.1.0

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression Speed1236001200180024003000SE +/- 0.59, N = 3SE +/- 3.16, N = 32892.42888.02884.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K3210.86181.72362.58543.44724.309SE +/- 0.01, N = 3SE +/- 0.01, N = 33.833.823.821. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU231160320480640800SE +/- 0.35, N = 3SE +/- 0.14, N = 3732.29732.48734.17MIN: 720.4MIN: 720.58MIN: 722.641. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression Speed2316001200180024003000SE +/- 1.83, N = 3SE +/- 5.72, N = 32770.22768.52763.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Timed Erlang/OTP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To Compile321306090120150SE +/- 0.50, N = 3SE +/- 0.43, N = 3157.30157.48157.68

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU3120.77881.55762.33643.11523.894SE +/- 0.00642, N = 3SE +/- 0.00371, N = 33.453143.454003.46141MIN: 3.38MIN: 3.4MIN: 3.381. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 5712313M26M39M52M65MSE +/- 43364.09, N = 3SE +/- 66338.36, N = 35965400059569333595203331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

LuaRadio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filters321100200300400500SE +/- 1.34, N = 3SE +/- 0.38, N = 3452.2452.0451.2

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile132306090120150SE +/- 0.03, N = 3SE +/- 0.31, N = 3128.18128.38128.47

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression Speed2316001200180024003000SE +/- 5.14, N = 3SE +/- 2.45, N = 152992.62989.62986.21. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3d123150300450600750SE +/- 0.64, N = 3SE +/- 0.84, N = 3689.60690.48691.061. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

LuaRadio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filter32170140210280350SE +/- 0.03, N = 3SE +/- 0.19, N = 3339.9339.8339.2

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57123200M400M600M800M1000MSE +/- 935420.29, N = 3SE +/- 1134097.78, N = 39377500009367766679361466671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 02311224364860SE +/- 0.00, N = 3SE +/- 0.01, N = 353.2253.2953.311. (CXX) g++ options: -O3 -fPIC -lm

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57321100M200M300M400M500MSE +/- 230289.67, N = 3SE +/- 493637.29, N = 34764200004761166674756500001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1231.13652.2733.40954.5465.6825SE +/- 0.00358, N = 3SE +/- 0.00299, N = 35.043075.043205.05103MIN: 4.9MIN: 4.85MIN: 4.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57231400M800M1200M1600M2000MSE +/- 88191.71, N = 3SE +/- 218581.28, N = 31693066667169126666716906000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 5732130M60M90M120M150MSE +/- 60092.52, N = 3SE +/- 28480.01, N = 31190866671190366671189200001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 5723150M100M150M200M250MSE +/- 92616.29, N = 3SE +/- 116237.31, N = 32382366672380533332379700001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decrypt12310002000300040005000SE +/- 1.20, N = 3SE +/- 5.30, N = 34560.404559.094555.311. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt32180160240320400SE +/- 0.08, N = 3SE +/- 0.08, N = 3367.80367.80367.421. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-25612310002000300040005000SE +/- 0.67, N = 3SE +/- 4.16, N = 34560.384558.634555.631. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time231100020003000400050004546.44542.44542.01. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish12380160240320400SE +/- 0.12, N = 3SE +/- 0.14, N = 3369.32369.22368.991. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2213714212835SE +/- 0.03, N = 3SE +/- 0.02, N = 328.3928.4128.411. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU132160320480640800SE +/- 1.01, N = 3SE +/- 1.26, N = 3732.05732.38732.63MIN: 720.72MIN: 718.93MIN: 720.331. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish21370140210280350SE +/- 0.02, N = 3SE +/- 0.11, N = 3302.49302.44302.291. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU3120.60081.20161.80242.40323.004SE +/- 0.00016, N = 3SE +/- 0.00818, N = 32.668772.670162.67036MIN: 2.48MIN: 2.51MIN: 2.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression Speed2315001000150020002500SE +/- 3.23, N = 3SE +/- 1.71, N = 32553.72553.02552.21. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt21370140210280350SE +/- 0.03, N = 3SE +/- 0.04, N = 3302.34302.23302.181. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU23120K40K60K80K100KSE +/- 9.51, N = 3SE +/- 20.57, N = 396717.8496694.4196671.091. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt312306090120150SE +/- 0.02, N = 3SE +/- 0.01, N = 3120.10120.07120.061. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt32120406080100SE +/- 0.03, N = 3SE +/- 0.01, N = 375.1075.1075.091. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI32120406080100SE +/- 0.01, N = 3SE +/- 0.01, N = 377.6677.6677.651. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256231306090120150SE +/- 0.02, N = 3SE +/- 0.01, N = 3120.07120.06120.061. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU3210.2270.4540.6810.9081.135SE +/- 0.00167, N = 3SE +/- 0.00095, N = 31.008941.008991.00901MIN: 0.97MIN: 0.97MIN: 0.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

LuaRadio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transform32120406080100SE +/- 0.00, N = 3SE +/- 0.03, N = 382.782.782.7

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom3210.14850.2970.44550.5940.7425SE +/- 0.00, N = 3SE +/- 0.00, N = 30.660.660.661. (CXX) g++ options: -O3 -march=native -pthread

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-N123714212835SE +/- 0.47, N = 3SE +/- 1.05, N = 330.928.527.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOT231110220330440550SE +/- 27.14, N = 3SE +/- 13.58, N = 35074794611. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression Speed312100200300400500SE +/- 8.81, N = 15SE +/- 0.80, N = 3473.1444.1444.01. (CC) gcc options: -O3 -march=native -pthread -lz -llzma


Phoronix Test Suite v10.8.5