GCC 14 vs. Clang 18 - AMD Ryzen Threadripper 7980X

AMD Ryzen Threadripper 7980X compiler benchmarking on Fedora 40 by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2404242-NE-GCC14AMDR04&grr&sor.

GCC 14 vs. Clang 18 - AMD Ryzen Threadripper 7980XProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionGCC 14.0.1 20240411Clang 18.1.1AMD Ryzen Threadripper 7980X 64-Cores @ 7.79GHz (64 Cores / 128 Threads)System76 Thelio Major (FA Z5 BIOS)AMD Device 14a44 x 32GB DDR5-4800MT/s Micron MTC20F1045S1RC48BA21000GB CT1000T700SSD5AMD Radeon Pro W7900 45GBAMD Device 14ccDELL P2415QAquantia AQC113C NBase-T/IEEE + Realtek RTL8125 2.5GbE + Intel Wi-Fi 6EFedora Linux 406.8.5-301.fc40.x86_64 (x86_64)GNOME Shell 46.1X Server + Wayland4.6 Mesa 24.0.5 (LLVM 18.1.1 DRM 3.57)GCC 14.0.1 20240411btrfs1920x1080Clang 18.1.1 + LLVM 18.1.1OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- GCC 14.0.1 20240411: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none,amdgcn-amdhsa --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Processor Details- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa108105 Python Details- Python 3.12.2Security Details- SELinux + gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCC 14 vs. Clang 18 - AMD Ryzen Threadripper 7980Xlammps: 20k Atomsopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenssl: RSA4096openssl: RSA4096securemark: SecureMark-TLSsmhasher: SHA3-256smhasher: SHA3-256quicksilver: CTS2quicksilver: CORAL2 P2openvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUjohn-the-ripper: HMAC-SHA512openssl: AES-128-GCMopenssl: ChaCha20-Poly1305openssl: AES-256-GCMopenssl: ChaCha20openssl: SHA512openssl: SHA256jpegxl: PNG - 90liquid-dsp: 1 - 256 - 512coremark: CoreMark Size 666 - Iterations Per Secondhelsing: 14 digitcompress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 12 - Decompression Speedcompress-zstd: 12 - Compression Speedopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUjohn-the-ripper: MD5openvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUgraphics-magick: Resizinggraphics-magick: Rotategraphics-magick: Noise-Gaussiangraphics-magick: Sharpengraphics-magick: HWB Color Spacegraphics-magick: Enhancedgraphics-magick: Swirlsrsran: PDSCH Processor Benchmark, Throughput Totalquicksilver: CORAL2 P1stargate: 192000 - 1024gromacs: MPI CPU - water_GMX50_barequantlib: Multi-Threadedjpegxl: JPEG - 90encode-opus: WAV To Opus Encodejohn-the-ripper: WPA PSKliquid-dsp: 128 - 256 - 512astcenc: Very Thoroughjohn-the-ripper: bcryptjohn-the-ripper: Blowfishliquid-dsp: 64 - 256 - 512synthmark: VoiceMark_100webp: Quality 100, Lossless, Highest Compressionastcenc: Exhaustivestargate: 96000 - 1024minibude: OpenMP - BM2minibude: OpenMP - BM2primesieve: 1e13compress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speeduvg266: Bosphorus 4K - Slowuvg266: Bosphorus 4K - Mediumsrsran: PUSCH Processor Benchmark, Throughput Totalsvt-av1: Preset 4 - Bosphorus 4Kencode-flac: WAV To FLACquantlib: Single-Threadedkvazaar: Bosphorus 4K - Mediumsvt-av1: Preset 13 - Bosphorus 4Kx265: Bosphorus 4Kminibude: OpenMP - BM1minibude: OpenMP - BM1jpegxl-decode: Alljpegxl: PNG - 100webp: Quality 100, Losslessjpegxl: JPEG - 100astcenc: Thoroughsmhasher: MeowHash x86_64 AES-NIsmhasher: MeowHash x86_64 AES-NIsvt-av1: Preset 12 - Bosphorus 4Ksmhasher: FarmHash128smhasher: FarmHash128svt-av1: Preset 8 - Bosphorus 4Kkvazaar: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Ultra Fastrnnoise: 26 Minute Long Talking Samplec-ray: Total Time - 4K, 16 Rays Per Pixelkvazaar: Bosphorus 4K - Very Fastsmhasher: Spooky32smhasher: Spooky32smhasher: FarmHash32 x86_64 AVXsmhasher: FarmHash32 x86_64 AVXdraco: Church Facadesmhasher: fasthash32smhasher: fasthash32kvazaar: Bosphorus 4K - Ultra Fastencode-wavpack: WAV To WavPacksmhasher: t1ha2_atoncesmhasher: t1ha2_atoncesmhasher: t1ha0_aes_avx2 x86_64smhasher: t1ha0_aes_avx2 x86_64draco: Lionwebp: Quality 100, Highest Compressionastcenc: Mediumsmhasher: wyhashsmhasher: wyhashwebp: Quality 100tscp: AI Chess PerformanceGCC 14.0.1 20240411Clang 18.1.127.24218.283367.261280463.344305.14160971648.905237.33218533332178666789.77355.932988530008220882489773103526179277076582465534378296868833708417700311251352716348.575182589173365815.03282263.2851825.312.11917.825.92251.4399.1377.5884.53159.76200.067.334299.7524.441304.855.335969.174.3813824.920.43116609.441331866727.862288.666.384938.077.648089.5519315518521625925055420064.5285733334.0976175.571293011.246.96919.32854146112422000008.71201534461528661025733333990.5700.815.34466.161616175.0504376.24426.3515658.0146.675882.149.0227.1630.394801.99.84811.3924611.637.05193.77939.16175.9024397.544600.86841.1752.0741.69462.448441.83650680.40197.49243.84721159.3795.24693.1270.7372.6574.077.6076.17979.4925.09420053.0724.11437461.46524020.7978833.7197.053.70219.11019662.3319.08596787.5740494.57450.109813.44226137.9814.80196008426.33321.122933.121275596.844172.3432909996.372380.271918666722130000100.60317.982144483338167191577101962158718077010698824332881238513903724609937710934582436049.454234263083044772.38052678.0571861.012.31962.825.82340.9383.4380.5083.82160.20199.477.444232.6725.201266.105.405885.264.4613601.850.43115684.851484033327.932283.116.454888.567.708024.9417815315919920925645720338.7235700004.2387555.247295240.551.25418.41646052512978666678.961515972615778512310666671002.5150.825.51306.550766210.7825269.55026.4315493.7141.725640.145.8626.4529.504446.19.85310.6804697.638.67195.30740.08174.7664369.140597.36240.7072.0541.65363.560642.39351816.54194.88531.02419825.3693.19496.3774.3875.2776.457.4848.49284.1625.14620643.4526.99931033.62511220.5648593.7897.803.70119.35119737.3918.83997667.2639425.64459.583112.96829888.6914.582118389OpenBenchmarking.org

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsGCC 14.0.1 20240411Clang 18.1.1612182430SE +/- 0.28, N = 3SE +/- 0.04, N = 327.2426.331. (CXX) g++ options: -O3 -march=native -lm -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Noise Suppression Poconet-Like FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.1510152025SE +/- 0.14, N = 15SE +/- 0.18, N = 1518.2821.12-fno-strict-overflow -fwrapv - MIN: 7.31 / MAX: 109.14MIN: 7.43 / MAX: 121.231. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Noise Suppression Poconet-Like FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.17001400210028003500SE +/- 28.58, N = 15SE +/- 27.73, N = 153367.262933.12-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.3Algorithm: RSA4096GCC 14.0.1 20240411Clang 18.1.1300K600K900K1200K1500KSE +/- 1971.44, N = 3SE +/- 967.52, N = 31280463.31275596.8-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.3Algorithm: RSA4096GCC 14.0.1 20240411Clang 18.1.19K18K27K36K45KSE +/- 89.80, N = 3SE +/- 71.24, N = 344305.144172.3-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 18.1.1GCC 14.0.1 2024041190K180K270K360K450KSE +/- 1948.09, N = 3SE +/- 1794.60, N = 34329094160971. (CC) gcc options: -pedantic -O3

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: SHA3-256Clang 18.1.1GCC 14.0.1 20240411400800120016002000SE +/- 3.42, N = 3SE +/- 5.66, N = 3996.371648.91-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: SHA3-256

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: SHA3-256Clang 18.1.1GCC 14.0.1 2024041180160240320400SE +/- 3.36, N = 3SE +/- 1.54, N = 3380.27237.33-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

Quicksilver

Input: CTS2

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CTS2GCC 14.0.1 20240411Clang 18.1.15M10M15M20M25MSE +/- 6666.67, N = 3SE +/- 75351.03, N = 321853333191866671. (CXX) g++ options: -fopenmp -O3 -march=native

Quicksilver

Input: CORAL2 P2

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P2Clang 18.1.1GCC 14.0.1 202404115M10M15M20M25MSE +/- 10000.00, N = 3SE +/- 80069.41, N = 322130000217866671. (CXX) g++ options: -fopenmp -O3 -march=native

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Machine Translation EN To DE FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.120406080100SE +/- 0.13, N = 3SE +/- 0.85, N = 1589.77100.60-fno-strict-overflow -fwrapv - MIN: 33.39 / MAX: 179.49MIN: 33.14 / MAX: 247.721. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Machine Translation EN To DE FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.180160240320400SE +/- 0.54, N = 3SE +/- 2.83, N = 15355.93317.98-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512GCC 14.0.1 20240411Clang 18.1.160M120M180M240M300MSE +/- 1504066.82, N = 3SE +/- 6642498.38, N = 152988530002144483331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: AES-128-GCMGCC 14.0.1 20240411Clang 18.1.1200000M400000M600000M800000M1000000MSE +/- 665285492.45, N = 3SE +/- 246618519.80, N = 3822088248977816719157710-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: ChaCha20-Poly1305GCC 14.0.1 20240411Clang 18.1.170000M140000M210000M280000M350000MSE +/- 34389860.61, N = 3SE +/- 32266915.14, N = 3310352617927196215871807-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: AES-256-GCMGCC 14.0.1 20240411Clang 18.1.1150000M300000M450000M600000M750000MSE +/- 612129667.02, N = 3SE +/- 847452668.98, N = 3707658246553701069882433-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: ChaCha20GCC 14.0.1 20240411Clang 18.1.190000M180000M270000M360000M450000MSE +/- 56533218.78, N = 3SE +/- 161435637.78, N = 3437829686883288123851390-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: SHA512Clang 18.1.1GCC 14.0.1 202404118000M16000M24000M32000M40000MSE +/- 38811972.88, N = 3SE +/- 40543653.85, N = 33724609937737084177003-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: SHA256GCC 14.0.1 20240411Clang 18.1.120000M40000M60000M80000M100000MSE +/- 254544356.50, N = 3SE +/- 281782565.20, N = 3112513527163109345824360-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -march=native -ldl

JPEG-XL libjxl

Input: PNG - Quality: 90

OpenBenchmarking.orgMP/s, More Is BetterJPEG-XL libjxl 0.10.1Input: PNG - Quality: 90Clang 18.1.1GCC 14.0.1 202404111122334455SE +/- 0.47, N = 15SE +/- 0.41, N = 1549.4548.581. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 512Clang 18.1.1GCC 14.0.1 202404115M10M15M20M25MSE +/- 343818.42, N = 13SE +/- 417025.62, N = 1223426308182589171. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 14.0.1 20240411Clang 18.1.1700K1400K2100K2800K3500KSE +/- 1831.80, N = 3SE +/- 24583.22, N = 153365815.033044772.381. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

Helsing

Digit Range: 14 digit

OpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitGCC 14.0.1 20240411Clang 18.1.120406080100SE +/- 0.21, N = 3SE +/- 0.30, N = 363.2978.061. (CC) gcc options: -O2 -pthread

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression SpeedClang 18.1.1GCC 14.0.1 20240411400800120016002000SE +/- 3.56, N = 3SE +/- 10.14, N = 31861.01825.3-Qunused-arguments1. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression SpeedClang 18.1.1GCC 14.0.1 202404113691215SE +/- 0.03, N = 3SE +/- 0.07, N = 312.312.1-Qunused-arguments1. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression SpeedClang 18.1.1GCC 14.0.1 20240411400800120016002000SE +/- 2.11, N = 3SE +/- 2.52, N = 31962.81917.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression SpeedGCC 14.0.1 20240411Clang 18.1.1612182430SE +/- 0.03, N = 3SE +/- 0.07, N = 325.925.8-Qunused-arguments1. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression SpeedClang 18.1.1GCC 14.0.1 202404115001000150020002500SE +/- 2.17, N = 3SE +/- 21.94, N = 32340.92251.4-Qunused-arguments1. (CC) gcc options: -O3 -march=native -pthread -lz

Zstd Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression SpeedGCC 14.0.1 20240411Clang 18.1.190180270360450SE +/- 3.11, N = 3SE +/- 0.40, N = 3399.1383.4-Qunused-arguments1. (CC) gcc options: -O3 -march=native -pthread -lz

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Face Detection FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.180160240320400SE +/- 0.25, N = 3SE +/- 0.31, N = 3377.58380.50-fno-strict-overflow -fwrapv - MIN: 186.1 / MAX: 401.52MIN: 321.62 / MAX: 405.781. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Face Detection FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.120406080100SE +/- 0.07, N = 3SE +/- 0.08, N = 384.5383.82-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Detection FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.14080120160200SE +/- 0.22, N = 3SE +/- 0.53, N = 3159.76160.20-fno-strict-overflow -fwrapv - MIN: 54.41 / MAX: 238.04MIN: 50.55 / MAX: 283.621. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Detection FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.14080120160200SE +/- 0.27, N = 3SE +/- 0.66, N = 3200.06199.47-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Vehicle Bike Detection FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.1246810SE +/- 0.01, N = 3SE +/- 0.01, N = 37.337.44-fno-strict-overflow -fwrapv - MIN: 3.9 / MAX: 28.61MIN: 4.21 / MAX: 29.461. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Vehicle Bike Detection FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.19001800270036004500SE +/- 5.60, N = 3SE +/- 5.05, N = 34299.754232.67-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Road Segmentation ADAS FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.1612182430SE +/- 0.18, N = 3SE +/- 0.33, N = 324.4425.20-fno-strict-overflow -fwrapv - MIN: 11.45 / MAX: 51.44MIN: 10.11 / MAX: 731. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Road Segmentation ADAS FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.130060090012001500SE +/- 9.55, N = 3SE +/- 16.52, N = 31304.851266.10-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Person Re-Identification Retail FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.11.2152.433.6454.866.075SE +/- 0.00, N = 3SE +/- 0.00, N = 35.335.40-fno-strict-overflow -fwrapv - MIN: 3.15 / MAX: 26.39MIN: 3.38 / MAX: 23.081. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Person Re-Identification Retail FP16 - Device: CPUGCC 14.0.1 20240411Clang 18.1.113002600390052006500SE +/- 3.36, N = 3SE +/- 2.64, N = 35969.175885.26-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Face Detection Retail FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.11.00352.0073.01054.0145.0175SE +/- 0.00, N = 3SE +/- 0.00, N = 34.384.46-fno-strict-overflow -fwrapv - MIN: 2.05 / MAX: 26.42MIN: 2.1 / MAX: 27.331. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Face Detection Retail FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.13K6K9K12K15KSE +/- 11.99, N = 3SE +/- 5.86, N = 313824.9213601.85-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.10.09680.19360.29040.38720.484SE +/- 0.00, N = 3SE +/- 0.00, N = 30.430.43-fno-strict-overflow -fwrapv - MIN: 0.19 / MAX: 22.84MIN: 0.2 / MAX: 25.551. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.120K40K60K80K100KSE +/- 108.76, N = 3SE +/- 47.46, N = 3116609.44115684.85-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5Clang 18.1.1GCC 14.0.1 202404113M6M9M12M15MSE +/- 17975.29, N = 3SE +/- 15878.01, N = 314840333133186671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Handwritten English Recognition FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.1714212835SE +/- 0.02, N = 3SE +/- 0.06, N = 327.8627.93-fno-strict-overflow -fwrapv - MIN: 16.47 / MAX: 52.7MIN: 16.08 / MAX: 54.791. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Handwritten English Recognition FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.15001000150020002500SE +/- 1.62, N = 3SE +/- 4.45, N = 32288.662283.11-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Vehicle Detection FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.1246810SE +/- 0.01, N = 3SE +/- 0.01, N = 36.386.45-fno-strict-overflow -fwrapv - MIN: 3.23 / MAX: 27.63MIN: 3.45 / MAX: 34.451. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Vehicle Detection FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.111002200330044005500SE +/- 6.00, N = 3SE +/- 8.31, N = 34938.074888.56-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.0Model: Weld Porosity Detection FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.1246810SE +/- 0.01, N = 3SE +/- 0.01, N = 37.647.70-fno-strict-overflow -fwrapv - MIN: 3.6 / MAX: 33.44MIN: 3.41 / MAX: 30.421. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.0Model: Weld Porosity Detection FP16-INT8 - Device: CPUGCC 14.0.1 20240411Clang 18.1.12K4K6K8K10KSE +/- 6.57, N = 3SE +/- 9.39, N = 38089.558024.94-fno-strict-overflow -fwrapv1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -shared -ldl

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: ResizingGCC 14.0.1 20240411Clang 18.1.14080120160200SE +/- 0.58, N = 3SE +/- 0.33, N = 3193178-lgomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: RotateGCC 14.0.1 20240411Clang 18.1.1306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3155153-lgomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianGCC 14.0.1 20240411Clang 18.1.14080120160200SE +/- 0.33, N = 3SE +/- 0.00, N = 3185159-lgomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenGCC 14.0.1 20240411Clang 18.1.150100150200250SE +/- 0.33, N = 3SE +/- 1.00, N = 3216199-lgomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: HWB Color SpaceGCC 14.0.1 20240411Clang 18.1.160120180240300SE +/- 0.67, N = 3SE +/- 0.33, N = 3259209-lgomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedClang 18.1.1GCC 14.0.1 2024041160120180240300SE +/- 0.33, N = 3SE +/- 0.33, N = 3256250-lomp-lgomp1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SwirlGCC 14.0.1 20240411Clang 18.1.1120240360480600SE +/- 0.88, N = 3SE +/- 0.58, N = 3554457-lgomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=native -ljpeg -lX11 -lz -lm -lpthread

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput TotalClang 18.1.1GCC 14.0.1 202404114K8K12K16K20KSE +/- 173.47, N = 3SE +/- 160.00, N = 1520338.720064.51. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

Quicksilver

Input: CORAL2 P1

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P1GCC 14.0.1 20240411Clang 18.1.16M12M18M24M30MSE +/- 26034.17, N = 3SE +/- 32145.50, N = 328573333235700001. (CXX) g++ options: -fopenmp -O3 -march=native

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 1024Clang 18.1.1GCC 14.0.1 202404110.95371.90742.86113.81484.7685SE +/- 0.003706, N = 3SE +/- 0.009323, N = 34.2387554.0976171. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareGCC 14.0.1 20240411Clang 18.1.11.25352.5073.76055.0146.2675SE +/- 0.005, N = 3SE +/- 0.008, N = 35.5715.2471. (CXX) g++ options: -O3 -march=native -lm

QuantLib

Configuration: Multi-Threaded

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Multi-ThreadedClang 18.1.1GCC 14.0.1 2024041160K120K180K240K300KSE +/- 1085.73, N = 3SE +/- 975.48, N = 3295240.5293011.21. (CXX) g++ options: -O3 -march=native -fPIE -pie

JPEG-XL libjxl

Input: JPEG - Quality: 90

OpenBenchmarking.orgMP/s, More Is BetterJPEG-XL libjxl 0.10.1Input: JPEG - Quality: 90Clang 18.1.1GCC 14.0.1 202404111224364860SE +/- 0.62, N = 4SE +/- 0.51, N = 551.2546.971. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.4WAV To Opus EncodeClang 18.1.1GCC 14.0.1 20240411510152025SE +/- 0.02, N = 5SE +/- 0.02, N = 518.4219.331. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKGCC 14.0.1 20240411Clang 18.1.1120K240K360K480K600KSE +/- 668.00, N = 3SE +/- 3691.65, N = 35414614605251. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512Clang 18.1.1GCC 14.0.1 20240411300M600M900M1200M1500MSE +/- 7521155.35, N = 3SE +/- 4106498.91, N = 3129786666712422000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

ASTC Encoder

Preset: Very Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.7Preset: Very ThoroughClang 18.1.1GCC 14.0.1 202404113691215SE +/- 0.0212, N = 3SE +/- 0.0183, N = 38.96158.71201. (CXX) g++ options: -O3 -flto -pthread

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptClang 18.1.1GCC 14.0.1 2024041130K60K90K120K150KSE +/- 1053.28, N = 3SE +/- 239.95, N = 31597261534461. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishClang 18.1.1GCC 14.0.1 2024041130K60K90K120K150KSE +/- 609.99, N = 3SE +/- 79.08, N = 31577851528661. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512Clang 18.1.1GCC 14.0.1 20240411300M600M900M1200M1500MSE +/- 5417358.93, N = 3SE +/- 3868821.24, N = 3123106666710257333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100Clang 18.1.1GCC 14.0.1 202404112004006008001000SE +/- 2.74, N = 3SE +/- 2.19, N = 31002.52990.571. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Lossless, Highest CompressionClang 18.1.1GCC 14.0.1 202404110.18450.3690.55350.7380.9225SE +/- 0.00, N = 3SE +/- 0.00, N = 30.820.81-lpng16 -ljpeg1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.7Preset: ExhaustiveClang 18.1.1GCC 14.0.1 202404111.24042.48083.72124.96166.202SE +/- 0.0146, N = 3SE +/- 0.0104, N = 35.51305.34461. (CXX) g++ options: -O3 -flto -pthread

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 96000 - Buffer Size: 1024Clang 18.1.1GCC 14.0.1 20240411246810SE +/- 0.013288, N = 3SE +/- 0.000716, N = 36.5507666.1616161. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2Clang 18.1.1GCC 14.0.1 2024041150100150200250SE +/- 0.27, N = 3SE +/- 0.24, N = 3210.78175.051. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2Clang 18.1.1GCC 14.0.1 2024041111002200330044005500SE +/- 6.63, N = 3SE +/- 5.94, N = 35269.554376.241. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Primesieve

Length: 1e13

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 12.1Length: 1e13GCC 14.0.1 20240411Clang 18.1.1612182430SE +/- 0.06, N = 3SE +/- 0.02, N = 326.3526.431. (CXX) g++ options: -O3 -march=native

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.4Compression Level: 3 - Decompression SpeedGCC 14.0.1 20240411Clang 18.1.112002400360048006000SE +/- 20.86, N = 3SE +/- 60.76, N = 45658.05493.71. (CC) gcc options: -O3 -march=native

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.4Compression Level: 3 - Compression SpeedGCC 14.0.1 20240411Clang 18.1.1306090120150SE +/- 0.46, N = 3SE +/- 1.63, N = 4146.67141.721. (CC) gcc options: -O3 -march=native

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.4Compression Level: 9 - Decompression SpeedGCC 14.0.1 20240411Clang 18.1.113002600390052006500SE +/- 36.97, N = 3SE +/- 4.45, N = 35882.15640.11. (CC) gcc options: -O3 -march=native

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.4Compression Level: 9 - Compression SpeedGCC 14.0.1 20240411Clang 18.1.11122334455SE +/- 0.30, N = 3SE +/- 0.07, N = 349.0245.861. (CC) gcc options: -O3 -march=native

uvg266

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: SlowGCC 14.0.1 20240411Clang 18.1.1612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 327.1626.451. (CXX) g++ options: -O3 -march=native

uvg266

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: MediumGCC 14.0.1 20240411Clang 18.1.1714212835SE +/- 0.06, N = 3SE +/- 0.02, N = 330.3929.501. (CXX) g++ options: -O3 -march=native

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput TotalGCC 14.0.1 20240411Clang 18.1.110002000300040005000SE +/- 0.38, N = 3SE +/- 0.23, N = 34801.94446.1MIN: 3393.9 / MAX: 4802.5MIN: 2910.7 / MAX: 4446.51. (CXX) g++ options: -O3 -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq -fno-trapping-math -fno-math-errno -ldl

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.0Encoder Mode: Preset 4 - Input: Bosphorus 4KClang 18.1.1GCC 14.0.1 202404113691215SE +/- 0.023, N = 3SE +/- 0.023, N = 39.8539.8481. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.4WAV To FLACClang 18.1.1GCC 14.0.1 202404113691215SE +/- 0.03, N = 5SE +/- 0.02, N = 510.6811.391. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

QuantLib

Configuration: Single-Threaded

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Single-ThreadedClang 18.1.1GCC 14.0.1 2024041110002000300040005000SE +/- 22.05, N = 3SE +/- 36.95, N = 34697.64611.61. (CXX) g++ options: -O3 -march=native -fPIE -pie

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumClang 18.1.1GCC 14.0.1 20240411918273645SE +/- 0.06, N = 3SE +/- 0.04, N = 338.6737.05-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.0Encoder Mode: Preset 13 - Input: Bosphorus 4KClang 18.1.1GCC 14.0.1 202404114080120160200SE +/- 1.67, N = 15SE +/- 1.61, N = 3195.31193.781. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.6Video Input: Bosphorus 4KClang 18.1.1GCC 14.0.1 20240411918273645SE +/- 0.12, N = 3SE +/- 0.17, N = 340.0839.161. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 14.0.1 20240411Clang 18.1.14080120160200SE +/- 0.09, N = 3SE +/- 1.52, N = 15175.90174.771. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1GCC 14.0.1 20240411Clang 18.1.19001800270036004500SE +/- 2.34, N = 3SE +/- 37.96, N = 154397.544369.141. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

JPEG-XL Decoding libjxl

CPU Threads: All

OpenBenchmarking.orgMP/s, More Is BetterJPEG-XL Decoding libjxl 0.10.1CPU Threads: AllGCC 14.0.1 20240411Clang 18.1.1130260390520650SE +/- 1.93, N = 3SE +/- 3.24, N = 3600.87597.36

JPEG-XL libjxl

Input: PNG - Quality: 100

OpenBenchmarking.orgMP/s, More Is BetterJPEG-XL libjxl 0.10.1Input: PNG - Quality: 100GCC 14.0.1 20240411Clang 18.1.1918273645SE +/- 0.13, N = 3SE +/- 0.04, N = 341.1840.711. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, LosslessGCC 14.0.1 20240411Clang 18.1.10.46580.93161.39741.86322.329SE +/- 0.01, N = 3SE +/- 0.01, N = 32.072.05-lpng16 -ljpeg1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

JPEG-XL libjxl

Input: JPEG - Quality: 100

OpenBenchmarking.orgMP/s, More Is BetterJPEG-XL libjxl 0.10.1Input: JPEG - Quality: 100GCC 14.0.1 20240411Clang 18.1.11020304050SE +/- 0.17, N = 3SE +/- 0.04, N = 341.6941.651. (CXX) g++ options: -O3 -march=native -fno-rtti -fPIE -pie -lm

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.7Preset: ThoroughClang 18.1.1GCC 14.0.1 202404111428425670SE +/- 0.11, N = 3SE +/- 0.03, N = 363.5662.451. (CXX) g++ options: -O3 -flto -pthread

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIGCC 14.0.1 20240411Clang 18.1.11020304050SE +/- 0.08, N = 3SE +/- 0.59, N = 341.8442.39-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: MeowHash x86_64 AES-NI

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: MeowHash x86_64 AES-NIClang 18.1.1GCC 14.0.1 2024041111K22K33K44K55KSE +/- 397.13, N = 3SE +/- 105.54, N = 351816.5450680.40-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.0Encoder Mode: Preset 12 - Input: Bosphorus 4KGCC 14.0.1 20240411Clang 18.1.14080120160200SE +/- 1.09, N = 3SE +/- 1.72, N = 8197.49194.891. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SMHasher

Hash: FarmHash128

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: FarmHash128Clang 18.1.1GCC 14.0.1 202404111020304050SE +/- 0.01, N = 3SE +/- 0.03, N = 331.0243.85-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: FarmHash128

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: FarmHash128GCC 14.0.1 20240411Clang 18.1.15K10K15K20K25KSE +/- 114.70, N = 3SE +/- 58.42, N = 321159.3719825.36-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.0Encoder Mode: Preset 8 - Input: Bosphorus 4KGCC 14.0.1 20240411Clang 18.1.120406080100SE +/- 0.71, N = 3SE +/- 0.39, N = 395.2593.191. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastClang 18.1.1GCC 14.0.1 2024041120406080100SE +/- 0.32, N = 3SE +/- 0.91, N = 596.3793.12-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Very FastClang 18.1.1GCC 14.0.1 2024041120406080100SE +/- 0.26, N = 3SE +/- 0.15, N = 374.3870.731. (CXX) g++ options: -O3 -march=native

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Super FastClang 18.1.1GCC 14.0.1 2024041120406080100SE +/- 0.22, N = 3SE +/- 0.25, N = 375.2772.651. (CXX) g++ options: -O3 -march=native

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Ultra FastClang 18.1.1GCC 14.0.1 2024041120406080100SE +/- 0.08, N = 3SE +/- 0.13, N = 376.4574.071. (CXX) g++ options: -O3 -march=native

RNNoise

Input: 26 Minute Long Talking Sample

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 0.2Input: 26 Minute Long Talking SampleClang 18.1.1GCC 14.0.1 20240411246810SE +/- 0.046, N = 3SE +/- 0.035, N = 37.4847.6071. (CC) gcc options: -O3 -march=native -pedantic -fvisibility=hidden

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 14.0.1 20240411Clang 18.1.1246810SE +/- 0.026, N = 3SE +/- 0.018, N = 36.1798.4921. (CC) gcc options: -lm -lpthread -O3 -march=native

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastClang 18.1.1GCC 14.0.1 2024041120406080100SE +/- 0.17, N = 3SE +/- 0.12, N = 384.1679.49-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

SMHasher

Hash: Spooky32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: Spooky32GCC 14.0.1 20240411Clang 18.1.1612182430SE +/- 0.13, N = 3SE +/- 0.14, N = 325.0925.15-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: Spooky32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: Spooky32Clang 18.1.1GCC 14.0.1 202404114K8K12K16K20KSE +/- 64.55, N = 3SE +/- 51.73, N = 320643.4520053.07-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: FarmHash32 x86_64 AVX

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: FarmHash32 x86_64 AVXGCC 14.0.1 20240411Clang 18.1.1612182430SE +/- 0.06, N = 3SE +/- 0.04, N = 324.1127.00-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: FarmHash32 x86_64 AVX

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: FarmHash32 x86_64 AVXGCC 14.0.1 20240411Clang 18.1.18K16K24K32K40KSE +/- 13.98, N = 3SE +/- 154.09, N = 337461.4631033.62-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Church FacadeClang 18.1.1GCC 14.0.1 2024041111002200330044005500SE +/- 9.17, N = 3SE +/- 9.33, N = 3511252401. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: fasthash32

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: fasthash32Clang 18.1.1GCC 14.0.1 20240411510152025SE +/- 0.07, N = 3SE +/- 0.03, N = 320.5620.80-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: fasthash32

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: fasthash32GCC 14.0.1 20240411Clang 18.1.12K4K6K8K10KSE +/- 25.94, N = 3SE +/- 13.72, N = 38833.718593.78-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastClang 18.1.1GCC 14.0.1 2024041120406080100SE +/- 0.86, N = 3SE +/- 0.89, N = 397.8097.05-lpthread1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.7WAV To WavPackClang 18.1.1GCC 14.0.1 202404110.8331.6662.4993.3324.165SE +/- 0.002, N = 5SE +/- 0.009, N = 53.7013.7021. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceGCC 14.0.1 20240411Clang 18.1.1510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 319.1119.35-flto=auto -fno-fat-lto-objects-flto=thin1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: t1ha2_atonce

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha2_atonceClang 18.1.1GCC 14.0.1 202404114K8K12K16K20KSE +/- 62.32, N = 3SE +/- 10.73, N = 319737.3919662.33-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64Clang 18.1.1GCC 14.0.1 20240411510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 318.8419.09-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: t1ha0_aes_avx2 x86_64

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: t1ha0_aes_avx2 x86_64Clang 18.1.1GCC 14.0.1 2024041120K40K60K80K100KSE +/- 51.02, N = 3SE +/- 144.34, N = 397667.2696787.57-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: LionClang 18.1.1GCC 14.0.1 202404119001800270036004500SE +/- 18.35, N = 3SE +/- 16.90, N = 3394240491. (CXX) g++ options: -O3 -march=native

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100, Highest CompressionClang 18.1.1GCC 14.0.1 202404111.2692.5383.8075.0766.345SE +/- 0.01, N = 3SE +/- 0.00, N = 35.644.57-lpng16 -ljpeg1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.7Preset: MediumClang 18.1.1GCC 14.0.1 20240411100200300400500SE +/- 0.34, N = 3SE +/- 0.79, N = 3459.58450.111. (CXX) g++ options: -O3 -flto -pthread

SMHasher

Hash: wyhash

OpenBenchmarking.orgcycles/hash, Fewer Is BetterSMHasher 2022-08-22Hash: wyhashClang 18.1.1GCC 14.0.1 202404113691215SE +/- 0.00, N = 3SE +/- 0.01, N = 312.9713.44-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

SMHasher

Hash: wyhash

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: wyhashClang 18.1.1GCC 14.0.1 202404116K12K18K24K30KSE +/- 26.97, N = 3SE +/- 77.66, N = 329888.6926137.98-flto=thin-flto=auto -fno-fat-lto-objects1. (CXX) g++ options: -O3 -march=native

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.2.4Encode Settings: Quality 100GCC 14.0.1 20240411Clang 18.1.148121620SE +/- 0.03, N = 3SE +/- 0.04, N = 314.8014.58-lpng16 -ljpeg1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 18.1.1GCC 14.0.1 20240411500K1000K1500K2000K2500KSE +/- 2576.48, N = 5SE +/- 3119.54, N = 5211838919600841. (CC) gcc options: -O3 -march=native


Phoronix Test Suite v10.8.5