Intel Optimized Power Mode Xeon Platinum Benchmarks

2 x INTEL XEON PLATINUM 8592+ testing by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312153-NE-XEONEMRPO30
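The comparison command above can also be scripted. The following is a minimal Python sketch, assuming the phoronix-test-suite executable is installed and on PATH. The helper names build_compare_command and run_comparison are hypothetical and are not part of the Phoronix Test Suite; the benchmark itself may still prompt interactively.

```python
import shutil
import subprocess

# Result identifier taken from the command quoted in this result file.
RESULT_ID = "2312153-NE-XEONEMRPO30"


def build_compare_command(result_id: str) -> list[str]:
    """Return the argument list for comparing a local run against a result file."""
    return ["phoronix-test-suite", "benchmark", result_id]


def run_comparison(result_id: str) -> int:
    """Run the comparison and return the exit code (hypothetical helper)."""
    command = build_compare_command(result_id)
    if shutil.which(command[0]) is None:
        raise FileNotFoundError("phoronix-test-suite was not found on PATH")
    # Inherit stdin/stdout so any interactive prompts remain usable.
    return subprocess.run(command, check=False).returncode


if __name__ == "__main__":
    print(" ".join(build_compare_command(RESULT_ID)))
```

Calling run_comparison(RESULT_ID) would start the full benchmark set, which took many hours per run on the system documented here.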
Run Management

Result Identifier      Date Run           Test Duration
Default                December 14 2023   16 Hours, 25 Minutes
Optimized Power Mode   December 15 2023   1 Day, 52 Minutes
Average                                   20 Hours, 38 Minutes

Processor: 2 x INTEL XEON PLATINUM 8592+ @ 3.90GHz (128 Cores / 256 Threads)
Motherboard: Quanta Cloud S6Q-MB-MPS (3B05.TEL4P1 BIOS)
Chipset: Intel Device 1bce
Memory: 1008GB
Disk: 3201GB Micron_7450_MTFDKCB3T2TFS
Graphics: ASPEED
Network: 2 x Intel X710 for 10GBASE-T
OS: Ubuntu 23.10
Kernel: 6.5.0-13-generic (x86_64)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1080

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x21000161
- OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10)
- Python 3.11.6
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

[Figure: Default vs. Optimized Power Mode Comparison (Phoronix Test Suite). Bar chart of the relative difference between the two runs for each test, on an axis from Baseline to +54%. The largest gaps are Memcached 1:10 (54.1%), PyTorch CPU - 256 - ResNet-50 (41.2%), PyTorch CPU - 256 - ResNet-152 (39.3%), PyTorch CPU - 64 - ResNet-152 (38.5%), and nginx 500 (35.1%); the smallest listed gaps are 2.5%. The Apache Spark TPC-H (Scale Factor 10) per-query gaps range from 3.4% (Q17) to 40.2% (Q02).]
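The percentages in the comparison above are consistent with dividing the better of the two results by the worse one and subtracting one. That reading is inferred from the per-test numbers in this result file, not from Phoronix Test Suite documentation, and the function names below are hypothetical. A minimal sketch:

```python
def relative_gap_percent(result_a: float, result_b: float) -> float:
    """Percent by which the larger of two positive results exceeds the smaller."""
    low, high = sorted((result_a, result_b))
    if low <= 0:
        raise ValueError("results must be positive")
    return (high / low - 1.0) * 100.0


def winner(default: float, optimized: float, higher_is_better: bool) -> str:
    """Name the run with the better result for the given metric direction."""
    if default == optimized:
        return "Tie"
    default_wins = (default > optimized) == higher_is_better
    return "Default" if default_wins else "Optimized Power Mode"


if __name__ == "__main__":
    # PyTorch, CPU - 256 - ResNet-50 (batches/sec, more is better)
    print(round(relative_gap_percent(44.58, 31.58), 1), winner(44.58, 31.58, True))
    # OpenFOAM, motorBike - Execution Time (seconds, fewer is better)
    print(round(relative_gap_percent(4.13288, 5.50897), 1), winner(4.13288, 5.50897, False))
```

With the PyTorch and OpenFOAM values from this file, the sketch reproduces the 41.2% and 33.3% figures shown in the comparison.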

[Table: Intel Optimized Power Mode Xeon Platinum Benchmarks. Combined overview of the Default and Optimized Power Mode results for every test, covering the pytorch, nginx, openfoam, redis, compress-zstd, xmrig, vvenc, uvg266, spark-tpch, easywave, svt-av1, duckdb, hadoop, apache, ffmpeg, y-cruncher, liquid-dsp, openradioss, kvazaar, svt-vp9, build-gcc, pgbench, build-llvm, compress-7zip, memcached, deepsparse, blender, oidn, openvino, build-linux-kernel, qmcpack, and quantlib test profiles. Results are listed per test in the sections that follow.]

PyTorch

PyTorch 2.1 - Device: CPU - Batch Size: 256 - Model: ResNet-50 (batches/sec, More Is Better)
  Optimized Power Mode: 31.58 (SE +/- 0.02, N = 3; MIN: 14.91 / MAX: 34.67)
  Default: 44.58 (SE +/- 0.51, N = 4; MIN: 19.44 / MAX: 46.72)

PyTorch 2.1 - Device: CPU - Batch Size: 256 - Model: ResNet-152 (batches/sec, More Is Better)
  Optimized Power Mode: 12.81 (SE +/- 0.17, N = 3; MIN: 6.6 / MAX: 13.9)
  Default: 17.85 (SE +/- 0.13, N = 11; MIN: 7.02 / MAX: 19.19)

PyTorch 2.1 - Device: CPU - Batch Size: 64 - Model: ResNet-152 (batches/sec, More Is Better)
  Optimized Power Mode: 12.76 (SE +/- 0.12, N = 12; MIN: 5.75 / MAX: 15.71)
  Default: 17.67 (SE +/- 0.23, N = 3; MIN: 10.89 / MAX: 18.38)
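Each result above is reported with a standard error (SE) and a run count N. As a sketch of how such a figure is commonly derived, the snippet below computes the standard error of the mean as the sample standard deviation divided by the square root of N. Whether the Phoronix Test Suite uses exactly this estimator is an assumption, and the sample values are hypothetical, not taken from this result file.

```python
import math
import statistics


def standard_error(samples: list[float]) -> float:
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("at least two samples are needed")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical batches/sec readings from three runs (illustration only).
    runs = [31.5, 31.6, 31.7]
    mean = statistics.mean(runs)
    print(f"{mean:.2f} (SE +/- {standard_error(runs):.2f}, N = {len(runs)})")
```

A small SE relative to the gap between two runs suggests the difference is unlikely to be run-to-run noise alone.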

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile uses the wrk program to issue HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

nginx 1.23.2 - Connections: 500 (Requests Per Second, More Is Better)
  Optimized Power Mode: 202507.42 (SE +/- 10.52, N = 3)
  Default: 273684.71 (SE +/- 25.30, N = 3)
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

PyTorch

PyTorch 2.1 - Device: CPU - Batch Size: 64 - Model: ResNet-50 (batches/sec, More Is Better)
  Optimized Power Mode: 32.83 (SE +/- 0.39, N = 3; MIN: 15.85 / MAX: 36.48)
  Default: 43.91 (SE +/- 0.55, N = 4; MIN: 17.29 / MAX: 46.23)

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 10 - Input: motorBike - Execution Time (Seconds, Fewer Is Better)
  Optimized Power Mode: 5.50897
  Default: 4.13288
  1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

nginx

nginx 1.23.2 - Connections: 1000 (Requests Per Second, More Is Better)
  Optimized Power Mode: 184861.99 (SE +/- 72.63, N = 2)
  Default: 243384.19 (SE +/- 397.04, N = 3)
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 7.0.4 - Test: GET - Parallel Connections: 50 (Requests Per Second, More Is Better)
  Optimized Power Mode: 3811819.50 (SE +/- 1281.58, N = 3)
  Default: 4744838.90 (SE +/- 8167.04, N = 4)
  1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Zstd Compression

This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.4 - Compression Level: 12 - Compression Speed (MB/s, More Is Better)
  Optimized Power Mode: 272.2 (SE +/- 2.98, N = 5)
  Default: 333.6 (SE +/- 3.46, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz -llzma
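The compression speed above is a throughput figure: input bytes processed per unit of wall-clock time. The sketch below illustrates that calculation only. It swaps in Python's standard-library zlib for Zstd, since no Zstd binding is assumed to be installed, and it assumes 1 MB = 1,000,000 bytes. The function names are hypothetical and the resulting numbers are not comparable to the results in this file.

```python
import time
import zlib


def throughput_mb_per_s(num_bytes: int, seconds: float) -> float:
    """Convert a byte count and an elapsed time into MB/s (1 MB = 10**6 bytes)."""
    if seconds <= 0:
        raise ValueError("elapsed time must be positive")
    return num_bytes / 1_000_000 / seconds


def measure_compression_speed(data: bytes, level: int = 6) -> float:
    """Time one zlib compression pass (a stand-in for Zstd) and return MB/s."""
    start = time.perf_counter()
    zlib.compress(data, level)
    # Guard against a zero reading on very small inputs.
    elapsed = max(time.perf_counter() - start, 1e-9)
    return throughput_mb_per_s(len(data), elapsed)


if __name__ == "__main__":
    sample = b"silesia-like sample text " * 200_000  # synthetic input, about 5 MB
    print(f"{measure_compression_speed(sample):.1f} MB/s")
```

A real measurement would repeat the pass several times and report the mean together with its spread, as the result above does.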

PyTorch

PyTorch 2.1 - Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_l (batches/sec, More Is Better)
  Optimized Power Mode: 2.18 (SE +/- 0.02, N = 3; MIN: 0.98 / MAX: 3.41)
  Default: 2.62 (SE +/- 0.02, N = 3; MIN: 1.32 / MAX: 3.9)

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.21 - Variant: GhostRider - Hash Count: 1M (H/s, More Is Better)
  Optimized Power Mode: 13832.4 (SE +/- 22.03, N = 3)
  Default: 16522.5 (SE +/- 67.39, N = 3)
  1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

VVenC 1.9 - Video Input: Bosphorus 4K - Video Preset: Faster (Frames Per Second, More Is Better)
  Optimized Power Mode: 9.096 (SE +/- 0.072, N = 3)
  Default: 10.836 (SE +/- 0.081, N = 3)
  1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
  Optimized Power Mode: 50.36 (SE +/- 0.21, N = 4)
  Default: 59.59 (SE +/- 0.37, N = 5)

PyTorch

PyTorch 2.1 - Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_l (batches/sec, More Is Better)
  Optimized Power Mode: 2.19 (SE +/- 0.02, N = 3; MIN: 0.98 / MAX: 3.3)
  Default: 2.59 (SE +/- 0.00, N = 3; MIN: 1.19 / MAX: 4)

Zstd Compression

Zstd Compression 1.5.4 - Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  Optimized Power Mode: 16.2 (SE +/- 0.19, N = 3)
  Default: 19.1 (SE +/- 0.06, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz -llzma

Apache Spark TPC-H

This is a benchmark of Apache Spark using the TPC-H data set. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks Apache Spark in a single-system configuration using spark-submit. The test makes use of https://github.com/ssavvides/tpch-spark/ for facilitating the TPC-H benchmark. Learn more via the OpenBenchmarking.org test page.

Apache Spark TPC-H 3.5 - Scale Factor: 10 - Geometric Mean Of All Queries (Seconds, Fewer Is Better)
  Optimized Power Mode: 9.77304127 (SE +/- 0.09145601, N = 7; MIN: 4.46 / MAX: 32.81)
  Default: 8.35919683 (SE +/- 0.07178690, N = 3; MIN: 4.15 / MAX: 27.27)
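The Apache Spark TPC-H result above is a geometric mean across the individual query times. A geometric mean is the n-th root of the product of n values, so one slow query pulls the summary up less than it would an arithmetic mean. The sketch below shows the calculation; the query times in it are hypothetical and are not the Q01 to Q22 results from this file.

```python
import math


def geometric_mean(values: list[float]) -> float:
    """Geometric mean computed via logarithms to avoid overflow on long lists."""
    if not values:
        raise ValueError("at least one value is needed")
    if any(v <= 0 for v in values):
        raise ValueError("all values must be positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    # Hypothetical per-query run times in seconds (illustration only).
    query_seconds = [4.5, 6.0, 9.0, 12.0, 30.0]
    arithmetic = sum(query_seconds) / len(query_seconds)
    print(f"geometric mean:  {geometric_mean(query_seconds):.2f} s")
    print(f"arithmetic mean: {arithmetic:.2f} s")
```

The standard library also provides statistics.geometric_mean, which could replace the helper here.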

easyWave

The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.

easyWave r34 - Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 (Seconds, Fewer Is Better)
  Optimized Power Mode: 42.56 (SE +/- 0.41, N = 15)
  Default: 36.55 (SE +/- 0.28, N = 3)
  1. (CXX) g++ options: -O3 -fopenmp

Zstd Compression

Zstd Compression 1.5.4 - Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  Optimized Power Mode: 8.56 (SE +/- 0.01, N = 3)
  Default: 9.90 (SE +/- 0.02, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz -llzma

SVT-AV1

SVT-AV1 1.8 - Encoder Mode: Preset 12 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Optimized Power Mode: 439.48 (SE +/- 4.01, N = 15)
  Default: 502.71 (SE +/- 3.53, N = 9)
  1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

DuckDB

DuckDB is an in-progress SQL OLAP database management system optimized for analytics and features a vectorized and parallel engine. Learn more via the OpenBenchmarking.org test page.

DuckDB 0.9.1 - Benchmark: IMDB (Seconds, Fewer Is Better)
  Optimized Power Mode: 139.33 (SE +/- 0.43, N = 3)
  Default: 121.97 (SE +/- 0.49, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

uvg266

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
  Optimized Power Mode: 50.68 (SE +/- 0.48, N = 4)
  Default: 57.87 (SE +/- 0.15, N = 5)

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
  Optimized Power Mode: 49.10 (SE +/- 0.45, N = 15)
  Default: 56.06 (SE +/- 0.07, N = 5)

Apache Hadoop

This is a benchmark of Apache Hadoop using its built-in name-node throughput benchmark (NNThroughputBenchmark). Learn more via the OpenBenchmarking.org test page.

Apache Hadoop 3.3.6 - Operation: Create - Threads: 100 - Files: 100000 (Ops per sec, More Is Better)
  Optimized Power Mode: 4814 (SE +/- 62.35, N = 3)
  Default: 5490 (SE +/- 32.35, N = 3)

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile uses the wrk program to issue HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server 2.4.56 - Concurrent Requests: 500 (Requests Per Second, More Is Better)
  Optimized Power Mode: 80817.73 (SE +/- 184.86, N = 3)
  Default: 90823.06 (SE +/- 265.12, N = 3)
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Zstd Compression

Zstd Compression 1.5.4 - Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
  Optimized Power Mode: 1044.6 (SE +/- 2.93, N = 3)
  Default: 1173.5 (SE +/- 1.19, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz -llzma

SVT-AV1

SVT-AV1 1.8 - Encoder Mode: Preset 8 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
  Optimized Power Mode: 126.70 (SE +/- 0.77, N = 6)
  Default: 142.27 (SE +/- 0.36, N = 7)
  1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1 1.8 - Encoder Mode: Preset 8 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
  Optimized Power Mode: 61.91 (SE +/- 0.56, N = 4)
  Default: 69.47 (SE +/- 0.39, N = 4)
  1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile uses a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/], a benchmark for video-as-a-service workloads. The test profile offers a range of vbench scenarios based on freely distributable video content and a choice of the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

FFmpeg 6.1 - Encoder: libx265 - Scenario: Live (FPS, More Is Better)
  Optimized Power Mode: 118.24 (SE +/- 0.79, N = 3)
  Default: 131.81 (SE +/- 1.03, N = 3)
  1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

VVenC

VVenC 1.9 - Video Input: Bosphorus 1080p - Video Preset: Faster (Frames Per Second, More Is Better)
  Optimized Power Mode: 28.93 (SE +/- 0.30, N = 4)
  Default: 32.09 (SE +/- 0.36, N = 3)
  1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Zstd Compression

Zstd Compression 1.5.4 - Compression Level: 12 - Decompression Speed (MB/s, More Is Better)
  Optimized Power Mode: 1282.7 (SE +/- 1.74, N = 5)
  Default: 1422.7 (SE +/- 1.98, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz -llzma

Y-Cruncher

Y-Cruncher 0.8.2 - Pi Digits To Calculate: 1B (Seconds, Fewer Is Better)
  Optimized Power Mode: 4.697 (SE +/- 0.009, N = 5)
  Default: 5.199 (SE +/- 0.018, N = 5)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Optimized Power Mode: 1293333333 (SE +/- 16392511.84, N = 3)
  Default: 1429100000 (SE +/- 9950041.88, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

VVenC

VVenC 1.9 - Video Input: Bosphorus 1080p - Video Preset: Fast (Frames Per Second, More Is Better)
  Optimized Power Mode: 18.05 (SE +/- 0.11, N = 3)
  Default: 19.92 (SE +/- 0.15, N = 3)
  1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Liquid-DSP

Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  Optimized Power Mode: 2556573333 (SE +/- 20734086.19, N = 15)
  Default: 2810166667 (SE +/- 24169149.30, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

SVT-AV1

SVT-AV1 1.8 - Encoder Mode: Preset 12 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Optimized Power Mode: 198.74 (SE +/- 2.06, N = 5)
Default: 218.45 (SE +/- 0.90, N = 6)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1 1.8 - Encoder Mode: Preset 4 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Optimized Power Mode: 6.645 (SE +/- 0.006, N = 3)
Default: 7.297 (SE +/- 0.098, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1 1.8 - Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Optimized Power Mode: 199.55 (SE +/- 1.56, N = 9)
Default: 218.76 (SE +/- 1.12, N = 5)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1 1.8 - Encoder Mode: Preset 13 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Optimized Power Mode: 574.60 (SE +/- 3.16, N = 10)
Default: 628.76 (SE +/- 4.69, N = 11)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
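The four SVT-AV1 results above can be condensed into one figure by taking the geometric mean of the per-test ratios. The following is a minimal sketch, assuming the first value in each pair belongs to Optimized Power Mode and the second to Default (the legend order). The variable names are illustrative, and this is not necessarily how OpenBenchmarking.org computes its own overall mean.

```python
from statistics import geometric_mean

# (Optimized Power Mode, Default) in frames per second, More Is Better.
# The pairing of values to runs is an assumption based on the legend order.
svt_av1_fps = {
    "Preset 12 - Bosphorus 4K": (198.74, 218.45),
    "Preset 4 - Bosphorus 4K": (6.645, 7.297),
    "Preset 13 - Bosphorus 4K": (199.55, 218.76),
    "Preset 13 - Bosphorus 1080p": (574.60, 628.76),
}

# Ratio below 1.0 means Optimized Power Mode was slower than Default.
ratios = [opm / default for opm, default in svt_av1_fps.values()]
svt_av1_geomean = geometric_mean(ratios)

print(f"SVT-AV1 Optimized Power Mode / Default: {svt_av1_geomean:.3f}")
```

A geometric mean is used rather than an arithmetic one so that tests with very different absolute frame rates contribute equally.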

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2023.09.15 - Model: Rubber O-Ring Seal Installation (Seconds, Fewer Is Better)
Optimized Power Mode: 89.37 (SE +/- 0.13, N = 3)
Default: 81.69 (SE +/- 0.03, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 70.87 (SE +/- 0.70, N = 6)
Default: 77.53 (SE +/- 0.23, N = 6)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 147.08 (SE +/- 1.07, N = 11)
Default: 160.63 (SE +/- 0.74, N = 8)

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

VVenC 1.9 - Video Input: Bosphorus 4K - Video Preset: Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 6.340 (SE +/- 0.066, N = 3)
Default: 6.921 (SE +/- 0.067, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 153.17 (SE +/- 1.12, N = 15)
Default: 166.47 (SE +/- 1.12, N = 9)

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Super Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 145.75 (SE +/- 1.05, N = 15)
Default: 158.23 (SE +/- 1.01, N = 15)

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 10 - Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better)
Optimized Power Mode: 27.71
Default: 30.04
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

Zstd Compression

This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.4 - Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
Optimized Power Mode: 1094.2 (SE +/- 2.17, N = 3)
Default: 1183.8 (SE +/- 16.60, N = 3)
1. (CC) gcc options: -O3 -pthread -lz -llzma

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Optimized Power Mode: 502.05 (SE +/- 3.72, N = 8)
Default: 542.17 (SE +/- 4.34, N = 9)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Timed GCC Compilation

This test times how long it takes to build the GNU Compiler Collection (GCC) open-source compiler. Learn more via the OpenBenchmarking.org test page.

Timed GCC Compilation 13.2 - Time To Compile (Seconds, Fewer Is Better)
Optimized Power Mode: 759.22 (SE +/- 4.73, N = 3)
Default: 706.26 (SE +/- 2.37, N = 3)
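Because this result file mixes "More Is Better" and "Fewer Is Better" metrics, comparing runs on one scale needs the direction of each metric taken into account. The following is a minimal sketch, assuming the first value listed belongs to Optimized Power Mode and the second to Default (the legend order). The helper name `relative_change` is illustrative and not part of the Phoronix Test Suite.

```python
def relative_change(candidate: float, baseline: float, higher_is_better: bool) -> float:
    """Return the candidate's performance change versus the baseline, in percent.

    A positive result means the candidate performed better than the baseline,
    a negative result means it performed worse.
    """
    if higher_is_better:
        return (candidate / baseline - 1.0) * 100.0
    # For time-like metrics a smaller value is the better result,
    # so the ratio is inverted.
    return (baseline / candidate - 1.0) * 100.0


# Timed GCC Compilation 13.2 (Seconds, Fewer Is Better).
# Assumed pairing: 759.22 = Optimized Power Mode, 706.26 = Default.
gcc_change = relative_change(759.22, 706.26, higher_is_better=False)
print(f"Timed GCC Compilation, Optimized Power Mode vs. Default: {gcc_change:+.2f}%")
```

The same helper applies to throughput results such as frames per second by passing `higher_is_better=True`.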

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
Optimized Power Mode: 16.75 (SE +/- 0.06, N = 3)
Default: 15.63 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write (TPS, More Is Better)
Optimized Power Mode: 59695 (SE +/- 214.64, N = 3)
Default: 63962 (SE +/- 70.87, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
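The two PostgreSQL results above describe the same runs from two angles, so they can be cross-checked against each other. The following is a minimal sketch, assuming each of the 1000 clients keeps exactly one transaction in flight at a time, in which case throughput is approximately the client count divided by the average latency. The function name `tps_from_latency` is illustrative, and the pairing of values to runs follows the legend order.

```python
def tps_from_latency(clients: int, avg_latency_ms: float) -> float:
    """Estimate transactions per second for a closed loop of `clients`,
    each waiting `avg_latency_ms` milliseconds per transaction."""
    return clients / (avg_latency_ms / 1000.0)


# run name -> (reported average latency in ms, reported TPS)
pgbench_results = {
    "Optimized Power Mode": (16.75, 59695),
    "Default": (15.63, 63962),
}

for run, (latency_ms, reported_tps) in pgbench_results.items():
    estimated = tps_from_latency(1000, latency_ms)
    gap_percent = (estimated - reported_tps) / reported_tps * 100.0
    print(f"{run}: estimated {estimated:.0f} TPS, reported {reported_tps} TPS ({gap_percent:+.2f}%)")
```

A close match between the estimate and the reported figure suggests the latency and TPS values were read from the same run and split correctly.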

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Optimized Power Mode: 433.53 (SE +/- 2.86, N = 8)
Default: 463.27 (SE +/- 1.93, N = 8)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 263.96 (SE +/- 1.93, N = 10)
Default: 282.05 (SE +/- 2.07, N = 11)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 64.97 (SE +/- 0.44, N = 5)
Default: 69.36 (SE +/- 0.26, N = 5)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SVT-AV1

SVT-AV1 1.8 - Encoder Mode: Preset 4 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Optimized Power Mode: 19.67 (SE +/- 0.11, N = 5)
Default: 20.95 (SE +/- 0.16, N = 5)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server 2.4.56 - Concurrent Requests: 1000 (Requests Per Second, More Is Better)
Optimized Power Mode: 76315.65
Default: 80826.71
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 16.0 - Build System: Unix Makefiles (Seconds, Fewer Is Better)
Optimized Power Mode: 187.84 (SE +/- 0.58, N = 3)
Default: 177.43 (SE +/- 0.30, N = 3)

DuckDB

DuckDB is an in-progress SQL OLAP database management system optimized for analytics and features a vectorized and parallel engine. Learn more via the OpenBenchmarking.org test page.

DuckDB 0.9.1 - Benchmark: TPC-H Parquet (Seconds, Fewer Is Better)
Optimized Power Mode: 157.10 (SE +/- 0.38, N = 3)
Default: 148.55 (SE +/- 0.31, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 67.21 (SE +/- 0.12, N = 5)
Default: 71.05 (SE +/- 0.25, N = 5)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01 - Test: Compression Rating (MIPS, More Is Better)
Optimized Power Mode: 654459 (SE +/- 1440.87, N = 3)
Default: 689516 (SE +/- 869.44, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

FFmpeg 6.1 - Encoder: libx265 - Scenario: Platform (FPS, More Is Better)
Optimized Power Mode: 51.31 (SE +/- 0.14, N = 3)
Default: 53.98 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Memcached

Memcached is a high-performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

Memcached 1.6.19 - Set To Get Ratio: 1:100 (Ops/sec, More Is Better)
Optimized Power Mode: 3274425.32 (SE +/- 42725.42, N = 3)
Default: 3441439.71 (SE +/- 25091.90, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 1080p - Video Preset: Super Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 251.33 (SE +/- 1.69, N = 10)
Default: 263.88 (SE +/- 1.09, N = 10)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 128 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
Optimized Power Mode: 4125000000 (SE +/- 34188302.09, N = 3)
Default: 4329866667 (SE +/- 11478143.48, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better)
Optimized Power Mode: 256.19 (SE +/- 1.04, N = 10)
Default: 268.49 (SE +/- 1.16, N = 10)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 7.0.4 - Test: SET - Parallel Connections: 500 (Requests Per Second, More Is Better)
Optimized Power Mode: 2389636.67 (SE +/- 19286.75, N = 15)
Default: 2503177.67 (SE +/- 7546.74, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 10738.14 (SE +/- 12.41, N = 3)
Default: 11244.79 (SE +/- 7.42, N = 3)

Neural Magic DeepSparse 1.6 - Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 5.9385 (SE +/- 0.0065, N = 3)
Default: 5.6732 (SE +/- 0.0033, N = 3)

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 4.0 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
Optimized Power Mode: 33.40 (SE +/- 0.07, N = 3)
Default: 34.94 (SE +/- 0.33, N = 3)

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

FFmpeg 6.1 - Encoder: libx265 - Scenario: Video On Demand (FPS, More Is Better)
Optimized Power Mode: 51.45 (SE +/- 0.10, N = 3)
Default: 53.63 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Intel Open Image Denoise

Intel Open Image Denoise 2.1 - Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only (Images / Sec, More Is Better)
Optimized Power Mode: 4.72 (SE +/- 0.01, N = 6)
Default: 4.53 (SE +/- 0.01, N = 6)

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

FFmpeg 6.1 - Encoder: libx265 - Scenario: Upload (FPS, More Is Better)
Optimized Power Mode: 26.02 (SE +/- 0.06, N = 3)
Default: 27.00 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 128 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
Optimized Power Mode: 3745233333 (SE +/- 3883440.63, N = 3)
Default: 3611866667 (SE +/- 1386041.53, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 117369.55 (SE +/- 1361.46, N = 3)
Default: 121577.55 (SE +/- 775.18, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 32 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
Optimized Power Mode: 491426667 (SE +/- 3588947.54, N = 3)
Default: 508245000 (SE +/- 5123258.57, N = 6)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP 1.6 - Threads: 256 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
Optimized Power Mode: 5656600000 (SE +/- 19835069.95, N = 3)
Default: 5835500000 (SE +/- 11767044.38, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 6.1 - Build: defconfig (Seconds, Fewer Is Better)
Optimized Power Mode: 24.48 (SE +/- 0.18, N = 11)
Default: 23.76 (SE +/- 0.21, N = 8)

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 7.0.4 - Test: GET - Parallel Connections: 500 (Requests Per Second, More Is Better)
Optimized Power Mode: 3152359.75 (SE +/- 36270.68, N = 3)
Default: 3240001.58 (SE +/- 482.98, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
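With results this close together, it helps to compare the gap between the two runs against the reported standard errors. The following is a rough sketch, assuming the two runs are independent so their standard errors combine in quadrature. The function name `difference_in_standard_errors` is illustrative, and with only N = 3 samples per run the figure is a heuristic rather than a formal significance test.

```python
import math


def difference_in_standard_errors(a: float, se_a: float, b: float, se_b: float) -> float:
    """Return |a - b| expressed in units of the combined standard error.

    Assumes the two measurements are independent, so the combined
    standard error is sqrt(se_a**2 + se_b**2).
    """
    combined_se = math.hypot(se_a, se_b)
    return abs(a - b) / combined_se


# Redis 7.0.4, GET, 500 parallel connections (Requests Per Second).
# Assumed pairing: first value = Optimized Power Mode, second = Default.
redis_get_gap = difference_in_standard_errors(3152359.75, 36270.68, 3240001.58, 482.98)
print(f"Redis GET gap: {redis_get_gap:.2f} combined standard errors")
```

A small ratio indicates the two runs are hard to tell apart given the run-to-run variation, which is the situation the "Do Not Show Noisy Results" view option is meant to address.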

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 128 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
Optimized Power Mode: 1513466667 (SE +/- 13505101.92, N = 3)
Default: 1473900000 (SE +/- 4762352.36, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 0.41 (SE +/- 0.01, N = 3; MIN: 0.19 / MAX: 15.84)
Default: 0.40 (SE +/- 0.00, N = 3; MIN: 0.18 / MAX: 14.47)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2023.09.15 - Model: Bumper Beam (Seconds, Fewer Is Better)
Optimized Power Mode: 82.73 (SE +/- 0.13, N = 3)
Default: 84.78 (SE +/- 0.18, N = 3)

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 16.0 - Build System: Ninja (Seconds, Fewer Is Better)
Optimized Power Mode: 98.91 (SE +/- 0.20, N = 3)
Default: 96.52 (SE +/- 0.29, N = 3)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Medium (Frames Per Second, More Is Better)
Optimized Power Mode: 86.78 (SE +/- 0.25, N = 6)
Default: 88.46 (SE +/- 0.08, N = 6)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 256 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
Optimized Power Mode: 6301483333 (SE +/- 61606333.10, N = 6)
Default: 6185600000 (SE +/- 7150058.27, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

QMCPACK 3.17.1 - Input: Li2_STO_ae (Total Execution Time - Seconds, Fewer Is Better)
Optimized Power Mode: 92.67 (SE +/- 1.02, N = 5)
Default: 94.38 (SE +/- 0.40, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
Optimized Power Mode: 26.71 (SE +/- 0.08, N = 3)
Default: 27.20 (SE +/- 0.08, N = 3)

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 4.0 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
Optimized Power Mode: 12.67 (SE +/- 0.04, N = 4)
Default: 12.89 (SE +/- 0.03, N = 4)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 1080p - Video Preset: Slow (Frames Per Second, More Is Better)
Optimized Power Mode: 78.93 (SE +/- 0.19, N = 6)
Default: 80.30 (SE +/- 0.10, N = 6)

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.21 - Variant: KawPow - Hash Count: 1M (H/s, More Is Better)
Optimized Power Mode: 69196.7 (SE +/- 943.50, N = 3)
Default: 70383.2 (SE +/- 82.14, N = 4)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 1080p - Video Preset: Slow (Frames Per Second, More Is Better)
Optimized Power Mode: 125.18 (SE +/- 0.35, N = 8)
Default: 127.25 (SE +/- 0.38, N = 8)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2023.09.15 - Model: Cell Phone Drop Test (Seconds, Fewer Is Better)
Optimized Power Mode: 25.37 (SE +/- 0.12, N = 3)
Default: 25.74 (SE +/- 0.14, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 49494.06 (SE +/- 229.19, N = 3)
Default: 48795.97 (SE +/- 490.25, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 162.59 (SE +/- 0.33, N = 3)
Default: 160.41 (SE +/- 1.36, N = 3)

Neural Magic DeepSparse 1.6 - Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 392.45 (SE +/- 0.71, N = 3)
Default: 397.77 (SE +/- 3.53, N = 3)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
Optimized Power Mode: 999663333 (SE +/- 5205466.14, N = 3)
Default: 1012336667 (SE +/- 10474795.68, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 2.42 (SE +/- 0.00, N = 3; MIN: 1.96 / MAX: 28.3)
Default: 2.45 (SE +/- 0.01, N = 3; MIN: 2.03 / MAX: 23.83)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 346.74 (SE +/- 0.05, N = 3)
Default: 350.90 (SE +/- 0.69, N = 3)

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 6.1 - Build: allmodconfig (Seconds, Fewer Is Better)
Optimized Power Mode: 151.28 (SE +/- 0.48, N = 3)
Default: 149.79 (SE +/- 0.51, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 38.79 (SE +/- 0.08, N = 3; MIN: 36.26 / MAX: 60.12)
Default: 38.41 (SE +/- 0.08, N = 3; MIN: 35.95 / MAX: 58.71)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO 2023.2.dev - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 3297.81 (SE +/- 6.73, N = 3)
Default: 3330.19 (SE +/- 7.06, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.21 - Variant: Wownero - Hash Count: 1M (H/s, More Is Better)
Optimized Power Mode: 75701.7 (SE +/- 24.14, N = 4)
Default: 75002.3 (SE +/- 677.05, N = 4)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 183.71 (SE +/- 0.02, N = 3)
Default: 182.16 (SE +/- 0.43, N = 3)

Neural Magic DeepSparse 1.6 - Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 1776.53 (SE +/- 3.31, N = 3)
Default: 1791.27 (SE +/- 7.80, N = 3)

Neural Magic DeepSparse 1.6 - Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 1883.66 (SE +/- 0.88, N = 3)
Default: 1899.22 (SE +/- 1.67, N = 3)

Neural Magic DeepSparse 1.6 - Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 35.96 (SE +/- 0.07, N = 3)
Default: 35.67 (SE +/- 0.16, N = 3)

Neural Magic DeepSparse 1.6 - Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 33.90 (SE +/- 0.01, N = 3)
Default: 33.64 (SE +/- 0.03, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
Optimized Power Mode: 41.13 (SE +/- 0.06, N = 4)
Default: 41.45 (SE +/- 0.06, N = 4)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.32 - Configuration: Multi-Threaded (MFLOPS, More Is Better)
Optimized Power Mode: 391468.4 (SE +/- 52.43, N = 3)
Default: 388447.2 (SE +/- 127.57, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
Optimized Power Mode: 41.69 (SE +/- 0.05, N = 4)
Default: 41.99 (SE +/- 0.06, N = 4)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2023.09.15 - Model: INIVOL and Fluid Structure Interaction Drop Container (Seconds, Fewer Is Better)
Optimized Power Mode: 98.49 (SE +/- 0.08, N = 3)
Default: 97.87 (SE +/- 0.07, N = 3)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
Optimized Power Mode: 2460866667 (SE +/- 13974301.81, N = 3)
Default: 2476033333 (SE +/- 6835284.27, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP 1.6 - Threads: 32 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
Optimized Power Mode: 1204500000 (SE +/- 1193035.34, N = 3)
Default: 1211766667 (SE +/- 1273228.62, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 138.15 (SE +/- 0.30, N = 3)
Default: 137.35 (SE +/- 0.09, N = 3)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
Optimized Power Mode: 29.62 (SE +/- 0.19, N = 3)
Default: 29.79 (SE +/- 0.08, N = 3)

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2023.09.15 - Model: Chrysler Neon 1M (Seconds, Fewer Is Better)
Optimized Power Mode: 86.30 (SE +/- 0.17, N = 3)
Default: 85.82 (SE +/- 0.14, N = 3)

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 36.13 (SE +/- 0.14, N = 3)
Default: 35.93 (SE +/- 0.08, N = 3)

Neural Magic DeepSparse 1.6 - Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 1768.91 (SE +/- 6.86, N = 3)
Default: 1778.59 (SE +/- 3.42, N = 3)

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 10 - Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better)
Optimized Power Mode: 23.35
Default: 23.47
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Face Detection Retail FP16-INT8 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 24922.40 (SE +/- 6.86, N = 3)
Default: 24795.17 (SE +/- 42.09, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO 2023.2.dev - Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 1100.76 (SE +/- 3.05, N = 3)
Default: 1106.32 (SE +/- 9.12, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 460.69 (SE +/- 0.70, N = 3)
Default: 462.99 (SE +/- 0.03, N = 3)

Neural Magic DeepSparse 1.6 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 138.01 (SE +/- 0.12, N = 3)
Default: 137.34 (SE +/- 0.21, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 28.99 (SE +/- 0.08, N = 3; MIN: 22.06 / MAX: 222.83)
Default: 28.85 (SE +/- 0.23, N = 3; MIN: 21.11 / MAX: 222.12)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2023.09.15 - Model: Bird Strike on Windshield (Seconds, Fewer Is Better)
Optimized Power Mode: 111.73 (SE +/- 0.28, N = 3)
Default: 111.23 (SE +/- 0.65, N = 3)

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 461.15 (SE +/- 0.42, N = 3)
Default: 463.09 (SE +/- 0.27, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 5.13 (SE +/- 0.00, N = 3; MIN: 4.64 / MAX: 29.56)
Default: 5.15 (SE +/- 0.01, N = 3; MIN: 4.63 / MAX: 27.9)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 1080p - Video Preset: Medium (Frames Per Second, More Is Better)
Optimized Power Mode: 130.64 (SE +/- 0.58, N = 8)
Default: 131.14 (SE +/- 0.46, N = 8)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 10168.06 (SE +/- 6.61, N = 3)
Default: 10205.61 (SE +/- 7.22, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.21 - Variant: CryptoNight-Heavy - Hash Count: 1M (H/s, More Is Better)
Optimized Power Mode: 70338.6 (SE +/- 96.69, N = 3)
Default: 70573.0 (SE +/- 49.63, N = 4)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 12.55 (SE +/- 0.01, N = 3; MIN: 10.77 / MAX: 42.91)
Default: 12.51 (SE +/- 0.01, N = 3; MIN: 10.84 / MAX: 42.36)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Redis 7.0.4 - Test: SET - Parallel Connections: 50 (Requests Per Second, More Is Better)
Optimized Power Mode: 3050498.00 (SE +/- 1584.15, N = 3)
Default: 3060186.42 (SE +/- 12726.60, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Person Detection FP16 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 750.43 (SE +/- 0.90, N = 3)
Default: 748.10 (SE +/- 0.71, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO 2023.2.dev - Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 42.59 (SE +/- 0.05, N = 3; MIN: 32.12 / MAX: 128.96)
Default: 42.72 (SE +/- 0.04, N = 3; MIN: 31.92 / MAX: 84.1)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO 2023.2.dev - Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 237.36 (SE +/- 0.55, N = 3; MIN: 160.6 / MAX: 286.17)
Default: 236.70 (SE +/- 0.35, N = 3; MIN: 157.14 / MAX: 268.19)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO 2023.2.dev - Model: Face Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 538.20 (SE +/- 1.35, N = 3)
Default: 539.69 (SE +/- 0.65, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 13.91 (SE +/- 0.01, N = 3)
Default: 13.95 (SE +/- 0.02, N = 3)

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.32 - Configuration: Single-Threaded (MFLOPS, More Is Better)
Optimized Power Mode: 3641.4 (SE +/- 2.92, N = 3)
Default: 3650.5 (SE +/- 1.44, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Xmrig 6.21 - Variant: CryptoNight-Femto UPX2 - Hash Count: 1M (H/s, More Is Better)
Optimized Power Mode: 69847.9 (SE +/- 443.65, N = 3)
Default: 69677.8 (SE +/- 844.15, N = 4)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig 6.21 - Variant: Monero - Hash Count: 1M (H/s, More Is Better)
Optimized Power Mode: 70298.8 (SE +/- 41.44, N = 3)
Default: 70469.8 (SE +/- 27.69, N = 4)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 4590.34 (SE +/- 2.64, N = 3)
Default: 4580.63 (SE +/- 6.99, N = 3)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 256 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
Optimized Power Mode: 2141000000 (SE +/- 1101514.11, N = 3)
Default: 2144566667 (SE +/- 2968913.01, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 51.62 (SE +/- 0.07, N = 3)
Default: 51.70 (SE +/- 0.05, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Vehicle Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 8943.88 (SE +/- 4.32, N = 3)
Default: 8930.00 (SE +/- 5.30, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 863.35 (SE +/- 0.10, N = 3)
Default: 862.14 (SE +/- 0.30, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 14.30 (SE +/- 0.01, N = 3; MIN: 12.18 / MAX: 47.3)
Default: 14.32 (SE +/- 0.01, N = 3; MIN: 12.39 / MAX: 39.16)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 1237.03 (SE +/- 2.51, N = 3)
Default: 1235.41 (SE +/- 1.85, N = 3)

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 22.01 - Test: Decompression Rating (MIPS, More Is Better)
Optimized Power Mode: 638136 (SE +/- 9574.07, N = 3)
Default: 637311 (SE +/- 4040.78, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better)
Optimized Power Mode: 53.45 (SE +/- 0.16, N = 3; MIN: 45.01 / MAX: 111.52)
Default: 53.51 (SE +/- 0.08, N = 3; MIN: 43.94 / MAX: 101.68)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 73.99 (SE +/- 0.03, N = 3)
Default: 74.07 (SE +/- 0.05, N = 3)

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenVINO 2023.2.dev - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (FPS, More Is Better)
Optimized Power Mode: 2392.38 (SE +/- 7.16, N = 3)
Default: 2390.25 (SE +/- 3.42, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.6 - Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
Optimized Power Mode: 76.50 (SE +/- 0.04, N = 3)
Default: 76.54 (SE +/- 0.04, N = 3)

Neural Magic DeepSparse 1.6 - Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
Optimized Power Mode: 834.27 (SE +/- 0.60, N = 3)
Default: 834.18 (SE +/- 0.97, N = 3)

CPU Power Consumption Monitor

CPU Power Consumption Monitor - Phoronix Test Suite System Monitoring (Watts)
Optimized Power Mode: Min: 88.73 / Avg: 366.23 / Max: 802.53
Default: Min: 101.15 / Avg: 445.93 / Max: 802.3
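
As a rough way to read the overall power figures, the average CPU power of the two runs can be turned into an absolute and relative saving. This is a minimal sketch using only the averages reported above; the function and variable names are hypothetical, and it ignores the fact that the two runs did not take the same amount of time.

```python
def percent_reduction(baseline: float, value: float) -> float:
    """How much lower `value` is than `baseline`, in percent."""
    return (baseline - value) / baseline * 100.0


# Average CPU power over the whole benchmark run, in Watts.
default_avg_w = 445.93
optimized_avg_w = 366.23

saving_w = default_avg_w - optimized_avg_w
saving_pct = percent_reduction(default_avg_w, optimized_avg_w)

print(f"Optimized Power Mode averaged {saving_w:.2f} W less ({saving_pct:.1f}% lower)")
```

Because the run durations differ, average power alone does not say how total energy compares between the two configurations.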

CPU Peak Freq (Highest CPU Core Frequency) Monitor

CPU Peak Freq (Highest CPU Core Frequency) Monitor - Phoronix Test Suite System Monitoring (Megahertz)
Optimized Power Mode: Min: 800 / Avg: 3437 / Max: 5154
Default: Min: 500 / Avg: 3374.53 / Max: 5474

Blender

Blender 4.0 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Optimized Power Mode: Min: 179 / Avg: 654 / Max: 760
Default: Min: 207 / Avg: 601 / Max: 760

Blender 4.0 - CPU Peak Freq (Highest CPU Core Frequency) Monitor (Megahertz, More Is Better)
Optimized Power Mode: Min: 800 / Avg: 2770 / Max: 3909
Default: Min: 500 / Avg: 2659 / Max: 3908

Blender 4.0 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
Optimized Power Mode: 129.94 (SE +/- 0.11, N = 3)
Default: 149.49 (SE +/- 6.68, N = 9)

PostgreSQL

PostgreSQL 16 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Optimized Power Mode: Min: 94.15 / Avg: 458.57 / Max: 678.78
Default: Min: 199.24 / Avg: 508.51 / Max: 759.36

PostgreSQL 16 - CPU Peak Freq (Highest CPU Core Frequency) Monitor (Megahertz, More Is Better)
Optimized Power Mode: Min: 800 / Avg: 2988.27 / Max: 3921
Default: Min: 800 / Avg: 3000.4 / Max: 3925

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
Optimized Power Mode: 1.141 (SE +/- 0.016, N = 12)
Default: 1.124 (SE +/- 0.022, N = 10)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only (TPS, More Is Better)
Optimized Power Mode: 878468 (SE +/- 11877.48, N = 12)
Default: 893089 (SE +/- 17460.42, N = 10)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Memcached

Memcached 1.6.19 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Optimized Power Mode: Min: 94.3 / Avg: 422.8 / Max: 504.5
Default: Min: 199.2 / Avg: 534.2 / Max: 599.5

Memcached 1.6.19 - CPU Peak Freq (Highest CPU Core Frequency) Monitor (Megahertz, More Is Better)
Optimized Power Mode: Min: 800 / Avg: 2925 / Max: 3905
Default: Min: 800 / Avg: 2908 / Max: 3906

Memcached 1.6.19 - Set To Get Ratio: 1:10 (Ops/sec Per Watt, More Is Better)
Optimized Power Mode: 5158.07
Default: 6290.44

Memcached 1.6.19 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
Optimized Power Mode: 2180648.21 (SE +/- 43551.88, N = 15)
Default: 3360511.12 (SE +/- 29533.59, N = 15)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
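
The Memcached per-Watt figures appear to be the throughput divided by the average CPU power recorded during the test. That relationship is an assumption inferred from the numbers rather than something the result file states; the sketch below recomputes it from the rounded averages, so it matches the reported values only approximately. All names are hypothetical.

```python
def per_watt(throughput: float, avg_watts: float) -> float:
    """Throughput divided by average power draw."""
    return throughput / avg_watts


# Memcached 1.6.19, Set To Get Ratio 1:10: (ops/sec, average Watts, reported ops/sec per Watt).
runs = {
    "Optimized Power Mode": (2180648.21, 422.8, 5158.07),
    "Default": (3360511.12, 534.2, 6290.44),
}

recomputed = {}
for name, (ops_per_sec, avg_watts, reported) in runs.items():
    recomputed[name] = per_watt(ops_per_sec, avg_watts)
    print(f"{name}: recomputed {recomputed[name]:.2f}, reported {reported:.2f}")
```

Both recomputed values land within a fraction of a percent of the reported ones, which is consistent with the reported figures being derived from an unrounded power average.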

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

Tuning: Visual Quality Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

SVT-VP9 0.3 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Optimized Power Mode: Min: 94.5 / Avg: 216.8 / Max: 399.4
Default: Min: 195.4 / Avg: 271.1 / Max: 486.7

SVT-VP9 0.3 - CPU Peak Freq (Highest CPU Core Frequency) Monitor (Megahertz, More Is Better)
Optimized Power Mode: Min: 800 / Avg: 3612 / Max: 3910
Default: Min: 800 / Avg: 3396 / Max: 3909

SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second Per Watt, More Is Better)
Optimized Power Mode: 2.275
Default: 2.003

SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Optimized Power Mode: 493.26 (SE +/- 9.88, N = 15)
Default: 542.98 (SE +/- 2.30, N = 9)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Tuning: VMAF Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

easyWave

easyWave r34 - CPU Power Consumption Monitor (Watts, Fewer Is Better)
Optimized Power Mode: Min: 160.9 / Avg: 328.1 / Max: 496.2
Default: Min: 198.3 / Avg: 407.9 / Max: 539.4

easyWave r34 - CPU Peak Freq (Highest CPU Core Frequency) Monitor (Megahertz, More Is Better)
Optimized Power Mode: Min: 800 / Avg: 3486 / Max: 3906
Default: Min: 800 / Avg: 3436 / Max: 3906

easyWave r34 - Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 (Seconds, Fewer Is Better)
Optimized Power Mode: 102.93 (SE +/- 1.07, N = 3)
Default: 89.98 (SE +/- 1.41, N = 15)
1. (CXX) g++ options: -O3 -fopenmp

171 Results Shown

PyTorch:
  CPU - 256 - ResNet-50
  CPU - 256 - ResNet-152
  CPU - 64 - ResNet-152
nginx
PyTorch
OpenFOAM
nginx
Redis
Zstd Compression
PyTorch
Xmrig
VVenC
uvg266
PyTorch
Zstd Compression
Apache Spark TPC-H
easyWave
Zstd Compression
SVT-AV1
DuckDB
uvg266:
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Very Fast
Apache Hadoop
Apache HTTP Server
Zstd Compression
SVT-AV1:
  Preset 8 - Bosphorus 1080p
  Preset 8 - Bosphorus 4K
FFmpeg
VVenC
Zstd Compression
Y-Cruncher
Liquid-DSP
VVenC
Liquid-DSP
SVT-AV1:
  Preset 12 - Bosphorus 4K
  Preset 4 - Bosphorus 4K
  Preset 13 - Bosphorus 4K
  Preset 13 - Bosphorus 1080p
OpenRadioss
Kvazaar
uvg266
VVenC
uvg266:
  Bosphorus 1080p - Ultra Fast
  Bosphorus 1080p - Super Fast
OpenFOAM
Zstd Compression
SVT-VP9
Timed GCC Compilation
PostgreSQL:
  100 - 1000 - Read Write - Average Latency
  100 - 1000 - Read Write
SVT-VP9
Kvazaar:
  Bosphorus 1080p - Ultra Fast
  Bosphorus 4K - Very Fast
SVT-AV1
Apache HTTP Server
Timed LLVM Compilation
DuckDB
Kvazaar
7-Zip Compression
FFmpeg
Memcached
Kvazaar
Liquid-DSP
Kvazaar
Redis
Neural Magic DeepSparse:
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
Blender
FFmpeg
Intel Open Image Denoise
FFmpeg
Liquid-DSP
OpenVINO
Liquid-DSP:
  32 - 256 - 512
  256 - 256 - 57
Timed Linux Kernel Compilation
Redis
Liquid-DSP
OpenVINO
OpenRadioss
Timed LLVM Compilation
uvg266
Liquid-DSP
QMCPACK
uvg266
Blender
uvg266
Xmrig
Kvazaar
OpenRadioss
OpenVINO
Neural Magic DeepSparse:
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
    items/sec
    ms/batch
Liquid-DSP
OpenVINO
Neural Magic DeepSparse
Timed Linux Kernel Compilation
OpenVINO:
  Handwritten English Recognition FP16-INT8 - CPU:
    ms
    FPS
Xmrig
Neural Magic DeepSparse:
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream
Kvazaar
QuantLib
Kvazaar
OpenRadioss
Liquid-DSP:
  64 - 256 - 32
  32 - 256 - 32
Neural Magic DeepSparse
uvg266
OpenRadioss
Neural Magic DeepSparse:
  ResNet-50, Baseline - Asynchronous Multi-Stream:
    ms/batch
    items/sec
OpenFOAM
OpenVINO:
  Face Detection Retail FP16-INT8 - CPU
  Machine Translation EN To DE FP16 - CPU
Neural Magic DeepSparse:
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream
OpenVINO
OpenRadioss
Neural Magic DeepSparse
OpenVINO
Kvazaar
OpenVINO
Xmrig
OpenVINO
Redis
OpenVINO:
  Person Detection FP16 - CPU:
    FPS
    ms
  Face Detection FP16-INT8 - CPU:
    ms
    FPS
Neural Magic DeepSparse
QuantLib
Xmrig:
  CryptoNight-Femto UPX2 - 1M
  Monero - 1M
Neural Magic DeepSparse
Liquid-DSP
Neural Magic DeepSparse
OpenVINO
Neural Magic DeepSparse
OpenVINO
Neural Magic DeepSparse
7-Zip Compression
OpenVINO
Neural Magic DeepSparse
OpenVINO
Neural Magic DeepSparse:
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
    ms/batch
    items/sec
CPU Power Consumption Monitor:
  Phoronix Test Suite System Monitoring:
    Watts
    Megahertz
  CPU Power Consumption Monitor:
    Watts
  CPU Peak Freq (Highest CPU Core Frequency) Monitor:
    Megahertz
Blender
PostgreSQL:
  CPU Power Consumption Monitor
  CPU Peak Freq (Highest CPU Core Frequency) Monitor
PostgreSQL:
  100 - 1000 - Read Only - Average Latency
  100 - 1000 - Read Only
Memcached:
  CPU Power Consumption Monitor
  CPU Peak Freq (Highest CPU Core Frequency) Monitor
  1:10
Memcached
SVT-VP9:
  CPU Power Consumption Monitor
  CPU Peak Freq (Highest CPU Core Frequency) Monitor
  VMAF Optimized - Bosphorus 1080p
SVT-VP9
easyWave:
  CPU Power Consumption Monitor
  CPU Peak Freq (Highest CPU Core Frequency) Monitor
easyWave