Intel Optimized Power Mode Xeon Platinum Benchmarks

2 x INTEL XEON PLATINUM 8592 testing by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312153-NE-XEONEMRPO30
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

C++ Boost Tests 3 Tests
Timed Code Compilation 3 Tests
C/C++ Compiler Tests 9 Tests
Compression Tests 2 Tests
CPU Massive 12 Tests
Creator Workloads 9 Tests
Database Test Suite 4 Tests
Encoding 6 Tests
Game Development 2 Tests
HPC - High Performance Computing 6 Tests
Java Tests 2 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 3 Tests
Multi-Core 15 Tests
Intel oneAPI 2 Tests
OpenMPI Tests 3 Tests
Programmer / Developer System Benchmarks 4 Tests
Python Tests 7 Tests
Scientific Computing 2 Tests
Server 8 Tests
Server CPU Tests 9 Tests
Single-Threaded 2 Tests
Video Encoding 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Default
December 14 2023
  16 Hours, 25 Minutes
Optimized Power Mode
December 15 2023
  1 Day, 52 Minutes
Invert Hiding All Results Option
  20 Hours, 38 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Intel Optimized Power Mode Xeon Platinum BenchmarksOpenBenchmarking.orgPhoronix Test Suite2 x INTEL XEON PLATINUM 8592+ @ 3.90GHz (128 Cores / 256 Threads)Quanta Cloud S6Q-MB-MPS (3B05.TEL4P1 BIOS)Intel Device 1bce1008GB3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Intel X710 for 10GBASE-TUbuntu 23.106.5.0-13-generic (x86_64)GCC 13.2.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen ResolutionIntel Optimized Power Mode Xeon Platinum Benchmarks PerformanceSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x21000161- OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10)- Python 3.11.6- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

Default vs. Optimized Power Mode ComparisonPhoronix Test SuiteBaseline+13.5%+13.5%+27%+27%+40.5%+40.5%+54%+54%15%10.7%8.4%4.6%4.2%3.7%2.7%2.5%1:1054.1%CPU - 256 - ResNet-5041.2%CPU - 256 - ResNet-15239.3%CPU - 64 - ResNet-15238.5%50035.1%CPU - 64 - ResNet-5033.7%motorBike - Execution Time33.3%100031.7%GET - 5024.5%12 - Compression Speed22.6%CPU - 64 - Efficientnet_v2_l20.2%GhostRider - 1M19.4%Bosphorus 4K - Faster19.1%Bosphorus 4K - Ultra Fast18.3%CPU - 256 - Efficientnet_v2_l18.3%19 - Compression Speed17.9%10 - G.M.O.A.Q16.9%e.G.B.S - 120016.5%19, Long Mode - Compression Speed15.7%Barbershop - CPU-OnlyPreset 12 - Bosphorus 1080p14.4%e.G.B.S - 240014.4%IMDB14.2%Bosphorus 4K - Super Fast14.2%Bosphorus 4K - Very Fast14.2%Create - 100 - 10000014%50012.4%19 - D.S12.3%Preset 8 - Bosphorus 1080p12.3%Preset 8 - Bosphorus 4K12.2%libx265 - Live11.5%Bosphorus 1080p - Faster10.9%12 - D.S10.9%1B32 - 256 - 5710.5%Bosphorus 1080p - Fast10.3%VMAF Optimized - Bosphorus 1080p10.1%64 - 256 - 579.9%Preset 12 - Bosphorus 4K9.9%Preset 4 - Bosphorus 4K9.8%Preset 13 - Bosphorus 4K9.6%Preset 13 - Bosphorus 1080p9.4%R.O.R.S.I9.4%Bosphorus 4K - Ultra Fast9.4%Bosphorus 1080p - Very Fast9.2%Bosphorus 4K - Fast9.2%Bosphorus 1080p - Ultra Fast8.7%Bosphorus 1080p - Super Fast8.6%d.S.M.S - Mesh Time19, Long Mode - D.S8.2%P.S.O - Bosphorus 1080p8%Time To Compile7.5%100 - 1000 - Read Write - Average Latency7.2%100 - 1000 - Read Write7.1%V.Q.O - Bosphorus 1080p6.9%Bosphorus 1080p - Ultra Fast6.9%Bosphorus 4K - Very Fast6.8%Preset 4 - Bosphorus 1080p6.5%10005.9%Unix Makefiles5.9%TPC-H Parquet5.8%Bosphorus 4K - Super Fast5.7%Compression Rating5.4%libx265 - Platform5.2%1:1005.1%Bosphorus 1080p - Super Fast5%128 - 256 - 575%Bosphorus 1080p - Very Fast4.8%SET - 5004.8%R.5.S.I - A.M.S4.7%R.5.S.I - A.M.S4.7%Classroom - CPU-Onlylibx265 - Video On Demand4.2%RT.ldr_alb_nrm.3840x2160 - CPU-Onlylibx265 - Upload3.8%128 - 256 - 32A.G.R.R.0.F.I - CPU3.6%32 - 256 - 5123.4%256 - 256 - 573.2%defconfig3%GET - 5002.8%128 - 256 - 512A.G.R.R.0.F.I - CPU2.5%Bumper BeamNinja2.5%10 - Q0119%10 - Q0240.2%10 - Q0319.3%10 - Q0413.7%10 - Q0539.4%10 - Q0626.8%10 - Q0722.5%10 - Q0830.5%10 - Q0923.3%10 - Q1018.4%10 - Q1133.9%10 - Q1226.8%10 - Q1315.8%10 - Q1413.3%10 - Q158.7%10 - Q1613.8%10 - Q173.4%10 - Q1810.1%10 - Q1912.6%10 - Q2019.6%10 - Q2117.7%10 - Q2210.5%MemcachedPyTorchPyTorchPyTorchnginxPyTorchOpenFOAMnginxRedisZstd CompressionPyTorchXmrigVVenCuvg266PyTorchZstd CompressionApache Spark TPC-HeasyWaveZstd CompressionBlenderSVT-AV1easyWaveDuckDBuvg266uvg266Apache HadoopApache HTTP ServerZstd CompressionSVT-AV1SVT-AV1FFmpegVVenCZstd CompressionY-CruncherLiquid-DSPVVenCSVT-VP9Liquid-DSPSVT-AV1SVT-AV1SVT-AV1SVT-AV1OpenRadiossKvazaaruvg266VVenCuvg266uvg266OpenFOAMZstd CompressionSVT-VP9Timed GCC CompilationPostgreSQLPostgreSQLSVT-VP9KvazaarKvazaarSVT-AV1Apache HTTP ServerTimed LLVM CompilationDuckDBKvazaar7-Zip CompressionFFmpegMemcachedKvazaarLiquid-DSPKvazaarRedisNeural Magic DeepSparseNeural Magic DeepSparseBlenderFFmpegIntel Open Image DenoiseFFmpegLiquid-DSPOpenVINOLiquid-DSPLiquid-DSPTimed Linux Kernel CompilationRedisLiquid-DSPOpenVINOOpenRadiossTimed LLVM CompilationApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HDefaultOptimized Power Mode

Intel Optimized Power Mode Xeon Platinum Benchmarkscompress-7zip: Compression Ratingcompress-7zip: Decompression Ratinghadoop: Create - 100 - 100000apache: 500apache: 1000spark-tpch: 10 - Geometric Mean Of All Queriesspark-tpch: 10 - Q01spark-tpch: 10 - Q02spark-tpch: 10 - Q03spark-tpch: 10 - Q04spark-tpch: 10 - Q05spark-tpch: 10 - Q06spark-tpch: 10 - Q07spark-tpch: 10 - Q08spark-tpch: 10 - Q09spark-tpch: 10 - Q10spark-tpch: 10 - Q11spark-tpch: 10 - Q12spark-tpch: 10 - Q13spark-tpch: 10 - Q14spark-tpch: 10 - Q15spark-tpch: 10 - Q16spark-tpch: 10 - Q17spark-tpch: 10 - Q18spark-tpch: 10 - Q19spark-tpch: 10 - Q20spark-tpch: 10 - Q21spark-tpch: 10 - Q22blender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Barbershop - CPU-Onlyduckdb: IMDBduckdb: TPC-H Parqueteasywave: e2Asean Grid + BengkuluSept2007 Source - 1200easywave: e2Asean Grid + BengkuluSept2007 Source - 2400ffmpeg: libx265 - Liveffmpeg: libx265 - Uploadffmpeg: libx265 - Platformffmpeg: libx265 - Video On Demandoidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlykvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Super Fastkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Super Fastkvazaar: Bosphorus 1080p - Ultra Fastliquid-dsp: 32 - 256 - 32liquid-dsp: 32 - 256 - 57liquid-dsp: 64 - 256 - 32liquid-dsp: 64 - 256 - 57liquid-dsp: 128 - 256 - 32liquid-dsp: 128 - 256 - 57liquid-dsp: 256 - 256 - 32liquid-dsp: 256 - 256 - 57liquid-dsp: 32 - 256 - 512liquid-dsp: 64 - 256 - 512liquid-dsp: 128 - 256 - 512liquid-dsp: 256 - 256 - 512memcached: 1:10memcached: 1:100deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamnginx: 500nginx: 1000openfoam: motorBike - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenradioss: Bumper Beamopenradioss: Chrysler Neon 1Mopenradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationopenradioss: INIVOL and Fluid Structure Interaction Drop Containeropenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUpgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 1000 - Read Writepgbench: 100 - 1000 - Read Write - Average Latencypytorch: CPU - 64 - ResNet-50pytorch: CPU - 256 - ResNet-50pytorch: CPU - 64 - ResNet-152pytorch: CPU - 256 - ResNet-152pytorch: CPU - 64 - Efficientnet_v2_lpytorch: CPU - 256 - Efficientnet_v2_lqmcpack: Li2_STO_aequantlib: Multi-Threadedquantlib: Single-Threadedredis: GET - 50redis: SET - 50redis: GET - 500redis: SET - 500svt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Ksvt-av1: Preset 4 - Bosphorus 1080psvt-av1: Preset 8 - Bosphorus 1080psvt-av1: Preset 12 - Bosphorus 1080psvt-av1: Preset 13 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pbuild-gcc: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigbuild-llvm: Ninjabuild-llvm: Unix Makefilesuvg266: Bosphorus 4K - Slowuvg266: Bosphorus 4K - Mediumuvg266: Bosphorus 1080p - Slowuvg266: Bosphorus 1080p - Mediumuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Ultra Fastuvg266: Bosphorus 1080p - Very Fastuvg266: Bosphorus 1080p - Super Fastuvg266: Bosphorus 1080p - Ultra Fastvvenc: Bosphorus 4K - Fastvvenc: Bosphorus 4K - Fastervvenc: Bosphorus 1080p - Fastvvenc: Bosphorus 1080p - Fasterxmrig: KawPow - 1Mxmrig: Monero - 1Mxmrig: Wownero - 1Mxmrig: GhostRider - 1Mxmrig: CryptoNight-Heavy - 1Mxmrig: CryptoNight-Femto UPX2 - 1My-cruncher: 1Bcompress-zstd: 12 - Compression Speedcompress-zstd: 12 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression SpeedDefaultOptimized Power Mode689516637311549090823.0680826.718.359196838.841855697.6981140812.590769778.1769722312.944283492.5661711711.4746182812.0530392317.5612882011.879122105.997661757.933933265.644020565.778444924.245615804.9724410412.6007382112.894165685.094584628.5617198926.481879554.4956595112.8934.94149.49121.967148.5536.54689.984131.8127.0053.9853.634.5341.4541.99127.25131.1469.3671.0577.53268.49263.88282.05121176666714291000002476033333281016666736118666674329866667618560000058355000005082450001012336667147390000021445666673360511.123441439.71137.3439463.08544580.631813.94511778.594535.931411244.79415.6732834.176776.5445160.4064397.77311791.273735.6695862.138274.07221235.406351.7033182.1591350.90081899.222333.6385137.3540462.9932273684.71243384.194.1328830.0422323.46982284.7885.8225.74111.2381.6997.87748.1042.72539.69236.708930.0014.3224795.175.152390.2553.511106.3228.8548795.972.4510205.6112.513330.1938.41121577.550.408930891.1246396215.63443.9144.5817.6717.852.622.5994.384388447.23650.54744838.93060186.423240001.582503177.677.29769.466218.452218.76320.952142.265502.707628.755542.98542.17463.27706.25923.756149.79296.518177.43127.2029.7980.3088.4656.0657.8759.59160.63158.23166.476.92110.83619.91532.09170383.270469.875002.316522.570573.069677.85.199333.61422.719.11173.59.901183.8654459638136481480817.7376315.659.7730412710.5253196410.7942171115.018873089.2947692918.041547783.2531764214.0577049315.7232208221.6599576114.062806408.0304369910.057694166.534819476.548335144.614900045.6564852813.0284314914.199875835.7373820010.2426731931.161191124.9695701612.6733.40129.94139.326157.10142.559102.928118.2426.0251.3151.454.7241.1341.69125.18130.6464.9767.2170.87256.19251.33263.9612045000001293333333246086666725565733333745233333412500000063014833335656600000491426667999663333151346666721410000002180648.213274425.32138.0114461.14694590.337513.90951768.910836.128610738.13525.9385834.270676.5042162.5895392.45151776.525235.9561863.347473.98961237.025751.6176183.7104346.74111883.657333.9032138.1497460.6851202507.42184861.995.5089727.7087323.34883382.7386.3025.37111.7389.3798.49750.4342.59538.20237.368943.8814.3024922.405.132392.3853.451100.7628.9949494.062.4210168.0612.553297.8138.79117369.550.418784681.1415969516.75232.8331.5812.7612.812.182.1992.672391468.43641.43811819.503050498.003152359.752389636.676.64561.914198.742199.54919.669126.697439.477574.599493.26502.05433.53759.21624.476151.27698.905187.83726.7129.6278.9386.7849.1050.6850.36147.08145.75153.176.3409.09618.05328.92669196.770298.875701.713832.470338.669847.94.697272.21282.716.21044.68.561094.2OpenBenchmarking.org

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingDefaultOptimized Power Mode150K300K450K600K750KSE +/- 869.44, N = 3SE +/- 1440.87, N = 36895166544591. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingOptimized Power ModeDefault140K280K420K560K700KSE +/- 9574.07, N = 3SE +/- 4040.78, N = 36381366373111. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Apache Hadoop

This is a benchmark of the Apache Hadoop making use of its built-in name-node throughput benchmark (NNThroughputBenchmark). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps per sec, More Is BetterApache Hadoop 3.3.6Operation: Create - Threads: 100 - Files: 100000DefaultOptimized Power Mode12002400360048006000SE +/- 32.35, N = 3SE +/- 62.35, N = 354904814

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 500DefaultOptimized Power Mode20K40K60K80K100KSE +/- 265.12, N = 3SE +/- 184.86, N = 390823.0680817.731. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 1000DefaultOptimized Power Mode20K40K60K80K100K80826.7176315.651. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache Spark TPC-H

This is a benchmark of Apache Spark using TPC-H data-set. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks the Apache Spark in a single-system configuration using spark-submit. The test makes use of https://github.com/ssavvides/tpch-spark/ for facilitating the TPC-H benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark TPC-H 3.5Scale Factor: 10 - Geometric Mean Of All QueriesDefaultOptimized Power Mode3691215SE +/- 0.07178690, N = 3SE +/- 0.09145601, N = 78.359196839.77304127MIN: 4.15 / MAX: 27.27MIN: 4.46 / MAX: 32.81

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: BMW27 - Compute: CPU-OnlyOptimized Power ModeDefault3691215SE +/- 0.04, N = 4SE +/- 0.03, N = 412.6712.89

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Classroom - Compute: CPU-OnlyOptimized Power ModeDefault816243240SE +/- 0.07, N = 3SE +/- 0.33, N = 333.4034.94

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Barbershop - Compute: CPU-OnlyOptimized Power ModeDefault306090120150SE +/- 0.11, N = 3SE +/- 6.68, N = 9129.94149.49

CPU Peak Freq (Highest CPU Core Frequency) Monitor

OpenBenchmarking.orgMegahertzCPU Peak Freq (Highest CPU Core Frequency) MonitorPhoronix Test Suite System MonitoringDefaultOptimized Power Mode10002000300040005000Min: 500 / Avg: 3374.53 / Max: 5474Min: 800 / Avg: 3437 / Max: 5154

CPU Power Consumption Monitor

OpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringOptimized Power ModeDefault140280420560700Min: 88.73 / Avg: 366.23 / Max: 802.53Min: 101.15 / Avg: 445.93 / Max: 802.3

DuckDB

DuckDB is an in-progress SQL OLAP database management system optimized for analytics and features a vectorized and parallel engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: IMDBDefaultOptimized Power Mode306090120150SE +/- 0.49, N = 3SE +/- 0.43, N = 3121.97139.331. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: TPC-H ParquetDefaultOptimized Power Mode306090120150SE +/- 0.31, N = 3SE +/- 0.38, N = 3148.55157.101. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

easyWave

The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200DefaultOptimized Power Mode1020304050SE +/- 0.28, N = 3SE +/- 0.41, N = 1536.5542.561. (CXX) g++ options: -O3 -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400DefaultOptimized Power Mode20406080100SE +/- 1.41, N = 15SE +/- 1.07, N = 389.98102.931. (CXX) g++ options: -O3 -fopenmp

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: LiveDefaultOptimized Power Mode306090120150SE +/- 1.03, N = 3SE +/- 0.79, N = 3131.81118.241. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: UploadDefaultOptimized Power Mode612182430SE +/- 0.06, N = 3SE +/- 0.06, N = 327.0026.021. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: PlatformDefaultOptimized Power Mode1224364860SE +/- 0.11, N = 3SE +/- 0.14, N = 353.9851.311. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: Video On DemandDefaultOptimized Power Mode1224364860SE +/- 0.04, N = 3SE +/- 0.10, N = 353.6351.451. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Intel Open Image Denoise

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.1Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-OnlyOptimized Power ModeDefault1.0622.1243.1864.2485.31SE +/- 0.01, N = 6SE +/- 0.01, N = 64.724.53

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: SlowDefaultOptimized Power Mode918273645SE +/- 0.06, N = 4SE +/- 0.06, N = 441.4541.131. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumDefaultOptimized Power Mode1020304050SE +/- 0.06, N = 4SE +/- 0.05, N = 441.9941.691. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: SlowDefaultOptimized Power Mode306090120150SE +/- 0.38, N = 8SE +/- 0.35, N = 8127.25125.181. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: MediumDefaultOptimized Power Mode306090120150SE +/- 0.46, N = 8SE +/- 0.58, N = 8131.14130.641. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastDefaultOptimized Power Mode1530456075SE +/- 0.26, N = 5SE +/- 0.44, N = 569.3664.971. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastDefaultOptimized Power Mode1632486480SE +/- 0.25, N = 5SE +/- 0.12, N = 571.0567.211. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastDefaultOptimized Power Mode20406080100SE +/- 0.23, N = 6SE +/- 0.70, N = 677.5370.871. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: Very FastDefaultOptimized Power Mode60120180240300SE +/- 1.16, N = 10SE +/- 1.04, N = 10268.49256.191. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: Super FastDefaultOptimized Power Mode60120180240300SE +/- 1.09, N = 10SE +/- 1.69, N = 10263.88251.331. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: Ultra FastDefaultOptimized Power Mode60120180240300SE +/- 2.07, N = 11SE +/- 1.93, N = 10282.05263.961. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 32DefaultOptimized Power Mode300M600M900M1200M1500MSE +/- 1273228.62, N = 3SE +/- 1193035.34, N = 3121176666712045000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 57DefaultOptimized Power Mode300M600M900M1200M1500MSE +/- 9950041.88, N = 3SE +/- 16392511.84, N = 3142910000012933333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32DefaultOptimized Power Mode500M1000M1500M2000M2500MSE +/- 6835284.27, N = 3SE +/- 13974301.81, N = 3247603333324608666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57DefaultOptimized Power Mode600M1200M1800M2400M3000MSE +/- 24169149.30, N = 3SE +/- 20734086.19, N = 15281016666725565733331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 32Optimized Power ModeDefault800M1600M2400M3200M4000MSE +/- 3883440.63, N = 3SE +/- 1386041.53, N = 3374523333336118666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 57DefaultOptimized Power Mode900M1800M2700M3600M4500MSE +/- 11478143.48, N = 3SE +/- 34188302.09, N = 3432986666741250000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 32Optimized Power ModeDefault1300M2600M3900M5200M6500MSE +/- 61606333.10, N = 6SE +/- 7150058.27, N = 3630148333361856000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 57DefaultOptimized Power Mode1200M2400M3600M4800M6000MSE +/- 11767044.38, N = 3SE +/- 19835069.95, N = 3583550000056566000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 512DefaultOptimized Power Mode110M220M330M440M550MSE +/- 5123258.57, N = 6SE +/- 3588947.54, N = 35082450004914266671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512DefaultOptimized Power Mode200M400M600M800M1000MSE +/- 10474795.68, N = 3SE +/- 5205466.14, N = 310123366679996633331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512Optimized Power ModeDefault300M600M900M1200M1500MSE +/- 13505101.92, N = 3SE +/- 4762352.36, N = 3151346666714739000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 512DefaultOptimized Power Mode500M1000M1500M2000M2500MSE +/- 2968913.01, N = 3SE +/- 1101514.11, N = 3214456666721410000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10DefaultOptimized Power Mode700K1400K2100K2800K3500KSE +/- 29533.59, N = 15SE +/- 43551.88, N = 153360511.122180648.211. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100DefaultOptimized Power Mode700K1400K2100K2800K3500KSE +/- 25091.90, N = 3SE +/- 42725.42, N = 33441439.713274425.321. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault306090120150SE +/- 0.12, N = 3SE +/- 0.21, N = 3138.01137.34

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault100200300400500SE +/- 0.42, N = 3SE +/- 0.27, N = 3461.15463.09

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault10002000300040005000SE +/- 2.64, N = 3SE +/- 6.99, N = 34590.344580.63

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault48121620SE +/- 0.01, N = 3SE +/- 0.02, N = 313.9113.95

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode400800120016002000SE +/- 3.42, N = 3SE +/- 6.86, N = 31778.591768.91

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode816243240SE +/- 0.08, N = 3SE +/- 0.14, N = 335.9336.13

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode2K4K6K8K10KSE +/- 7.42, N = 3SE +/- 12.41, N = 311244.7910738.14

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode1.33622.67244.00865.34486.681SE +/- 0.0033, N = 3SE +/- 0.0065, N = 35.67325.9385

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault2004006008001000SE +/- 0.60, N = 3SE +/- 0.97, N = 3834.27834.18

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault20406080100SE +/- 0.04, N = 3SE +/- 0.04, N = 376.5076.54

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault4080120160200SE +/- 0.33, N = 3SE +/- 1.36, N = 3162.59160.41

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault90180270360450SE +/- 0.71, N = 3SE +/- 3.53, N = 3392.45397.77

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode400800120016002000SE +/- 7.80, N = 3SE +/- 3.31, N = 31791.271776.53

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode816243240SE +/- 0.16, N = 3SE +/- 0.07, N = 335.6735.96

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault2004006008001000SE +/- 0.10, N = 3SE +/- 0.30, N = 3863.35862.14

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault1632486480SE +/- 0.03, N = 3SE +/- 0.05, N = 373.9974.07

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault30060090012001500SE +/- 2.51, N = 3SE +/- 1.85, N = 31237.031235.41

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault1224364860SE +/- 0.07, N = 3SE +/- 0.05, N = 351.6251.70

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault4080120160200SE +/- 0.02, N = 3SE +/- 0.43, N = 3183.71182.16

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault80160240320400SE +/- 0.05, N = 3SE +/- 0.69, N = 3346.74350.90

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode400800120016002000SE +/- 1.67, N = 3SE +/- 0.88, N = 31899.221883.66

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamDefaultOptimized Power Mode816243240SE +/- 0.03, N = 3SE +/- 0.01, N = 333.6433.90

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault306090120150SE +/- 0.30, N = 3SE +/- 0.09, N = 3138.15137.35

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault100200300400500SE +/- 0.70, N = 3SE +/- 0.03, N = 3460.69462.99

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500DefaultOptimized Power Mode60K120K180K240K300KSE +/- 25.30, N = 3SE +/- 10.52, N = 3273684.71202507.421. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000DefaultOptimized Power Mode50K100K150K200K250KSE +/- 397.04, N = 3SE +/- 72.63, N = 2243384.19184861.991. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution TimeDefaultOptimized Power Mode1.23952.4793.71854.9586.19754.132885.508971. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeOptimized Power ModeDefault71421283527.7130.041. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeOptimized Power ModeDefault61218243023.3523.471. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper BeamOptimized Power ModeDefault20406080100SE +/- 0.13, N = 3SE +/- 0.18, N = 382.7384.78

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1MDefaultOptimized Power Mode20406080100SE +/- 0.14, N = 3SE +/- 0.17, N = 385.8286.30

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop TestOptimized Power ModeDefault612182430SE +/- 0.12, N = 3SE +/- 0.14, N = 325.3725.74

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on WindshieldDefaultOptimized Power Mode306090120150SE +/- 0.65, N = 3SE +/- 0.28, N = 3111.23111.73

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal InstallationDefaultOptimized Power Mode20406080100SE +/- 0.03, N = 3SE +/- 0.13, N = 381.6989.37

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop ContainerDefaultOptimized Power Mode20406080100SE +/- 0.07, N = 3SE +/- 0.08, N = 397.8798.49

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUOptimized Power ModeDefault160320480640800SE +/- 0.90, N = 3SE +/- 0.71, N = 3750.43748.101. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUOptimized Power ModeDefault1020304050SE +/- 0.05, N = 3SE +/- 0.04, N = 342.5942.72MIN: 32.12 / MAX: 128.96MIN: 31.92 / MAX: 84.11. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUDefaultOptimized Power Mode120240360480600SE +/- 0.65, N = 3SE +/- 1.35, N = 3539.69538.201. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUDefaultOptimized Power Mode50100150200250SE +/- 0.35, N = 3SE +/- 0.55, N = 3236.70237.36MIN: 157.14 / MAX: 268.19MIN: 160.6 / MAX: 286.171. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault2K4K6K8K10KSE +/- 4.32, N = 3SE +/- 5.30, N = 38943.888930.001. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 314.3014.32MIN: 12.18 / MAX: 47.3MIN: 12.39 / MAX: 39.161. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUOptimized Power ModeDefault5K10K15K20K25KSE +/- 6.86, N = 3SE +/- 42.09, N = 324922.4024795.171. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUOptimized Power ModeDefault1.15882.31763.47644.63525.794SE +/- 0.00, N = 3SE +/- 0.01, N = 35.135.15MIN: 4.64 / MAX: 29.56MIN: 4.63 / MAX: 27.91. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUOptimized Power ModeDefault5001000150020002500SE +/- 7.16, N = 3SE +/- 3.42, N = 32392.382390.251. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUOptimized Power ModeDefault1224364860SE +/- 0.16, N = 3SE +/- 0.08, N = 353.4553.51MIN: 45.01 / MAX: 111.52MIN: 43.94 / MAX: 101.681. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUDefaultOptimized Power Mode2004006008001000SE +/- 9.12, N = 3SE +/- 3.05, N = 31106.321100.761. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUDefaultOptimized Power Mode714212835SE +/- 0.23, N = 3SE +/- 0.08, N = 328.8528.99MIN: 21.11 / MAX: 222.12MIN: 22.06 / MAX: 222.831. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault11K22K33K44K55KSE +/- 229.19, N = 3SE +/- 490.25, N = 349494.0648795.971. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault0.55131.10261.65392.20522.7565SE +/- 0.00, N = 3SE +/- 0.01, N = 32.422.45MIN: 1.96 / MAX: 28.3MIN: 2.03 / MAX: 23.831. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUDefaultOptimized Power Mode2K4K6K8K10KSE +/- 7.22, N = 3SE +/- 6.61, N = 310205.6110168.061. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUDefaultOptimized Power Mode3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 312.5112.55MIN: 10.84 / MAX: 42.36MIN: 10.77 / MAX: 42.911. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUDefaultOptimized Power Mode7001400210028003500SE +/- 7.06, N = 3SE +/- 6.73, N = 33330.193297.811. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUDefaultOptimized Power Mode918273645SE +/- 0.08, N = 3SE +/- 0.08, N = 338.4138.79MIN: 35.95 / MAX: 58.71MIN: 36.26 / MAX: 60.121. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUDefaultOptimized Power Mode30K60K90K120K150KSE +/- 775.18, N = 3SE +/- 1361.46, N = 3121577.55117369.551. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUDefaultOptimized Power Mode0.09230.18460.27690.36920.4615SE +/- 0.00, N = 3SE +/- 0.01, N = 30.400.41MIN: 0.18 / MAX: 14.47MIN: 0.19 / MAX: 15.841. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlyDefaultOptimized Power Mode200K400K600K800K1000KSE +/- 17460.42, N = 10SE +/- 11877.48, N = 128930898784681. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencyDefaultOptimized Power Mode0.25670.51340.77011.02681.2835SE +/- 0.022, N = 10SE +/- 0.016, N = 121.1241.1411. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read WriteDefaultOptimized Power Mode14K28K42K56K70KSE +/- 70.87, N = 3SE +/- 214.64, N = 363962596951. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average LatencyDefaultOptimized Power Mode48121620SE +/- 0.02, N = 3SE +/- 0.06, N = 315.6316.751. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PyTorch

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: ResNet-50DefaultOptimized Power Mode1020304050SE +/- 0.55, N = 4SE +/- 0.39, N = 343.9132.83MIN: 17.29 / MAX: 46.23MIN: 15.85 / MAX: 36.48

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: ResNet-50DefaultOptimized Power Mode1020304050SE +/- 0.51, N = 4SE +/- 0.02, N = 344.5831.58MIN: 19.44 / MAX: 46.72MIN: 14.91 / MAX: 34.67

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: ResNet-152DefaultOptimized Power Mode48121620SE +/- 0.23, N = 3SE +/- 0.12, N = 1217.6712.76MIN: 10.89 / MAX: 18.38MIN: 5.75 / MAX: 15.71

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: ResNet-152DefaultOptimized Power Mode48121620SE +/- 0.13, N = 11SE +/- 0.17, N = 317.8512.81MIN: 7.02 / MAX: 19.19MIN: 6.6 / MAX: 13.9

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_lDefaultOptimized Power Mode0.58951.1791.76852.3582.9475SE +/- 0.02, N = 3SE +/- 0.02, N = 32.622.18MIN: 1.32 / MAX: 3.9MIN: 0.98 / MAX: 3.41

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_lDefaultOptimized Power Mode0.58281.16561.74842.33122.914SE +/- 0.00, N = 3SE +/- 0.02, N = 32.592.19MIN: 1.19 / MAX: 4MIN: 0.98 / MAX: 3.3

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: Li2_STO_aeOptimized Power ModeDefault20406080100SE +/- 1.02, N = 5SE +/- 0.40, N = 392.6794.381. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Multi-ThreadedOptimized Power ModeDefault80K160K240K320K400KSE +/- 52.43, N = 3SE +/- 127.57, N = 3391468.4388447.21. (CXX) g++ options: -O3 -march=native -fPIE -pie

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Single-ThreadedDefaultOptimized Power Mode8001600240032004000SE +/- 1.44, N = 3SE +/- 2.92, N = 33650.53641.41. (CXX) g++ options: -O3 -march=native -fPIE -pie

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: GET - Parallel Connections: 50DefaultOptimized Power Mode1000K2000K3000K4000K5000KSE +/- 8167.04, N = 4SE +/- 1281.58, N = 34744838.903811819.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SET - Parallel Connections: 50DefaultOptimized Power Mode700K1400K2100K2800K3500KSE +/- 12726.60, N = 3SE +/- 1584.15, N = 33060186.423050498.001. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: GET - Parallel Connections: 500DefaultOptimized Power Mode700K1400K2100K2800K3500KSE +/- 482.98, N = 3SE +/- 36270.68, N = 33240001.583152359.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SET - Parallel Connections: 500DefaultOptimized Power Mode500K1000K1500K2000K2500KSE +/- 7546.74, N = 3SE +/- 19286.75, N = 152503177.672389636.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 4 - Input: Bosphorus 4KDefaultOptimized Power Mode246810SE +/- 0.098, N = 3SE +/- 0.006, N = 37.2976.6451. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 8 - Input: Bosphorus 4KDefaultOptimized Power Mode1530456075SE +/- 0.39, N = 4SE +/- 0.56, N = 469.4761.911. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 12 - Input: Bosphorus 4KDefaultOptimized Power Mode50100150200250SE +/- 0.90, N = 6SE +/- 2.06, N = 5218.45198.741. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 13 - Input: Bosphorus 4KDefaultOptimized Power Mode50100150200250SE +/- 1.12, N = 5SE +/- 1.56, N = 9218.76199.551. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 4 - Input: Bosphorus 1080pDefaultOptimized Power Mode510152025SE +/- 0.16, N = 5SE +/- 0.11, N = 520.9519.671. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 8 - Input: Bosphorus 1080pDefaultOptimized Power Mode306090120150SE +/- 0.36, N = 7SE +/- 0.77, N = 6142.27126.701. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 12 - Input: Bosphorus 1080pDefaultOptimized Power Mode110220330440550SE +/- 3.53, N = 9SE +/- 4.01, N = 15502.71439.481. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 13 - Input: Bosphorus 1080pDefaultOptimized Power Mode140280420560700SE +/- 4.69, N = 11SE +/- 3.16, N = 10628.76574.601. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

Tuning: VMAF Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pDefaultOptimized Power Mode120240360480600SE +/- 2.30, N = 9SE +/- 9.88, N = 15542.98493.261. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pDefaultOptimized Power Mode120240360480600SE +/- 4.34, N = 9SE +/- 3.72, N = 8542.17502.051. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Tuning: Visual Quality Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pDefaultOptimized Power Mode100200300400500SE +/- 1.93, N = 8SE +/- 2.86, N = 8463.27433.531. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Timed GCC Compilation

This test times how long it takes to build the GNU Compiler Collection (GCC) open-source compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 13.2Time To CompileDefaultOptimized Power Mode160320480640800SE +/- 2.37, N = 3SE +/- 4.73, N = 3706.26759.22

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigDefaultOptimized Power Mode612182430SE +/- 0.21, N = 8SE +/- 0.18, N = 1123.7624.48

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigDefaultOptimized Power Mode306090120150SE +/- 0.51, N = 3SE +/- 0.48, N = 3149.79151.28

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaDefaultOptimized Power Mode20406080100SE +/- 0.29, N = 3SE +/- 0.20, N = 396.5298.91

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix MakefilesDefaultOptimized Power Mode4080120160200SE +/- 0.30, N = 3SE +/- 0.58, N = 3177.43187.84

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: SlowDefaultOptimized Power Mode612182430SE +/- 0.08, N = 3SE +/- 0.08, N = 327.2026.71

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: MediumDefaultOptimized Power Mode714212835SE +/- 0.08, N = 3SE +/- 0.19, N = 329.7929.62

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: SlowDefaultOptimized Power Mode20406080100SE +/- 0.10, N = 6SE +/- 0.19, N = 680.3078.93

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: MediumDefaultOptimized Power Mode20406080100SE +/- 0.08, N = 6SE +/- 0.25, N = 688.4686.78

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Very FastDefaultOptimized Power Mode1326395265SE +/- 0.07, N = 5SE +/- 0.45, N = 1556.0649.10

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Super FastDefaultOptimized Power Mode1326395265SE +/- 0.15, N = 5SE +/- 0.48, N = 457.8750.68

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Ultra FastDefaultOptimized Power Mode1326395265SE +/- 0.37, N = 5SE +/- 0.21, N = 459.5950.36

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Very FastDefaultOptimized Power Mode4080120160200SE +/- 0.74, N = 8SE +/- 1.07, N = 11160.63147.08

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Super FastDefaultOptimized Power Mode306090120150SE +/- 1.01, N = 15SE +/- 1.05, N = 15158.23145.75

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Ultra FastDefaultOptimized Power Mode4080120160200SE +/- 1.12, N = 9SE +/- 1.12, N = 15166.47153.17

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: FastDefaultOptimized Power Mode246810SE +/- 0.067, N = 3SE +/- 0.066, N = 36.9216.3401. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: FasterDefaultOptimized Power Mode3691215SE +/- 0.081, N = 3SE +/- 0.072, N = 310.8369.0961. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: FastDefaultOptimized Power Mode510152025SE +/- 0.15, N = 3SE +/- 0.11, N = 319.9218.051. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: FasterDefaultOptimized Power Mode714212835SE +/- 0.36, N = 3SE +/- 0.30, N = 432.0928.931. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: KawPow - Hash Count: 1MDefaultOptimized Power Mode15K30K45K60K75KSE +/- 82.14, N = 4SE +/- 943.50, N = 370383.269196.71. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Monero - Hash Count: 1MDefaultOptimized Power Mode15K30K45K60K75KSE +/- 27.69, N = 4SE +/- 41.44, N = 370469.870298.81. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Wownero - Hash Count: 1MOptimized Power ModeDefault16K32K48K64K80KSE +/- 24.14, N = 4SE +/- 677.05, N = 475701.775002.31. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1MDefaultOptimized Power Mode4K8K12K16K20KSE +/- 67.39, N = 3SE +/- 22.03, N = 316522.513832.41. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: CryptoNight-Heavy - Hash Count: 1MDefaultOptimized Power Mode15K30K45K60K75KSE +/- 49.63, N = 4SE +/- 96.69, N = 370573.070338.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: CryptoNight-Femto UPX2 - Hash Count: 1MOptimized Power ModeDefault15K30K45K60K75KSE +/- 443.65, N = 3SE +/- 844.15, N = 469847.969677.81. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Y-Cruncher

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 1BOptimized Power ModeDefault1.16982.33963.50944.67925.849SE +/- 0.009, N = 5SE +/- 0.018, N = 54.6975.199

Zstd Compression

This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression SpeedDefaultOptimized Power Mode70140210280350SE +/- 3.46, N = 3SE +/- 2.98, N = 5333.6272.21. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression SpeedDefaultOptimized Power Mode30060090012001500SE +/- 1.98, N = 3SE +/- 1.74, N = 51422.71282.71. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression SpeedDefaultOptimized Power Mode510152025SE +/- 0.06, N = 3SE +/- 0.19, N = 319.116.21. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression SpeedDefaultOptimized Power Mode30060090012001500SE +/- 1.19, N = 3SE +/- 2.93, N = 31173.51044.61. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression SpeedDefaultOptimized Power Mode3691215SE +/- 0.02, N = 3SE +/- 0.01, N = 39.908.561. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression SpeedDefaultOptimized Power Mode30060090012001500SE +/- 16.60, N = 3SE +/- 2.17, N = 31183.81094.21. (CC) gcc options: -O3 -pthread -lz -llzma

159 Results Shown

7-Zip Compression:
  Compression Rating
  Decompression Rating
Apache Hadoop
Apache HTTP Server:
  500
  1000
Apache Spark TPC-H
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Barbershop - CPU-Only
CPU Peak Freq (Highest CPU Core Frequency) Monitor:
  Phoronix Test Suite System Monitoring:
    Megahertz
    Watts
DuckDB:
  IMDB
  TPC-H Parquet
easyWave:
  e2Asean Grid + BengkuluSept2007 Source - 1200
  e2Asean Grid + BengkuluSept2007 Source - 2400
FFmpeg:
  libx265 - Live
  libx265 - Upload
  libx265 - Platform
  libx265 - Video On Demand
Intel Open Image Denoise
Kvazaar:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
  Bosphorus 1080p - Very Fast
  Bosphorus 1080p - Super Fast
  Bosphorus 1080p - Ultra Fast
Liquid-DSP:
  32 - 256 - 32
  32 - 256 - 57
  64 - 256 - 32
  64 - 256 - 57
  128 - 256 - 32
  128 - 256 - 57
  256 - 256 - 32
  256 - 256 - 57
  32 - 256 - 512
  64 - 256 - 512
  128 - 256 - 512
  256 - 256 - 512
Memcached:
  1:10
  1:100
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Baseline - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
nginx:
  500
  1000
OpenFOAM:
  motorBike - Execution Time
  drivaerFastback, Small Mesh Size - Mesh Time
  drivaerFastback, Small Mesh Size - Execution Time
OpenRadioss:
  Bumper Beam
  Chrysler Neon 1M
  Cell Phone Drop Test
  Bird Strike on Windshield
  Rubber O-Ring Seal Installation
  INIVOL and Fluid Structure Interaction Drop Container
OpenVINO:
  Person Detection FP16 - CPU:
    FPS
    ms
  Face Detection FP16-INT8 - CPU:
    FPS
    ms
  Vehicle Detection FP16-INT8 - CPU:
    FPS
    ms
  Face Detection Retail FP16-INT8 - CPU:
    FPS
    ms
  Road Segmentation ADAS FP16-INT8 - CPU:
    FPS
    ms
  Machine Translation EN To DE FP16 - CPU:
    FPS
    ms
  Weld Porosity Detection FP16-INT8 - CPU:
    FPS
    ms
  Person Vehicle Bike Detection FP16 - CPU:
    FPS
    ms
  Handwritten English Recognition FP16-INT8 - CPU:
    FPS
    ms
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
    FPS
    ms
PostgreSQL:
  100 - 1000 - Read Only
  100 - 1000 - Read Only - Average Latency
  100 - 1000 - Read Write
  100 - 1000 - Read Write - Average Latency
PyTorch:
  CPU - 64 - ResNet-50
  CPU - 256 - ResNet-50
  CPU - 64 - ResNet-152
  CPU - 256 - ResNet-152
  CPU - 64 - Efficientnet_v2_l
  CPU - 256 - Efficientnet_v2_l
QMCPACK
QuantLib:
  Multi-Threaded
  Single-Threaded
Redis:
  GET - 50
  SET - 50
  GET - 500
  SET - 500
SVT-AV1:
  Preset 4 - Bosphorus 4K
  Preset 8 - Bosphorus 4K
  Preset 12 - Bosphorus 4K
  Preset 13 - Bosphorus 4K
  Preset 4 - Bosphorus 1080p
  Preset 8 - Bosphorus 1080p
  Preset 12 - Bosphorus 1080p
  Preset 13 - Bosphorus 1080p
SVT-VP9:
  VMAF Optimized - Bosphorus 1080p
  PSNR/SSIM Optimized - Bosphorus 1080p
  Visual Quality Optimized - Bosphorus 1080p
Timed GCC Compilation
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Timed LLVM Compilation:
  Ninja
  Unix Makefiles
uvg266:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
  Bosphorus 1080p - Very Fast
  Bosphorus 1080p - Super Fast
  Bosphorus 1080p - Ultra Fast
VVenC:
  Bosphorus 4K - Fast
  Bosphorus 4K - Faster
  Bosphorus 1080p - Fast
  Bosphorus 1080p - Faster
Xmrig:
  KawPow - 1M
  Monero - 1M
  Wownero - 1M
  GhostRider - 1M
  CryptoNight-Heavy - 1M
  CryptoNight-Femto UPX2 - 1M
Y-Cruncher
Zstd Compression:
  12 - Compression Speed
  12 - Decompression Speed
  19 - Compression Speed
  19 - Decompression Speed
  19, Long Mode - Compression Speed
  19, Long Mode - Decompression Speed