Core i7 4790K 202

Intel Core i7-4790K testing with a Gigabyte Z97-HD3P (F4 BIOS) and Gigabyte Intel Haswell Desktop 2GB on Ubuntu 19.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012207-HA-COREI747935
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

Audio Encoding 2 Tests
Bioinformatics 2 Tests
Chess Test Suite 2 Tests
Timed Code Compilation 3 Tests
C/C++ Compiler Tests 11 Tests
CPU Massive 13 Tests
Creator Workloads 13 Tests
Database Test Suite 2 Tests
Encoding 5 Tests
Fortran Tests 2 Tests
Game Development 3 Tests
HPC - High Performance Computing 8 Tests
Machine Learning 3 Tests
Molecular Dynamics 2 Tests
MPI Benchmarks 3 Tests
Multi-Core 14 Tests
NVIDIA GPU Compute 6 Tests
Intel oneAPI 2 Tests
OpenMPI Tests 3 Tests
Programmer / Developer System Benchmarks 7 Tests
Scientific Computing 5 Tests
Server 5 Tests
Server CPU Tests 7 Tests
Single-Threaded 4 Tests
Texture Compression 3 Tests
Video Encoding 3 Tests
Vulkan Compute 4 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
December 19 2020
  11 Hours, 50 Minutes
2
December 19 2020
  12 Hours, 43 Minutes
3
December 20 2020
  11 Hours, 42 Minutes
Invert Hiding All Results Option
  12 Hours, 5 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Core i7 4790K 202ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i7-4790K @ 4.40GHz (4 Cores / 8 Threads)Gigabyte Z97-HD3P (F4 BIOS)Intel 4th Gen Core DRAM16GB120GB OCZ TRION100Gigabyte Intel Haswell Desktop 2GB (1250MHz)Intel Xeon E3-1200 v3/4thLG Ultra HDRealtek RTL8111/8168/8411Ubuntu 19.105.9.0-050900rc8daily20201009-generic (x86_64) 20201008GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.54.5 Mesa 19.2.81.1.102GCC 9.2.1 20191008ext43840x2160OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x28 - Thermald 1.9 Java Details- OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-2ubuntu219.10)Python Details- Python 2.7.17 + Python 3.7.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

123Result OverviewPhoronix Test Suite100%105%109%114%CLOMPRedisBetsy GPU CompressorNCNNHPC ChallengeeSpeak-NG Speech EngineLAMMPS Molecular Dynamics SimulatoroneDNNWaifu2x-NCNN VulkanSunflow Rendering Systemrav1eNode.js V8 Web Tooling BenchmarkBuild2simdjsonSQLite SpeedtestCoremarkGROMACSTimed MAFFT AlignmentStockfishBRL-CADNumpy BenchmarkTimed FFmpeg Compilationx265EmbreeLibplaceboLZ4 CompressionWavPack Audio Encodingyquake2asmFishMonkey Audio EncodingBasis UniversalIndigoBenchASTC EncoderKvazaarPHPBenchTimed Eigen CompilationDDraceNetworkTimed HMMer Search

Core i7 4790K 202onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUncnn: CPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v2-v2 - mobilenet-v2onednn: IP Shapes 3D - f32 - CPUwaifu2x-ncnn: 2x - 3 - Yesonednn: Deconvolution Batch shapes_3d - f32 - CPUhpcc: EP-DGEMMncnn: CPU - efficientnet-b0ncnn: Vulkan GPU - yolov4-tinyhpcc: Max Ping Pong Bandwidthonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUncnn: CPU - squeezenet_ssdsimdjson: DistinctUserIDncnn: CPU - blazefacencnn: Vulkan GPU - efficientnet-b0hpcc: G-Rand Accessrav1e: 10rav1e: 6ncnn: CPU - mnasnetbasis: UASTC Level 2 + RDO Post-Processingsimdjson: PartialTweetsncnn: CPU-v3-v3 - mobilenet-v3yquake2: Software CPU - 3840 x 2160onednn: Convolution Batch Shapes Auto - f32 - CPUncnn: CPU - mobilenetncnn: Vulkan GPU - mnasnetrav1e: 1compress-lz4: 9 - Compression Speedhpcc: G-HPLbasis: UASTC Level 0sunflow: Global Illumination + Image Synthesislibplacebo: hdr_peakdetectnode-web-tooling: ncnn: Vulkan GPU - googlenetncnn: CPU - yolov4-tinyyquake2: Software CPU - 2560 x 1440ncnn: CPU - googlenetbuild2: Time To Compilerav1e: 5ncnn: CPU - resnet50onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUsqlite-speedtest: Timed Time - Size 1,000ncnn: CPU - vgg16redis: SADDonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUembree: Pathtracer ISPC - Asian Dragoncoremark: CoreMark Size 666 - Iterations Per Secondncnn: Vulkan GPU - mobilenetncnn: CPU - resnet18ncnn: Vulkan GPU-v3-v3 - mobilenet-v3gromacs: Water Benchmarkncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - squeezenet_ssdmafft: Multiple Sequence Alignment - LSU RNAncnn: CPU - regnety_400monednn: Recurrent Neural Network Inference - f32 - CPUbasis: ETC1Sonednn: Recurrent Neural Network Training - f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUembree: Pathtracer ISPC - Crowncompress-lz4: 3 - Compression Speedyquake2: OpenGL 1.x - 2560 x 1440x265: Bosphorus 1080pstockfish: Total Timeonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUbrl-cad: VGR Performance Metriclibplacebo: polar_nocomputencnn: CPU - alexnetnumpy: embree: Pathtracer - Asian Dragononednn: IP Shapes 1D - u8s8f32 - CPUbuild-ffmpeg: Time To Compilencnn: Vulkan GPU - shufflenet-v2embree: Pathtracer ISPC - Asian Dragon Objyquake2: OpenGL 1.x - 1920 x 1080astcenc: Fastonednn: Recurrent Neural Network Training - u8s8f32 - CPUx265: Bosphorus 4Kembree: Pathtracer - Crownkvazaar: Bosphorus 1080p - Ultra Fastonednn: Recurrent Neural Network Inference - u8s8f32 - CPUhpcc: EP-STREAM Triadncnn: Vulkan GPU - vgg16indigobench: CPU - Bedroomonednn: Deconvolution Batch shapes_1d - f32 - CPUencode-wavpack: WAV To WavPackddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymapasmfish: 1024 Hash Memory, 26 Depthlibplacebo: deband_heavyembree: Pathtracer - Asian Dragon Objcompress-lz4: 1 - Compression Speedkvazaar: Bosphorus 4K - Mediumencode-ape: WAV To APEindigobench: CPU - Supercaryquake2: Software CPU - 1920 x 1080yquake2: OpenGL 3.x - 1920 x 1080compress-lz4: 1 - Decompression Speedkvazaar: Bosphorus 4K - Ultra Fastyquake2: OpenGL 3.x - 3840 x 2160ncnn: CPU - shufflenet-v2yquake2: OpenGL 1.x - 3840 x 2160basis: UASTC Level 2ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - Multeasymapkvazaar: Bosphorus 1080p - Mediumbuild-eigen: Time To Compilekvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 1080p - Very Fastncnn: Vulkan GPU - resnet18ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2phpbench: PHP Benchmark Suiteddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2kvazaar: Bosphorus 1080p - Slowcompress-lz4: 3 - Decompression Speedastcenc: Exhaustivehmmer: Pfam Database Searchyquake2: OpenGL 3.x - 2560 x 1440compress-lz4: 9 - Decompression Speedbasis: UASTC Level 3astcenc: Thoroughlibplacebo: av1_grain_lapastcenc: Mediumkvazaar: Bosphorus 4K - Slowsimdjson: LargeRandsimdjson: Kostyaclomp: Static OMP Speedupncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - blazefaceredis: SETredis: GETredis: LPUSHredis: LPOPespeak: Text-To-Speech Synthesisonednn: IP Shapes 3D - u8s8f32 - CPUlammps: Rhodopsin Proteinhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Ptranshpcc: G-Fftebetsy: ETC2 RGB - Highestbetsy: ETC1 - Highestwaifu2x-ncnn: 2x - 3 - No1236.865088.258.3514.25216.95016.540443.7908711.2846.1719281.57212.393933.220.772.5711.330.026502.5141.0746.83945.4750.736.9127.826.418531.476.800.25551.2890.3130810.5802.28048607.6011.0623.6344.0860.323.93252.4560.82850.696.1677169.413109.272112933.0226.071511.32646.5662159958.25751431.3525.787.050.41022.1433.7712.04516.394730.4073.3908495.759.145294741.155.388852.25130.429.9479901128489.114817513186.9122.24313.685.58065.21940124.8319.525.8902206.87.658480.056.554.750743.434741.204.03790109.450.75010.746013.794101.151224916118.955.20355671.512.2112.1881.68498.0200.96623.310.9862.89.4767.174.38729.589.7370.6616.0624.5725.7913.0371029345.469.466530.1568.11123.727125.96532.7145.88469.57711.8610.392.160.450.681.316.5751.242.581884493.332540423.881611833.092687331.8349.7334.259142.3183.393650.278790.493982.065266.7116.97524.7816.943589.039.1015.07416.53517.533442.5374711.8045.8019517.86012.603634.570.742.6411.710.026252.5951.1087.04949.4050.717.0928.026.922632.226.960.26151.2890.1368210.7222.29349578.6211.2723.5344.9059.724.27250.2640.82351.566.0641869.860110.792140255.0026.140011.28606.6290159234.48205931.7626.117.140.41322.4134.1811.92516.584693.2472.6198410.769.096584713.765.346952.40129.830.1679862878413.384823713223.2022.43313.695.61395.17862125.7939.455.9266208.27.708426.986.594.742343.684714.604.04885109.700.75410.721513.726100.651225497318.955.21015687.382.2212.2431.68797.6200.16643.610.9662.69.4567.274.24829.609.7570.7316.0724.5625.8113.0570996545.409.466527.5567.56123.806125.86537.8145.79469.61711.5110.392.160.450.681.117.4160.767.671808870.972339268.781622814.521682927.1651.4674.164332.2572.829670.327000.495181.956846.6686.28125.9408.223588.548.5915.42156.82416.634744.6802711.6044.1618703.02812.097433.460.742.5511.560.027132.5521.0796.84972.1340.737.1028.527.058831.646.810.25950.1192.1595710.4952.32949012.3211.2223.9744.4560.823.85254.6480.81451.486.0840270.595108.962104918.2625.743511.16416.5417161328.40394531.5325.787.060.40822.2834.1811.90416.454743.2973.2798446.759.187434758.635.339251.92131.029.8979187998482.734859913109.5522.25316.255.56965.18374125.6839.495.8849207.67.708475.516.564.722543.614729.754.06038109.110.75210.778413.724100.851230948619.045.18605697.272.2112.2171.69197.8200.56648.611.0062.79.4867.074.43729.539.7370.6016.0724.6025.7713.0370928245.449.456523.6568.09123.832125.96536.6145.87069.61711.7610.392.160.450.681.316.5952.102.611916548.162314175.991655968.921695087.3351.2135.296262.2583.213550.278470.479801.893106.4146.37425.933OpenBenchmarking.org

DDraceNetwork

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time321714212835Min: 4.01 / Avg: 22.02 / Max: 31.07Min: 5.55 / Avg: 22.02 / Max: 28.23Min: 3.55 / Avg: 22.02 / Max: 28.051. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time321510152025Min: 4.46 / Avg: 9.94 / Max: 17Min: 4.4 / Avg: 9.98 / Max: 16.99Min: 4.45 / Avg: 9.95 / Max: 18.411. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time3211326395265Min: 5.77 / Avg: 34.02 / Max: 65.99Min: 5.76 / Avg: 33.96 / Max: 57.53Min: 6.24 / Avg: 33.97 / Max: 65.291. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU321246810SE +/- 0.11002, N = 15SE +/- 0.09722, N = 3SE +/- 0.00679, N = 38.223586.943586.86508MIN: 7.36MIN: 6.67MIN: 6.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

DDraceNetwork

MinAvgMax372.076.782.4271.676.684.9172.376.885.1OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time204060801001. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v23213691215SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 38.549.038.25MIN: 8.16 / MAX: 12.1MIN: 8.75 / MAX: 11.63MIN: 8.02 / MAX: 11.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v23213691215SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 38.599.108.35MIN: 8.33 / MAX: 13.54MIN: 8.89 / MAX: 12.03MIN: 7.99 / MAX: 21.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU32148121620SE +/- 0.15, N = 15SE +/- 0.13, N = 3SE +/- 0.01, N = 315.4215.0714.25MIN: 14.53MIN: 14.72MIN: 14.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes321246810SE +/- 0.085, N = 15SE +/- 0.096, N = 4SE +/- 0.059, N = 126.8246.5356.950

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU32148121620SE +/- 0.12, N = 3SE +/- 0.26, N = 3SE +/- 0.05, N = 316.6317.5316.54MIN: 16.28MIN: 16.55MIN: 16.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM3211020304050SE +/- 0.66, N = 3SE +/- 0.10, N = 3SE +/- 0.53, N = 344.6842.5443.791. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b03213691215SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 311.6011.8011.28MIN: 11.37 / MAX: 26.59MIN: 11.52 / MAX: 12.14MIN: 10.83 / MAX: 19.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny3211020304050SE +/- 0.15, N = 3SE +/- 0.74, N = 3SE +/- 0.68, N = 344.1645.8046.17MIN: 43.39 / MAX: 57.44MIN: 43.83 / MAX: 58.33MIN: 44.33 / MAX: 611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth3214K8K12K16K20KSE +/- 289.88, N = 3SE +/- 639.68, N = 3SE +/- 146.25, N = 318703.0319517.8619281.571. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU3213691215SE +/- 0.06, N = 3SE +/- 0.18, N = 3SE +/- 0.20, N = 312.1012.6012.39MIN: 10.95MIN: 11.17MIN: 11.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd321816243240SE +/- 0.27, N = 3SE +/- 0.57, N = 3SE +/- 0.07, N = 333.4634.5733.22MIN: 32.94 / MAX: 43.71MIN: 33.72 / MAX: 36.89MIN: 32.95 / MAX: 34.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID3210.17330.34660.51990.69320.8665SE +/- 0.01, N = 3SE +/- 0.01, N = 4SE +/- 0.01, N = 30.740.740.771. (CXX) g++ options: -O3 -pthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface3210.5941.1881.7822.3762.97SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 32.552.642.57MIN: 2.49 / MAX: 2.61MIN: 2.6 / MAX: 2.69MIN: 2.37 / MAX: 2.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b03213691215SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 311.5611.7111.33MIN: 11.28 / MAX: 25.04MIN: 11.47 / MAX: 14.37MIN: 11.18 / MAX: 14.871. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access3210.00610.01220.01830.02440.0305SE +/- 0.00050, N = 3SE +/- 0.00077, N = 3SE +/- 0.00052, N = 30.027130.026250.026501. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 103210.58391.16781.75172.33562.9195SE +/- 0.015, N = 3SE +/- 0.044, N = 3SE +/- 0.012, N = 32.5522.5952.514

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 63210.24930.49860.74790.99721.2465SE +/- 0.013, N = 3SE +/- 0.004, N = 3SE +/- 0.011, N = 31.0791.1081.074

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet321246810SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 36.847.046.83MIN: 6.59 / MAX: 10.33MIN: 6.85 / MAX: 9.1MIN: 6.61 / MAX: 7.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing3212004006008001000SE +/- 12.55, N = 4SE +/- 2.85, N = 3SE +/- 1.28, N = 3972.13949.41945.481. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets3210.16430.32860.49290.65720.8215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 30.730.710.731. (CXX) g++ options: -O3 -pthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3321246810SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 37.107.096.91MIN: 6.88 / MAX: 9.9MIN: 6.9 / MAX: 10.15MIN: 6.69 / MAX: 12.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 3840 x 2160321714212835SE +/- 0.03, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 328.528.027.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU321612182430SE +/- 0.16, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 327.0626.9226.42MIN: 26.33MIN: 26.51MIN: 26.191. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet321714212835SE +/- 0.15, N = 3SE +/- 0.03, N = 3SE +/- 0.26, N = 331.6432.2231.47MIN: 31.25 / MAX: 55.14MIN: 31.89 / MAX: 45.17MIN: 30.7 / MAX: 45.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet321246810SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 36.816.966.80MIN: 6.53 / MAX: 7.12MIN: 6.68 / MAX: 21.89MIN: 6.62 / MAX: 10.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 13210.05870.11740.17610.23480.2935SE +/- 0.002, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 30.2590.2610.255

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed3211224364860SE +/- 0.85, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 350.1151.2851.281. (CC) gcc options: -O3

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL32120406080100SE +/- 1.45, N = 3SE +/- 1.01, N = 6SE +/- 1.21, N = 492.1690.1490.311. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 03213691215SE +/- 0.02, N = 3SE +/- 0.16, N = 3SE +/- 0.09, N = 310.5010.7210.581. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis3210.5241.0481.5722.0962.62SE +/- 0.024, N = 3SE +/- 0.019, N = 3SE +/- 0.012, N = 32.3292.2932.280MIN: 2.21 / MAX: 3.04MIN: 2.17 / MAX: 2.99MIN: 2.17 / MAX: 2.91

Libplacebo

Libplacebo is a multimedia rendering library based on the core rendering code of the MPV player. The libplacebo benchmark relies on the Vulkan API and tests various primitives. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetect32111K22K33K44K55KSE +/- 107.39, N = 3SE +/- 90.21, N = 3SE +/- 736.78, N = 349012.3249578.6248607.601. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Node.js V8 Web Tooling Benchmark

Running the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers like Babel and TypeScript and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark3213691215SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 311.2211.2711.061. Nodejs v10.15.2

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet321612182430SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 323.9723.5323.63MIN: 23.72 / MAX: 36.58MIN: 23.31 / MAX: 26.84MIN: 23.33 / MAX: 36.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny3211020304050SE +/- 0.38, N = 3SE +/- 0.29, N = 3SE +/- 0.72, N = 344.4544.9044.08MIN: 43.34 / MAX: 46.59MIN: 44.01 / MAX: 52.5MIN: 42.69 / MAX: 58.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 2560 x 14403211428425670SE +/- 0.19, N = 3SE +/- 0.43, N = 3SE +/- 0.31, N = 360.859.760.31. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet321612182430SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.14, N = 323.8524.2723.93MIN: 23.58 / MAX: 37.59MIN: 23.92 / MAX: 27MIN: 23.66 / MAX: 24.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile32160120180240300SE +/- 2.21, N = 3SE +/- 1.16, N = 3SE +/- 1.13, N = 3254.65250.26252.46

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 53210.18630.37260.55890.74520.9315SE +/- 0.006, N = 3SE +/- 0.013, N = 3SE +/- 0.005, N = 30.8140.8230.828

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet503211224364860SE +/- 0.47, N = 3SE +/- 0.27, N = 3SE +/- 0.10, N = 351.4851.5650.69MIN: 50.22 / MAX: 66.07MIN: 50.54 / MAX: 64.38MIN: 50.36 / MAX: 63.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU321246810SE +/- 0.06654, N = 3SE +/- 0.01547, N = 3SE +/- 0.01016, N = 36.084026.064186.16771MIN: 5.81MIN: 5.69MIN: 5.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0003211632486480SE +/- 0.54, N = 3SE +/- 0.12, N = 3SE +/- 0.25, N = 370.6069.8669.411. (CC) gcc options: -O2 -ldl -lz -lpthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1632120406080100SE +/- 0.12, N = 3SE +/- 0.82, N = 3SE +/- 0.57, N = 3108.96110.79109.27MIN: 108.21 / MAX: 115.52MIN: 109.24 / MAX: 127.6MIN: 108.04 / MAX: 121.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD321500K1000K1500K2000K2500KSE +/- 30741.46, N = 13SE +/- 8400.74, N = 3SE +/- 33704.56, N = 122104918.262140255.002112933.021. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU321612182430SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 325.7426.1426.07MIN: 25.45MIN: 25.95MIN: 25.781. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3213691215SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 311.1611.2911.33MIN: 10.46MIN: 10.69MIN: 10.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon321246810SE +/- 0.0287, N = 3SE +/- 0.0431, N = 3SE +/- 0.0417, N = 36.54176.62906.5662MIN: 6.45 / MAX: 6.74MIN: 6.51 / MAX: 6.81MIN: 6.46 / MAX: 6.79

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second32130K60K90K120K150KSE +/- 104.15, N = 3SE +/- 1169.64, N = 3SE +/- 270.17, N = 3161328.40159234.48159958.261. (CC) gcc options: -O2 -lrt" -lrt

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet321714212835SE +/- 0.09, N = 3SE +/- 0.24, N = 3SE +/- 0.06, N = 331.5331.7631.35MIN: 31.22 / MAX: 33.07MIN: 31.13 / MAX: 48.02MIN: 31.01 / MAX: 43.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18321612182430SE +/- 0.08, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 325.7826.1125.78MIN: 25.3 / MAX: 26.66MIN: 25.75 / MAX: 41.28MIN: 25.38 / MAX: 39.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3321246810SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 37.067.147.05MIN: 6.87 / MAX: 10.89MIN: 6.95 / MAX: 21.34MIN: 6.84 / MAX: 8.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark3210.09290.18580.27870.37160.4645SE +/- 0.006, N = 4SE +/- 0.003, N = 3SE +/- 0.005, N = 30.4080.4130.4101. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet321510152025SE +/- 0.10, N = 3SE +/- 0.17, N = 3SE +/- 0.13, N = 322.2822.4122.14MIN: 21.96 / MAX: 22.64MIN: 21.96 / MAX: 34.32MIN: 21.81 / MAX: 24.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd321816243240SE +/- 0.14, N = 3SE +/- 0.27, N = 3SE +/- 0.38, N = 334.1834.1833.77MIN: 33.24 / MAX: 46.06MIN: 32.78 / MAX: 44.2MIN: 32.69 / MAX: 43.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA3213691215SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 311.9011.9312.051. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m32148121620SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 316.4516.5816.39MIN: 16.16 / MAX: 17.78MIN: 16.23 / MAX: 31.03MIN: 16.23 / MAX: 18.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU32110002000300040005000SE +/- 3.10, N = 3SE +/- 15.09, N = 3SE +/- 0.95, N = 34743.294693.244730.40MIN: 4730.58MIN: 4657.74MIN: 4721.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S3211632486480SE +/- 0.38, N = 3SE +/- 0.68, N = 3SE +/- 0.54, N = 373.2872.6273.391. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU3212K4K6K8K10KSE +/- 15.35, N = 3SE +/- 20.81, N = 3SE +/- 17.63, N = 38446.758410.768495.75MIN: 8413.69MIN: 8366.44MIN: 8463.891. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU3213691215SE +/- 0.04734, N = 3SE +/- 0.14569, N = 3SE +/- 0.03949, N = 39.187439.096589.14529MIN: 8.82MIN: 8.69MIN: 8.741. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU32110002000300040005000SE +/- 24.18, N = 3SE +/- 16.47, N = 3SE +/- 15.27, N = 34758.634713.764741.15MIN: 4700.29MIN: 4686.24MIN: 4705.911. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown3211.21252.4253.63754.856.0625SE +/- 0.0582, N = 3SE +/- 0.0293, N = 3SE +/- 0.0366, N = 35.33925.34695.3888MIN: 5.19 / MAX: 5.51MIN: 5.26 / MAX: 5.47MIN: 5.28 / MAX: 5.51

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed3211224364860SE +/- 0.23, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 351.9252.4052.251. (CC) gcc options: -O3

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 2560 x 1440321306090120150SE +/- 0.12, N = 3SE +/- 0.55, N = 3SE +/- 0.37, N = 3131.0129.8130.41. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p321714212835SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.08, N = 329.8930.1629.941. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time3212M4M6M8M10MSE +/- 100608.19, N = 3SE +/- 53934.69, N = 3SE +/- 42720.90, N = 37918799798628779901121. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU3212K4K6K8K10KSE +/- 20.61, N = 3SE +/- 13.64, N = 3SE +/- 11.29, N = 38482.738413.388489.11MIN: 8441.98MIN: 8383.33MIN: 8460.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric32110K20K30K40K50K4859948237481751. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Libplacebo

Libplacebo is a multimedia rendering library based on the core rendering code of the MPV player. The libplacebo benchmark relies on the Vulkan API and tests various primitives. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocompute3213K6K9K12K15KSE +/- 44.87, N = 3SE +/- 37.92, N = 3SE +/- 29.01, N = 313109.5513223.2013186.911. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet321510152025SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.12, N = 322.2522.4322.24MIN: 22 / MAX: 36.1MIN: 21.97 / MAX: 35.74MIN: 21.89 / MAX: 24.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark32170140210280350SE +/- 0.35, N = 3SE +/- 0.64, N = 3SE +/- 0.12, N = 3316.25313.69313.68

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon3211.26312.52623.78935.05246.3155SE +/- 0.0036, N = 3SE +/- 0.0161, N = 3SE +/- 0.0040, N = 35.56965.61395.5806MIN: 5.5 / MAX: 5.65MIN: 5.54 / MAX: 5.72MIN: 5.53 / MAX: 5.66

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3211.17442.34883.52324.69765.872SE +/- 0.01621, N = 3SE +/- 0.00946, N = 3SE +/- 0.00525, N = 35.183745.178625.21940MIN: 4.84MIN: 4.94MIN: 4.851. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile321306090120150SE +/- 0.37, N = 3SE +/- 0.58, N = 3SE +/- 0.10, N = 3125.68125.79124.83

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v23213691215SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 39.499.459.52MIN: 9.39 / MAX: 11.92MIN: 9.35 / MAX: 12.21MIN: 9.42 / MAX: 11.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj3211.33352.6674.00055.3346.6675SE +/- 0.0272, N = 3SE +/- 0.0196, N = 3SE +/- 0.0176, N = 35.88495.92665.8902MIN: 5.81 / MAX: 5.99MIN: 5.87 / MAX: 6.03MIN: 5.83 / MAX: 6

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 1920 x 108032150100150200250SE +/- 0.32, N = 3SE +/- 0.55, N = 3SE +/- 0.42, N = 3207.6208.2206.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast321246810SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 37.707.707.651. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU3212K4K6K8K10KSE +/- 25.14, N = 3SE +/- 7.59, N = 3SE +/- 6.10, N = 38475.518426.988480.05MIN: 8432.62MIN: 8406.64MIN: 8460.681. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K321246810SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 36.566.596.551. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown3211.06892.13783.20674.27565.3445SE +/- 0.0074, N = 3SE +/- 0.0037, N = 3SE +/- 0.0091, N = 34.72254.74234.7507MIN: 4.69 / MAX: 4.82MIN: 4.71 / MAX: 4.82MIN: 4.71 / MAX: 4.83

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast3211020304050SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 343.6143.6843.431. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU32110002000300040005000SE +/- 19.23, N = 3SE +/- 5.23, N = 3SE +/- 22.26, N = 34729.754714.604741.20MIN: 4687.53MIN: 4700.85MIN: 4710.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad3210.91361.82722.74083.65444.568SE +/- 0.01163, N = 3SE +/- 0.00333, N = 3SE +/- 0.01197, N = 34.060384.048854.037901. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1632120406080100SE +/- 0.21, N = 3SE +/- 0.32, N = 3SE +/- 0.26, N = 3109.11109.70109.45MIN: 108.06 / MAX: 125.36MIN: 108.72 / MAX: 136.67MIN: 108.54 / MAX: 123.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom3210.16970.33940.50910.67880.8485SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 30.7520.7540.750

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU3213691215SE +/- 0.11, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 310.7810.7210.75MIN: 10.43MIN: 9.95MIN: 10.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WavPack Audio Encoding

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack32148121620SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.04, N = 513.7213.7313.791. (CXX) g++ options: -rdynamic

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap32120406080100SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.26, N = 3100.85100.65101.15MIN: 37.89 / MAX: 224.47MIN: 58.03 / MAX: 227.27MIN: 54.32 / MAX: 224.921. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth3213M6M9M12M15MSE +/- 130255.47, N = 3SE +/- 117949.09, N = 3SE +/- 93407.88, N = 3123094861225497312249161

Libplacebo

Libplacebo is a multimedia rendering library based on the core rendering code of the MPV player. The libplacebo benchmark relies on the Vulkan API and tests various primitives. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavy321510152025SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 319.0418.9518.951. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj3211.17232.34463.51694.68925.8615SE +/- 0.0051, N = 3SE +/- 0.0125, N = 3SE +/- 0.0107, N = 35.18605.21015.2035MIN: 5.15 / MAX: 5.26MIN: 5.17 / MAX: 5.28MIN: 5.16 / MAX: 5.26

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed32112002400360048006000SE +/- 2.69, N = 3SE +/- 5.97, N = 3SE +/- 7.62, N = 35697.275687.385671.511. (CC) gcc options: -O3

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium3210.49950.9991.49851.9982.4975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.212.222.211. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE3213691215SE +/- 0.06, N = 5SE +/- 0.04, N = 5SE +/- 0.02, N = 512.2212.2412.191. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar3210.38050.7611.14151.5221.9025SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.005, N = 31.6911.6871.684

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 108032120406080100SE +/- 0.38, N = 3SE +/- 0.98, N = 3SE +/- 0.87, N = 397.897.698.01. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 10803214080120160200SE +/- 0.95, N = 3SE +/- 0.23, N = 3SE +/- 0.53, N = 3200.5200.1200.91. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed32114002800420056007000SE +/- 1.39, N = 3SE +/- 4.53, N = 3SE +/- 10.10, N = 36648.66643.66623.31. (CC) gcc options: -O3

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast3213691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 311.0010.9610.981. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 3840 x 21603211428425670SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 362.762.662.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v23213691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 39.489.459.47MIN: 9.35 / MAX: 13.34MIN: 9.32 / MAX: 12.68MIN: 9.36 / MAX: 12.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 3840 x 21603211530456075SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.09, N = 367.067.267.11. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 232120406080100SE +/- 0.22, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 374.4474.2574.391. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap321714212835SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 329.5329.6029.58MIN: 15.15 / MAX: 181.72MIN: 14.76 / MAX: 161.92MIN: 14.88 / MAX: 175.931. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium3213691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 39.739.759.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile3211632486480SE +/- 0.04, N = 3SE +/- 0.20, N = 3SE +/- 0.05, N = 370.6070.7370.66

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast321246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 36.076.076.061. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast321612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 324.6024.5624.571. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18321612182430SE +/- 0.32, N = 3SE +/- 0.19, N = 3SE +/- 0.29, N = 325.7725.8125.79MIN: 25 / MAX: 26.95MIN: 25.13 / MAX: 40.98MIN: 24.99 / MAX: 28.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore23213691215SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 313.0313.0513.03MIN: 11.77 / MAX: 14.01MIN: 10.32 / MAX: 14.01MIN: 11.75 / MAX: 13.981. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite321150K300K450K600K750KSE +/- 709.37, N = 3SE +/- 2148.66, N = 3SE +/- 448.34, N = 3709282709965710293

DDraceNetwork

This is a test of DDraceNetwork, an open-source cooperative platformer. OpenGL 3.3 is used for rendering, with fallbacks for older OpenGL versions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore23211020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 345.4445.4045.46MIN: 25.99 / MAX: 180.9MIN: 33.02 / MAX: 50.63MIN: 25.16 / MAX: 257.861. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow3213691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.459.469.461. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed32114002800420056007000SE +/- 1.37, N = 3SE +/- 1.58, N = 3SE +/- 1.59, N = 36523.66527.56530.11. (CC) gcc options: -O3

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive321120240360480600SE +/- 0.25, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3568.09567.56568.111. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search321306090120150SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 3123.83123.81123.731. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 2560 x 1440321306090120150SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3125.9125.8125.91. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed32114002800420056007000SE +/- 1.35, N = 3SE +/- 5.32, N = 3SE +/- 3.54, N = 36536.66537.86532.71. (CC) gcc options: -O3

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3321306090120150SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 0.14, N = 3145.87145.79145.881. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough3211530456075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 369.6169.6169.571. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Libplacebo

Libplacebo is a multimedia rendering library based on the core rendering code of the MPV player. The libplacebo benchmark relies on the Vulkan API and tests various primitives. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lap321150300450600750SE +/- 0.22, N = 3SE +/- 0.36, N = 3SE +/- 0.24, N = 3711.76711.51711.861. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium3213691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310.3910.3910.391. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow3210.4860.9721.4581.9442.43SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.162.162.161. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.10130.20260.30390.40520.5065SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.450.450.451. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.1530.3060.4590.6120.765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.680.680.681. (CXX) g++ options: -O3 -pthread

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup3210.29250.5850.87751.171.4625SE +/- 0.03, N = 12SE +/- 0.05, N = 12SE +/- 0.05, N = 121.31.11.31. (CC) gcc options: -fopenmp -O3 -lm

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m32148121620SE +/- 0.05, N = 3SE +/- 0.68, N = 3SE +/- 0.10, N = 316.5917.4116.57MIN: 16.38 / MAX: 20.04MIN: 16.52 / MAX: 269.38MIN: 16.28 / MAX: 16.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet503211428425670SE +/- 0.55, N = 3SE +/- 8.74, N = 3SE +/- 0.54, N = 352.1060.7651.24MIN: 50.52 / MAX: 64.77MIN: 51.25 / MAX: 1056.17MIN: 50.5 / MAX: 66.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface321246810SE +/- 0.03, N = 3SE +/- 5.13, N = 3SE +/- 0.02, N = 32.617.672.58MIN: 2.53 / MAX: 2.69MIN: 2.41 / MAX: 416.67MIN: 2.47 / MAX: 2.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET321400K800K1200K1600K2000KSE +/- 25199.46, N = 3SE +/- 49673.53, N = 15SE +/- 29746.00, N = 31916548.161808870.971884493.331. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET321500K1000K1500K2000K2500KSE +/- 48510.25, N = 12SE +/- 23853.74, N = 8SE +/- 21911.47, N = 132314175.992339268.782540423.881. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH321400K800K1200K1600K2000KSE +/- 5741.58, N = 3SE +/- 20312.72, N = 15SE +/- 35412.98, N = 121655968.921622814.521611833.091. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP321600K1200K1800K2400K3000KSE +/- 2760.10, N = 3SE +/- 86194.23, N = 13SE +/- 37569.90, N = 31695087.331682927.162687331.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis3211224364860SE +/- 0.37, N = 4SE +/- 0.86, N = 17SE +/- 1.23, N = 1651.2151.4749.731. (CC) gcc options: -O2 -std=c99

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU3211.19172.38343.57514.76685.9585SE +/- 0.09775, N = 15SE +/- 0.00978, N = 3SE +/- 0.00331, N = 35.296264.164334.25914MIN: 4.25MIN: 4.05MIN: 4.151. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein3210.52161.04321.56482.08642.608SE +/- 0.031, N = 15SE +/- 0.052, N = 14SE +/- 0.064, N = 122.2582.2572.3181. (CXX) g++ options: -O3 -pthread -lm

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth3210.76361.52722.29083.05443.818SE +/- 0.24447, N = 3SE +/- 0.10194, N = 3SE +/- 0.09620, N = 33.213552.829673.393651. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency3210.07360.14720.22080.29440.368SE +/- 0.04588, N = 3SE +/- 0.00582, N = 3SE +/- 0.04635, N = 30.278470.327000.278791. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans3210.11140.22280.33420.44560.557SE +/- 0.01039, N = 3SE +/- 0.03821, N = 3SE +/- 0.01258, N = 30.479800.495180.493981. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte3210.46470.92941.39411.85882.3235SE +/- 0.07276, N = 3SE +/- 0.13431, N = 3SE +/- 0.08108, N = 31.893101.956842.065261. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highest321246810SE +/- 0.127, N = 12SE +/- 0.117, N = 15SE +/- 0.431, N = 156.4146.6686.7111. (CXX) g++ options: -O3 -O2 -lpthread -ldl

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highest321246810SE +/- 0.143, N = 12SE +/- 0.134, N = 12SE +/- 0.422, N = 156.3746.2816.9751. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: No321612182430SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 1.00, N = 1225.9325.9424.78

148 Results Shown

DDraceNetwork:
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2 - Total Frame Time
  1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymap - Total Frame Time
  3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - Multeasymap - Total Frame Time
oneDNN
DDraceNetwork
NCNN:
  CPU-v2-v2 - mobilenet-v2
  Vulkan GPU-v2-v2 - mobilenet-v2
oneDNN
Waifu2x-NCNN Vulkan
oneDNN
HPC Challenge
NCNN:
  CPU - efficientnet-b0
  Vulkan GPU - yolov4-tiny
HPC Challenge
oneDNN
NCNN
simdjson
NCNN:
  CPU - blazeface
  Vulkan GPU - efficientnet-b0
HPC Challenge
rav1e:
  10
  6
NCNN
Basis Universal
simdjson
NCNN
yquake2
oneDNN
NCNN:
  CPU - mobilenet
  Vulkan GPU - mnasnet
rav1e
LZ4 Compression
HPC Challenge
Basis Universal
Sunflow Rendering System
Libplacebo
Node.js V8 Web Tooling Benchmark
NCNN:
  Vulkan GPU - googlenet
  CPU - yolov4-tiny
yquake2
NCNN
Build2
rav1e
NCNN
oneDNN
SQLite Speedtest
NCNN
Redis
oneDNN:
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
Embree
Coremark
NCNN:
  Vulkan GPU - mobilenet
  CPU - resnet18
  Vulkan GPU-v3-v3 - mobilenet-v3
GROMACS
NCNN:
  Vulkan GPU - alexnet
  Vulkan GPU - squeezenet_ssd
Timed MAFFT Alignment
NCNN
oneDNN
Basis Universal
oneDNN:
  Recurrent Neural Network Training - f32 - CPU
  IP Shapes 1D - f32 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Embree
LZ4 Compression
yquake2
x265
Stockfish
oneDNN
BRL-CAD
Libplacebo
NCNN
Numpy Benchmark
Embree
oneDNN
Timed FFmpeg Compilation
NCNN
Embree
yquake2
ASTC Encoder
oneDNN
x265
Embree
Kvazaar
oneDNN
HPC Challenge
NCNN
IndigoBench
oneDNN
WavPack Audio Encoding
DDraceNetwork
asmFish
Libplacebo
Embree
LZ4 Compression
Kvazaar
Monkey Audio Encoding
IndigoBench
yquake2:
  Software CPU - 1920 x 1080
  OpenGL 3.x - 1920 x 1080
LZ4 Compression
Kvazaar
yquake2
NCNN
yquake2
Basis Universal
DDraceNetwork
Kvazaar
Timed Eigen Compilation
Kvazaar:
  Bosphorus 4K - Very Fast
  Bosphorus 1080p - Very Fast
NCNN
DDraceNetwork
PHPBench
DDraceNetwork
Kvazaar
LZ4 Compression
ASTC Encoder
Timed HMMer Search
yquake2
LZ4 Compression
Basis Universal
ASTC Encoder
Libplacebo
ASTC Encoder
Kvazaar
simdjson:
  LargeRand
  Kostya
CLOMP
NCNN:
  Vulkan GPU - regnety_400m
  Vulkan GPU - resnet50
  Vulkan GPU - blazeface
Redis:
  SET
  GET
  LPUSH
  LPOP
eSpeak-NG Speech Engine
oneDNN
LAMMPS Molecular Dynamics Simulator
HPC Challenge:
  Rand Ring Bandwidth
  Rand Ring Latency
  G-Ptrans
  G-Ffte
Betsy GPU Compressor:
  ETC2 RGB - Highest
  ETC1 - Highest
Waifu2x-NCNN Vulkan