Core i7 4790K 202

Intel Core i7-4790K testing with a Gigabyte Z97-HD3P (F4 BIOS) and Gigabyte Intel Haswell Desktop 2GB on Ubuntu 19.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012207-HA-COREI747935&sro.

Core i7 4790K 202ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i7-4790K @ 4.40GHz (4 Cores / 8 Threads)Gigabyte Z97-HD3P (F4 BIOS)Intel 4th Gen Core DRAM16GB120GB OCZ TRION100Gigabyte Intel Haswell Desktop 2GB (1250MHz)Intel Xeon E3-1200 v3/4thLG Ultra HDRealtek RTL8111/8168/8411Ubuntu 19.105.9.0-050900rc8daily20201009-generic (x86_64) 20201008GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.54.5 Mesa 19.2.81.1.102GCC 9.2.1 20191008ext43840x2160OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x28 - Thermald 1.9 Java Details- OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-2ubuntu219.10)Python Details- Python 2.7.17 + Python 3.7.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Core i7 4790K 202waifu2x-ncnn: 2x - 3 - Nowaifu2x-ncnn: 2x - 3 - Yeslibplacebo: deband_heavylibplacebo: polar_nocomputelibplacebo: hdr_peakdetectlibplacebo: av1_grain_lapbetsy: ETC1 - Highestbetsy: ETC2 RGB - Highestddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymapddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - Multeasymapyquake2: OpenGL 1.x - 1920 x 1080yquake2: OpenGL 1.x - 2560 x 1440yquake2: OpenGL 1.x - 3840 x 2160yquake2: OpenGL 3.x - 1920 x 1080yquake2: OpenGL 3.x - 2560 x 1440yquake2: OpenGL 3.x - 3840 x 2160yquake2: Software CPU - 1920 x 1080yquake2: Software CPU - 2560 x 1440yquake2: Software CPU - 3840 x 2160hpcc: G-HPLhpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthhmmer: Pfam Database Searchmafft: Multiple Sequence Alignment - LSU RNAlammps: Rhodopsin Proteinsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDcompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUembree: Pathtracer - Crownembree: Pathtracer ISPC - Crownembree: Pathtracer - Asian Dragonembree: Pathtracer - Asian Dragon Objembree: Pathtracer ISPC - Asian Dragonembree: Pathtracer ISPC - Asian Dragon Objkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Ultra Fastrav1e: 1rav1e: 5rav1e: 6rav1e: 10x265: Bosphorus 4Kx265: Bosphorus 1080pcoremark: CoreMark Size 666 - Iterations Per Secondstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthbuild-ffmpeg: Time To Compilebuild2: Time To Compilenumpy: espeak: Text-To-Speech Synthesisnode-web-tooling: gromacs: Water Benchmarkastcenc: Fastastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3basis: UASTC Level 2 + RDO Post-Processingsqlite-speedtest: Timed Time - Size 1,000redis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mindigobench: CPU - Bedroomindigobench: CPU - Supercarphpbench: PHP Benchmark Suitesunflow: Global Illumination + Image Synthesisbrl-cad: VGR Performance Metricclomp: Static OMP Speedupbuild-eigen: Time To Compileencode-ape: WAV To APEencode-wavpack: WAV To WavPack12324.7816.95018.9513186.9148607.60711.866.9756.71145.4613.03101.1529.58206.8130.467.1200.9125.962.898.060.327.890.313082.0652643.790870.493984.037900.026500.278793.3936519281.572123.72712.0452.3180.680.450.730.775671.516623.352.256530.151.286532.79.1452914.25215.219404.2591426.418510.746016.540426.071512.393911.32648495.754730.408480.054741.206.865088489.114741.156.167714.75075.38885.58065.20356.56625.89022.162.219.469.736.0610.9824.5743.430.2550.8281.0742.5146.5529.94159958.257514799011212249161124.831252.456313.6849.73311.060.4107.6510.3969.57568.1173.39010.58074.387145.884945.47569.4132687331.832112933.021611833.092540423.881884493.3331.478.256.919.476.8311.282.5723.93109.2725.7822.2450.6944.0833.2216.3931.358.357.059.526.8011.332.5823.63109.4525.7922.1451.2446.1733.7716.570.7501.6847102932.280481751.370.66112.18813.79425.9406.53518.9513223.2049578.62711.516.2816.66845.4013.05100.6529.60208.2129.867.2200.1125.862.697.659.728.090.136821.9568442.537470.495184.048850.026250.327002.8296719517.860123.80611.9252.2570.680.450.710.745687.386643.652.406527.551.286537.89.0965815.07415.178624.1643326.922610.721517.533426.140012.603611.28608410.764693.248426.984714.606.943588413.384713.766.064184.74235.34695.61395.21016.62905.92662.162.229.469.756.0710.9624.5643.680.2610.8231.1082.5956.5930.16159234.482059798628712254973125.793250.264313.6951.46711.270.4137.7010.3969.61567.5672.61910.72274.248145.794949.40569.8601682927.162140255.001622814.522339268.781808870.9732.229.037.099.457.0411.802.6424.27110.7926.1122.4351.5644.9034.5716.5831.769.107.149.456.9611.717.6723.53109.7025.8122.4160.7645.8034.1817.410.7541.6877099652.293482371.170.73112.24313.72625.9336.82419.0413109.5549012.32711.766.3746.41445.4413.03100.8529.53207.6131.067.0200.5125.962.797.860.828.592.159571.8931044.680270.479804.060380.027130.278473.2135518703.028123.83211.9042.2580.680.450.730.745697.276648.651.926523.650.116536.69.1874315.42155.183745.2962627.058810.778416.634725.743512.097411.16418446.754743.298475.514729.758.223588482.734758.636.084024.72255.33925.56965.18606.54175.88492.162.219.459.736.0711.0024.6043.610.2590.8141.0792.5526.5629.89161328.403945791879912309486125.683254.648316.2551.21311.220.4087.7010.3969.61568.0973.27910.49574.437145.870972.13470.5951695087.332104918.261655968.922314175.991916548.1631.648.547.109.486.8411.602.5523.85108.9625.7822.2551.4844.4533.4616.4531.538.597.069.496.8111.562.6123.97109.1125.7722.2852.1044.1634.1816.590.7521.6917092822.329485991.370.60112.21713.724OpenBenchmarking.org

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: No123612182430SE +/- 1.00, N = 12SE +/- 0.11, N = 3SE +/- 0.06, N = 324.7825.9425.93

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes123246810SE +/- 0.059, N = 12SE +/- 0.096, N = 4SE +/- 0.085, N = 156.9506.5356.824

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavy123510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.09, N = 318.9518.9519.041. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocompute1233K6K9K12K15KSE +/- 29.01, N = 3SE +/- 37.92, N = 3SE +/- 44.87, N = 313186.9113223.2013109.551. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetect12311K22K33K44K55KSE +/- 736.78, N = 3SE +/- 90.21, N = 3SE +/- 107.39, N = 348607.6049578.6249012.321. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lap123150300450600750SE +/- 0.24, N = 3SE +/- 0.36, N = 3SE +/- 0.22, N = 3711.86711.51711.761. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highest123246810SE +/- 0.422, N = 15SE +/- 0.134, N = 12SE +/- 0.143, N = 126.9756.2816.3741. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highest123246810SE +/- 0.431, N = 15SE +/- 0.117, N = 15SE +/- 0.127, N = 126.7116.6686.4141. (CXX) g++ options: -O3 -O2 -lpthread -ldl

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore21231020304050SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 345.4645.4045.44MIN: 25.16 / MAX: 257.86MIN: 33.02 / MAX: 50.63MIN: 25.99 / MAX: 180.91. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time123714212835Min: 3.55 / Avg: 22.02 / Max: 28.05Min: 5.55 / Avg: 22.02 / Max: 28.23Min: 4.01 / Avg: 22.02 / Max: 31.071. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore21233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 313.0313.0513.03MIN: 11.75 / MAX: 13.98MIN: 10.32 / MAX: 14.01MIN: 11.77 / MAX: 14.011. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

MinAvgMax172.376.885.1271.676.684.9372.076.782.4OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time204060801001. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap12320406080100SE +/- 0.26, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3101.15100.65100.85MIN: 54.32 / MAX: 224.92MIN: 58.03 / MAX: 227.27MIN: 37.89 / MAX: 224.471. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time123510152025Min: 4.45 / Avg: 9.95 / Max: 18.41Min: 4.4 / Avg: 9.98 / Max: 16.99Min: 4.46 / Avg: 9.94 / Max: 171. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap123714212835SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 329.5829.6029.53MIN: 14.88 / MAX: 175.93MIN: 14.76 / MAX: 161.92MIN: 15.15 / MAX: 181.721. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time1231326395265Min: 6.24 / Avg: 33.97 / Max: 65.29Min: 5.76 / Avg: 33.96 / Max: 57.53Min: 5.77 / Avg: 34.02 / Max: 65.991. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

yquake2

Renderer: OpenGL 1.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 1920 x 108012350100150200250SE +/- 0.42, N = 3SE +/- 0.55, N = 3SE +/- 0.32, N = 3206.8208.2207.61. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: OpenGL 1.x - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 2560 x 1440123306090120150SE +/- 0.37, N = 3SE +/- 0.55, N = 3SE +/- 0.12, N = 3130.4129.8131.01. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: OpenGL 1.x - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 3840 x 21601231530456075SE +/- 0.09, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 367.167.267.01. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: OpenGL 3.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 10801234080120160200SE +/- 0.53, N = 3SE +/- 0.23, N = 3SE +/- 0.95, N = 3200.9200.1200.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: OpenGL 3.x - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 2560 x 1440123306090120150SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.17, N = 3125.9125.8125.91. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: OpenGL 3.x - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 3840 x 21601231428425670SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 362.862.662.71. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 108012320406080100SE +/- 0.87, N = 3SE +/- 0.98, N = 3SE +/- 0.38, N = 398.097.697.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 2560 x 14401231428425670SE +/- 0.31, N = 3SE +/- 0.43, N = 3SE +/- 0.19, N = 360.359.760.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: Software CPU - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 3840 x 2160123714212835SE +/- 0.09, N = 3SE +/- 0.24, N = 3SE +/- 0.03, N = 327.828.028.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL12320406080100SE +/- 1.21, N = 4SE +/- 1.01, N = 6SE +/- 1.45, N = 390.3190.1492.161. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1230.46470.92941.39411.85882.3235SE +/- 0.08108, N = 3SE +/- 0.13431, N = 3SE +/- 0.07276, N = 32.065261.956841.893101. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM1231020304050SE +/- 0.53, N = 3SE +/- 0.10, N = 3SE +/- 0.66, N = 343.7942.5444.681. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans1230.11140.22280.33420.44560.557SE +/- 0.01258, N = 3SE +/- 0.03821, N = 3SE +/- 0.01039, N = 30.493980.495180.479801. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad1230.91361.82722.74083.65444.568SE +/- 0.01197, N = 3SE +/- 0.00333, N = 3SE +/- 0.01163, N = 34.037904.048854.060381. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1230.00610.01220.01830.02440.0305SE +/- 0.00052, N = 3SE +/- 0.00077, N = 3SE +/- 0.00050, N = 30.026500.026250.027131. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency1230.07360.14720.22080.29440.368SE +/- 0.04635, N = 3SE +/- 0.00582, N = 3SE +/- 0.04588, N = 30.278790.327000.278471. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1230.76361.52722.29083.05443.818SE +/- 0.09620, N = 3SE +/- 0.10194, N = 3SE +/- 0.24447, N = 33.393652.829673.213551. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1234K8K12K16K20KSE +/- 146.25, N = 3SE +/- 639.68, N = 3SE +/- 289.88, N = 319281.5719517.8618703.031. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3123.73123.81123.831. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 312.0511.9311.901. (CC) gcc options: -std=c99 -O3 -lm -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.52161.04321.56482.08642.608SE +/- 0.064, N = 12SE +/- 0.052, N = 14SE +/- 0.031, N = 152.3182.2572.2581. (CXX) g++ options: -O3 -pthread -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1530.3060.4590.6120.765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.680.680.681. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.10130.20260.30390.40520.5065SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.450.450.451. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.16430.32860.49290.65720.8215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.730.710.731. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.17330.34660.51990.69320.8665SE +/- 0.01, N = 3SE +/- 0.01, N = 4SE +/- 0.01, N = 30.770.740.741. (CXX) g++ options: -O3 -pthread

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12312002400360048006000SE +/- 7.62, N = 3SE +/- 5.97, N = 3SE +/- 2.69, N = 35671.515687.385697.271. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed12314002800420056007000SE +/- 10.10, N = 3SE +/- 4.53, N = 3SE +/- 1.39, N = 36623.36643.66648.61. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231224364860SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.23, N = 352.2552.4051.921. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed12314002800420056007000SE +/- 1.59, N = 3SE +/- 1.58, N = 3SE +/- 1.37, N = 36530.16527.56523.61. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231224364860SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.85, N = 351.2851.2850.111. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed12314002800420056007000SE +/- 3.54, N = 3SE +/- 5.32, N = 3SE +/- 1.35, N = 36532.76537.86536.61. (CC) gcc options: -O3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.03949, N = 3SE +/- 0.14569, N = 3SE +/- 0.04734, N = 39.145299.096589.18743MIN: 8.74MIN: 8.69MIN: 8.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU12348121620SE +/- 0.01, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 1514.2515.0715.42MIN: 14.13MIN: 14.72MIN: 14.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1231.17442.34883.52324.69765.872SE +/- 0.00525, N = 3SE +/- 0.00946, N = 3SE +/- 0.01621, N = 35.219405.178625.18374MIN: 4.85MIN: 4.94MIN: 4.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1231.19172.38343.57514.76685.9585SE +/- 0.00331, N = 3SE +/- 0.00978, N = 3SE +/- 0.09775, N = 154.259144.164335.29626MIN: 4.15MIN: 4.05MIN: 4.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 326.4226.9227.06MIN: 26.19MIN: 26.51MIN: 26.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 310.7510.7210.78MIN: 10.34MIN: 9.95MIN: 10.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12348121620SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 0.12, N = 316.5417.5316.63MIN: 16.27MIN: 16.55MIN: 16.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 326.0726.1425.74MIN: 25.78MIN: 25.95MIN: 25.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.20, N = 3SE +/- 0.18, N = 3SE +/- 0.06, N = 312.3912.6012.10MIN: 11.1MIN: 11.17MIN: 10.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 311.3311.2911.16MIN: 10.76MIN: 10.69MIN: 10.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 17.63, N = 3SE +/- 20.81, N = 3SE +/- 15.35, N = 38495.758410.768446.75MIN: 8463.89MIN: 8366.44MIN: 8413.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU12310002000300040005000SE +/- 0.95, N = 3SE +/- 15.09, N = 3SE +/- 3.10, N = 34730.404693.244743.29MIN: 4721.36MIN: 4657.74MIN: 4730.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1232K4K6K8K10KSE +/- 6.10, N = 3SE +/- 7.59, N = 3SE +/- 25.14, N = 38480.058426.988475.51MIN: 8460.68MIN: 8406.64MIN: 8432.621. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12310002000300040005000SE +/- 22.26, N = 3SE +/- 5.23, N = 3SE +/- 19.23, N = 34741.204714.604729.75MIN: 4710.31MIN: 4700.85MIN: 4687.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810SE +/- 0.00679, N = 3SE +/- 0.09722, N = 3SE +/- 0.11002, N = 156.865086.943588.22358MIN: 6.69MIN: 6.67MIN: 7.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 11.29, N = 3SE +/- 13.64, N = 3SE +/- 20.61, N = 38489.118413.388482.73MIN: 8460.3MIN: 8383.33MIN: 8441.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 15.27, N = 3SE +/- 16.47, N = 3SE +/- 24.18, N = 34741.154713.764758.63MIN: 4705.91MIN: 4686.24MIN: 4700.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01016, N = 3SE +/- 0.01547, N = 3SE +/- 0.06654, N = 36.167716.064186.08402MIN: 5.88MIN: 5.69MIN: 5.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown1231.06892.13783.20674.27565.3445SE +/- 0.0091, N = 3SE +/- 0.0037, N = 3SE +/- 0.0074, N = 34.75074.74234.7225MIN: 4.71 / MAX: 4.83MIN: 4.71 / MAX: 4.82MIN: 4.69 / MAX: 4.82

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown1231.21252.4253.63754.856.0625SE +/- 0.0366, N = 3SE +/- 0.0293, N = 3SE +/- 0.0582, N = 35.38885.34695.3392MIN: 5.28 / MAX: 5.51MIN: 5.26 / MAX: 5.47MIN: 5.19 / MAX: 5.51

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon1231.26312.52623.78935.05246.3155SE +/- 0.0040, N = 3SE +/- 0.0161, N = 3SE +/- 0.0036, N = 35.58065.61395.5696MIN: 5.53 / MAX: 5.66MIN: 5.54 / MAX: 5.72MIN: 5.5 / MAX: 5.65

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj1231.17232.34463.51694.68925.8615SE +/- 0.0107, N = 3SE +/- 0.0125, N = 3SE +/- 0.0051, N = 35.20355.21015.1860MIN: 5.16 / MAX: 5.26MIN: 5.17 / MAX: 5.28MIN: 5.15 / MAX: 5.26

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon123246810SE +/- 0.0417, N = 3SE +/- 0.0431, N = 3SE +/- 0.0287, N = 36.56626.62906.5417MIN: 6.46 / MAX: 6.79MIN: 6.51 / MAX: 6.81MIN: 6.45 / MAX: 6.74

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj1231.33352.6674.00055.3346.6675SE +/- 0.0176, N = 3SE +/- 0.0196, N = 3SE +/- 0.0272, N = 35.89025.92665.8849MIN: 5.83 / MAX: 6MIN: 5.87 / MAX: 6.03MIN: 5.81 / MAX: 5.99

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.4860.9721.4581.9442.43SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.162.162.161. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.49950.9991.49851.9982.4975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.212.222.211. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow1233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.469.469.451. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.739.759.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast123246810SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.066.076.071. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 310.9810.9611.001. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast123612182430SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 324.5724.5624.601. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast1231020304050SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 343.4343.6843.611. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.05870.11740.17610.23480.2935SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 30.2550.2610.259

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.18630.37260.55890.74520.9315SE +/- 0.005, N = 3SE +/- 0.013, N = 3SE +/- 0.006, N = 30.8280.8230.814

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.24930.49860.74790.99721.2465SE +/- 0.011, N = 3SE +/- 0.004, N = 3SE +/- 0.013, N = 31.0741.1081.079

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.58391.16781.75172.33562.9195SE +/- 0.012, N = 3SE +/- 0.044, N = 3SE +/- 0.015, N = 32.5142.5952.552

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K123246810SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 36.556.596.561. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123714212835SE +/- 0.08, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 329.9430.1629.891. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 270.17, N = 3SE +/- 1169.64, N = 3SE +/- 104.15, N = 3159958.26159234.48161328.401. (CC) gcc options: -O2 -lrt" -lrt

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1232M4M6M8M10MSE +/- 42720.90, N = 3SE +/- 53934.69, N = 3SE +/- 100608.19, N = 37990112798628779187991. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1233M6M9M12M15MSE +/- 93407.88, N = 3SE +/- 117949.09, N = 3SE +/- 130255.47, N = 3122491611225497312309486

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123306090120150SE +/- 0.10, N = 3SE +/- 0.58, N = 3SE +/- 0.37, N = 3124.83125.79125.68

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12360120180240300SE +/- 1.13, N = 3SE +/- 1.16, N = 3SE +/- 2.21, N = 3252.46250.26254.65

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12370140210280350SE +/- 0.12, N = 3SE +/- 0.64, N = 3SE +/- 0.35, N = 3313.68313.69316.25

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1231224364860SE +/- 1.23, N = 16SE +/- 0.86, N = 17SE +/- 0.37, N = 449.7351.4751.211. (CC) gcc options: -O2 -std=c99

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 311.0611.2711.221. Nodejs v10.15.2

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.09290.18580.27870.37160.4645SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 40.4100.4130.4081. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.657.707.701. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310.3910.3910.391. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1231530456075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 369.5769.6169.611. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123120240360480600SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.25, N = 3568.11567.56568.091. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1231632486480SE +/- 0.54, N = 3SE +/- 0.68, N = 3SE +/- 0.38, N = 373.3972.6273.281. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 310.5810.7210.501. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212320406080100SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.22, N = 374.3974.2574.441. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123306090120150SE +/- 0.14, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 3145.88145.79145.871. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1232004006008001000SE +/- 1.28, N = 3SE +/- 2.85, N = 3SE +/- 12.55, N = 4945.48949.41972.131. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231632486480SE +/- 0.25, N = 3SE +/- 0.12, N = 3SE +/- 0.54, N = 369.4169.8670.601. (CC) gcc options: -O2 -ldl -lz -lpthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123600K1200K1800K2400K3000KSE +/- 37569.90, N = 3SE +/- 86194.23, N = 13SE +/- 2760.10, N = 32687331.831682927.161695087.331. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123500K1000K1500K2000K2500KSE +/- 33704.56, N = 12SE +/- 8400.74, N = 3SE +/- 30741.46, N = 132112933.022140255.002104918.261. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123400K800K1200K1600K2000KSE +/- 35412.98, N = 12SE +/- 20312.72, N = 15SE +/- 5741.58, N = 31611833.091622814.521655968.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 21911.47, N = 13SE +/- 23853.74, N = 8SE +/- 48510.25, N = 122540423.882339268.782314175.991. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 29746.00, N = 3SE +/- 49673.53, N = 15SE +/- 25199.46, N = 31884493.331808870.971916548.161. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123714212835SE +/- 0.26, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 331.4732.2231.64MIN: 30.7 / MAX: 45.93MIN: 31.89 / MAX: 45.17MIN: 31.25 / MAX: 55.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 38.259.038.54MIN: 8.02 / MAX: 11.6MIN: 8.75 / MAX: 11.63MIN: 8.16 / MAX: 12.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 36.917.097.10MIN: 6.69 / MAX: 12.06MIN: 6.9 / MAX: 10.15MIN: 6.88 / MAX: 9.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21233691215SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 39.479.459.48MIN: 9.36 / MAX: 12.12MIN: 9.32 / MAX: 12.68MIN: 9.35 / MAX: 13.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet123246810SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 36.837.046.84MIN: 6.61 / MAX: 7.1MIN: 6.85 / MAX: 9.1MIN: 6.59 / MAX: 10.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01233691215SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 311.2811.8011.60MIN: 10.83 / MAX: 19.59MIN: 11.52 / MAX: 12.14MIN: 11.37 / MAX: 26.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1230.5941.1881.7822.3762.97SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.572.642.55MIN: 2.37 / MAX: 2.66MIN: 2.6 / MAX: 2.69MIN: 2.49 / MAX: 2.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123612182430SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 323.9324.2723.85MIN: 23.66 / MAX: 24.5MIN: 23.92 / MAX: 27MIN: 23.58 / MAX: 37.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1612320406080100SE +/- 0.57, N = 3SE +/- 0.82, N = 3SE +/- 0.12, N = 3109.27110.79108.96MIN: 108.04 / MAX: 121.43MIN: 109.24 / MAX: 127.6MIN: 108.21 / MAX: 115.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18123612182430SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.08, N = 325.7826.1125.78MIN: 25.38 / MAX: 39.91MIN: 25.75 / MAX: 41.28MIN: 25.3 / MAX: 26.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet123510152025SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 322.2422.4322.25MIN: 21.89 / MAX: 24.72MIN: 21.97 / MAX: 35.74MIN: 22 / MAX: 36.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501231224364860SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.47, N = 350.6951.5651.48MIN: 50.36 / MAX: 63.24MIN: 50.54 / MAX: 64.38MIN: 50.22 / MAX: 66.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1231020304050SE +/- 0.72, N = 3SE +/- 0.29, N = 3SE +/- 0.38, N = 344.0844.9044.45MIN: 42.69 / MAX: 58.93MIN: 44.01 / MAX: 52.5MIN: 43.34 / MAX: 46.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123816243240SE +/- 0.07, N = 3SE +/- 0.57, N = 3SE +/- 0.27, N = 333.2234.5733.46MIN: 32.95 / MAX: 34.34MIN: 33.72 / MAX: 36.89MIN: 32.94 / MAX: 43.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m12348121620SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 316.3916.5816.45MIN: 16.23 / MAX: 18.48MIN: 16.23 / MAX: 31.03MIN: 16.16 / MAX: 17.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet123714212835SE +/- 0.06, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 331.3531.7631.53MIN: 31.01 / MAX: 43.41MIN: 31.13 / MAX: 48.02MIN: 31.22 / MAX: 33.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 38.359.108.59MIN: 7.99 / MAX: 21.73MIN: 8.89 / MAX: 12.03MIN: 8.33 / MAX: 13.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.057.147.06MIN: 6.84 / MAX: 8.55MIN: 6.95 / MAX: 21.34MIN: 6.87 / MAX: 10.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21233691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 39.529.459.49MIN: 9.42 / MAX: 11.19MIN: 9.35 / MAX: 12.21MIN: 9.39 / MAX: 11.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet123246810SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 36.806.966.81MIN: 6.62 / MAX: 10.18MIN: 6.68 / MAX: 21.89MIN: 6.53 / MAX: 7.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b01233691215SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 311.3311.7111.56MIN: 11.18 / MAX: 14.87MIN: 11.47 / MAX: 14.37MIN: 11.28 / MAX: 25.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface123246810SE +/- 0.02, N = 3SE +/- 5.13, N = 3SE +/- 0.03, N = 32.587.672.61MIN: 2.47 / MAX: 2.74MIN: 2.41 / MAX: 416.67MIN: 2.53 / MAX: 2.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet123612182430SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 323.6323.5323.97MIN: 23.33 / MAX: 36.77MIN: 23.31 / MAX: 26.84MIN: 23.72 / MAX: 36.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1612320406080100SE +/- 0.26, N = 3SE +/- 0.32, N = 3SE +/- 0.21, N = 3109.45109.70109.11MIN: 108.54 / MAX: 123.94MIN: 108.72 / MAX: 136.67MIN: 108.06 / MAX: 125.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18123612182430SE +/- 0.29, N = 3SE +/- 0.19, N = 3SE +/- 0.32, N = 325.7925.8125.77MIN: 24.99 / MAX: 28.18MIN: 25.13 / MAX: 40.98MIN: 25 / MAX: 26.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet123510152025SE +/- 0.13, N = 3SE +/- 0.17, N = 3SE +/- 0.10, N = 322.1422.4122.28MIN: 21.81 / MAX: 24.68MIN: 21.96 / MAX: 34.32MIN: 21.96 / MAX: 22.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501231428425670SE +/- 0.54, N = 3SE +/- 8.74, N = 3SE +/- 0.55, N = 351.2460.7652.10MIN: 50.5 / MAX: 66.36MIN: 51.25 / MAX: 1056.17MIN: 50.52 / MAX: 64.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1231020304050SE +/- 0.68, N = 3SE +/- 0.74, N = 3SE +/- 0.15, N = 346.1745.8044.16MIN: 44.33 / MAX: 61MIN: 43.83 / MAX: 58.33MIN: 43.39 / MAX: 57.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd123816243240SE +/- 0.38, N = 3SE +/- 0.27, N = 3SE +/- 0.14, N = 333.7734.1834.18MIN: 32.69 / MAX: 43.46MIN: 32.78 / MAX: 44.2MIN: 33.24 / MAX: 46.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m12348121620SE +/- 0.10, N = 3SE +/- 0.68, N = 3SE +/- 0.05, N = 316.5717.4116.59MIN: 16.28 / MAX: 16.95MIN: 16.52 / MAX: 269.38MIN: 16.38 / MAX: 20.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1230.16970.33940.50910.67880.8485SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.7500.7540.752

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1230.38050.7611.14151.5221.9025SE +/- 0.005, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 31.6841.6871.691

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123150K300K450K600K750KSE +/- 448.34, N = 3SE +/- 2148.66, N = 3SE +/- 709.37, N = 3710293709965709282

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.5241.0481.5722.0962.62SE +/- 0.012, N = 3SE +/- 0.019, N = 3SE +/- 0.024, N = 32.2802.2932.329MIN: 2.17 / MAX: 2.91MIN: 2.17 / MAX: 2.99MIN: 2.21 / MAX: 3.04

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric12310K20K30K40K50K4817548237485991. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1230.29250.5850.87751.171.4625SE +/- 0.05, N = 12SE +/- 0.05, N = 12SE +/- 0.03, N = 121.31.11.31. (CC) gcc options: -fopenmp -O3 -lm

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1231632486480SE +/- 0.05, N = 3SE +/- 0.20, N = 3SE +/- 0.04, N = 370.6670.7370.60

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1233691215SE +/- 0.02, N = 5SE +/- 0.04, N = 5SE +/- 0.06, N = 512.1912.2412.221. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.04, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 513.7913.7313.721. (CXX) g++ options: -rdynamic


Phoronix Test Suite v10.8.5