AMD EPYC compiler benchmarks by Michael Larabel.
Clang 10.0 2020-01-13 Processor: 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads), Motherboard: AMD DAYTONA_X (RDY1001C BIOS), Chipset: AMD Starship/Matisse, Memory: 504GB, Disk: 280GB INTEL SSDPE21D280GA + 256GB Micron_1100_MTFD, Graphics: llvmpipe 504GB, Monitor: VE228, Network: 2 x Mellanox MT27710
OS: Ubuntu 19.10, Kernel: 5.5.0-rc3-pts (x86_64) 20191229, Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, OpenGL: 3.3 Mesa 20.0.0-devel (git-2d971cc 2019-12-22 eoan-oibaf-ppa) (LLVM 9.0.1 128 bits), Compiler: Clang 10.0.0, File-System: ext4, Screen Resolution: 1920x1080
Environment Notes: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"
Compiler Notes: Optimized build; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2
Processor Notes: Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x830101c
Python Notes: Python 2.7.17 + Python 3.7.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + tsx_async_abort: Not affected
Clang 9.0.1 OS: Ubuntu 19.10, Kernel: 5.5.0-rc3-pts (x86_64) 20191229, Desktop: GNOME Shell 3.34.1, Display Server: X Server 1.20.5, Display Driver: modesetting 1.20.5, OpenGL: 3.3 Mesa 20.0.0-devel (git-2d971cc 2019-12-22 eoan-oibaf-ppa) (LLVM 9.0.1 128 bits), Compiler: Clang 9.0.1 + LLVM 9.0.1, File-System: ext4, Screen Resolution: 1920x1080
libgav1 2019-10-05 (FPS, more is better)
Video Input: Summer Nature 4K: Clang 9.0.1 = 15.56 (SE +/- 0.04, N = 3); Clang 10.0 2020-01-13 = 14.08 (SE +/- 0.11, N = 3)
Video Input: Summer Nature 1080p: Clang 9.0.1 = 48.67 (SE +/- 0.84, N = 3); Clang 10.0 2020-01-13 = 45.84 (SE +/- 0.06, N = 3)
Video Input: Chimera 1080p 10-bit: Clang 9.0.1 = 16.28 (SE +/- 0.16, N = 3); Clang 10.0 2020-01-13 = 16.70 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -march=znver2 -lpthread
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
dav1d 0.5.0 (FPS, more is better)
Video Input: Chimera 1080p: Clang 9.0.1 = 732.35 (SE +/- 5.33, N = 3; MIN: 225.78 / MAX: 934.19); Clang 10.0 2020-01-13 = 728.56 (SE +/- 0.54, N = 3; MIN: 232.01 / MAX: 919.61)
Video Input: Summer Nature 4K: Clang 9.0.1 = 306.90 (SE +/- 0.29, N = 3; MIN: 50.96 / MAX: 410.48); Clang 10.0 2020-01-13 = 309.30 (SE +/- 4.36, N = 3; MIN: 51.16 / MAX: 420.21)
Video Input: Summer Nature 1080p: Clang 9.0.1 = 715.90 (SE +/- 11.47, N = 3; MIN: 111.13 / MAX: 979.82); Clang 10.0 2020-01-13 = 719.07 (SE +/- 5.67, N = 3; MIN: 117.07 / MAX: 974.01)
Video Input: Chimera 1080p 10-bit: Clang 9.0.1 = 103.87 (SE +/- 0.18, N = 3; MIN: 73.44 / MAX: 156.23); Clang 10.0 2020-01-13 = 104.09 (SE +/- 0.08, N = 3; MIN: 73.58 / MAX: 156.41)
1. (CC) gcc options: -O3 -march=znver2 -pthread
AOM AV1 2020-01-10 (Frames Per Second, more is better)
Encoder Mode: Speed 5 Two-Pass: Clang 9.0.1 = 0.96 (SE +/- 0.00, N = 3); Clang 10.0 2020-01-13 = 0.97 (SE +/- 0.00, N = 3)
Encoder Mode: Speed 8 Realtime: Clang 9.0.1 = 27.33 (SE +/- 0.42, N = 3); Clang 10.0 2020-01-13 = 28.09 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
SVT-AV1 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-AV1 0.8 (Frames Per Second, more is better)
Encoder Mode: Enc Mode 4 - Input: 1080p: Clang 9.0.1 = 11.34 (SE +/- 0.10, N = 3); Clang 10.0 2020-01-13 = 11.44 (SE +/- 0.07, N = 3)
Encoder Mode: Enc Mode 8 - Input: 1080p: Clang 9.0.1 = 105.35 (SE +/- 1.18, N = 3); Clang 10.0 2020-01-13 = 109.49 (SE +/- 0.92, N = 3)
1. (CXX) g++ options: -O3 -march=znver2 -fPIE -fPIC -pie
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-VP9 0.1 (Frames Per Second, more is better)
Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p: Clang 9.0.1 = 311.83 (SE +/- 3.07, N = 9); Clang 10.0 2020-01-13 = 319.82 (SE +/- 3.99, N = 3)
Tuning: Visual Quality Optimized - Input: Bosphorus 1080p: Clang 9.0.1 = 255.34 (SE +/- 2.14, N = 15); Clang 10.0 2020-01-13 = 259.53 (SE +/- 1.53, N = 3)
1. (CC) gcc options: -O3 -march=znver2 -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.
VP9 libvpx Encoding 1.8.2 (Frames Per Second, more is better)
Speed: Speed 0: Clang 9.0.1 = 6.37 (SE +/- 0.02, N = 3); Clang 10.0 2020-01-13 = 6.48 (SE +/- 0.06, N = 3)
Speed: Speed 5: Clang 9.0.1 = 19.63 (SE +/- 0.07, N = 3); Clang 10.0 2020-01-13 = 19.73 (SE +/- 0.32, N = 3)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=znver2 -fPIC -U_FORTIFY_SOURCE -std=c++11
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
x264 2018-09-25 (Frames Per Second, more is better)
H.264 Video Encoding: Clang 9.0.1 = 150.31 (SE +/- 1.61, N = 3); Clang 10.0 2020-01-13 = 142.95 (SE +/- 1.03, N = 3)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=znver2 -std=gnu99 -mstack-alignment=64 -fPIC -fomit-frame-pointer -fno-tree-vectorize
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.
GraphicsMagick 1.3.33 (Iterations Per Minute, more is better)
Operation: Swirl: Clang 9.0.1 = 46; Clang 10.0 2020-01-13 = 45 (a single SE +/- 0.44, N = 9 is reported for this chart)
Operation: Rotate: Clang 9.0.1 = 503 (SE +/- 0.33, N = 3); Clang 10.0 2020-01-13 = 535 (SE +/- 2.52, N = 3)
Operation: Sharpen: Clang 9.0.1 = 14; Clang 10.0 2020-01-13 = 14
Operation: Enhanced: Clang 9.0.1 = 22; Clang 10.0 2020-01-13 = 22
Operation: Resizing: Clang 9.0.1 = 117; Clang 10.0 2020-01-13 = 116
Operation: Noise-Gaussian: Clang 9.0.1 = 25; Clang 10.0 2020-01-13 = 25
Operation: HWB Color Space: Clang 9.0.1 = 140 (SE +/- 0.88, N = 3); Clang 10.0 2020-01-13 = 209 (SE +/- 1.00, N = 3)
1. (CC) gcc options: -O3 -march=znver2 -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
SciMark 2.0 (Mflops, more is better)
Computational Test: Composite: Clang 9.0.1 = 2788.26 (SE +/- 7.57, N = 3); Clang 10.0 2020-01-13 = 2837.96 (SE +/- 6.40, N = 3)
Computational Test: Monte Carlo: Clang 9.0.1 = 621.20 (SE +/- 0.07, N = 3); Clang 10.0 2020-01-13 = 621.05 (SE +/- 0.12, N = 3)
Computational Test: Fast Fourier Transform: Clang 9.0.1 = 205.14 (SE +/- 0.81, N = 3); Clang 10.0 2020-01-13 = 212.94 (SE +/- 2.49, N = 3)
Computational Test: Sparse Matrix Multiply: Clang 9.0.1 = 2948.94 (SE +/- 22.62, N = 3); Clang 10.0 2020-01-13 = 3171.61 (SE +/- 18.57, N = 3)
Computational Test: Dense LU Matrix Factorization: Clang 9.0.1 = 8510.15 (SE +/- 17.20, N = 3); Clang 10.0 2020-01-13 = 8528.37 (SE +/- 11.83, N = 3)
Computational Test: Jacobi Successive Over-Relaxation: Clang 9.0.1 = 1655.87 (SE +/- 0.20, N = 3); Clang 10.0 2020-01-13 = 1655.83 (SE +/- 0.38, N = 3)
1. (CC) gcc options: -O3 -march=znver2 -lm
TSCP This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.
TSCP 1.81 (Nodes Per Second, more is better)
AI Chess Performance: Clang 9.0.1 = 1149851 (SE +/- 759.11, N = 5); Clang 10.0 2020-01-13 = 1172910 (SE +/- 3574.01, N = 5)
1. (CC) gcc options: -O3 -march=znver2 -march=native
John The Ripper 1.9.0-jumbo-1 (Real C/S, more is better)
Test: MD5: Clang 9.0.1 = 64570 (SE +/- 11.67, N = 3); Clang 10.0 2020-01-13 = 65161 (SE +/- 3.71, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -pthread -lm -lz -ldl -lcrypt -lbz2 -mavx2 -O3 -march=native -march=znver2 -std=gnu89 -Qunused-arguments -funroll-loops
NGINX Benchmark This is a test of ab, which is the Apache Benchmark program running against nginx. This test profile measures how many requests per second a given system can sustain when carrying out 2,000,000 requests with 500 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
NGINX Benchmark 1.9.9 (Requests Per Second, more is better)
Static Web Page Serving: Clang 9.0.1 = 25759.00 (SE +/- 769.55, N = 15); Clang 10.0 2020-01-13 = 26737.65 (SE +/- 719.06, N = 15)
1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native -march=znver2
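The NGINX description gives the request count (2,000,000) and the concurrency (500) but not the exact command line. A minimal sketch of how such an ab invocation could be assembled is shown here. The `-n` (total requests) and `-c` (concurrency) flags are standard ab options; the helper name `build_ab_command` and the URL are hypothetical, and the test profile's real invocation may differ.

```python
import shlex


def build_ab_command(url: str, total_requests: int, concurrency: int) -> list[str]:
    """Assemble an ApacheBench argument list (hypothetical helper, not from the test profile)."""
    if concurrency > total_requests:
        raise ValueError("concurrency cannot exceed the total number of requests")
    # -n: total number of requests, -c: requests carried out concurrently.
    return ["ab", "-n", str(total_requests), "-c", str(concurrency), url]


# The URL is an assumption for illustration; ab expects a path, hence the trailing slash.
cmd = build_ab_command("http://localhost:8080/", 2_000_000, 500)
print(shlex.join(cmd))
```

The list form can be passed directly to `subprocess.run` on a machine where ab and an nginx instance are available.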
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
Apache Benchmark 2.4.29 (Requests Per Second, more is better)
Static Web Page Serving: Clang 9.0.1 = 21253.33 (SE +/- 269.96, N = 3); Clang 10.0 2020-01-13 = 21502.15 (SE +/- 277.81, N = 5)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=znver2
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenSSL 1.1.1 (Signs Per Second, more is better)
RSA 4096-bit Performance: Clang 9.0.1 = 24964.9 (SE +/- 17.07, N = 3); Clang 10.0 2020-01-13 = 18960.4 (SE +/- 26.40, N = 3)
1. (CC) gcc options: -pthread -m64 -Qunused-arguments -O3 -march=znver2 -lssl -lcrypto -ldl
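The OpenSSL result is the largest gap between the two compilers in this comparison. A small sketch of the relative-change arithmetic follows, using the two signs-per-second figures reported above. The function name `percent_change` is illustrative and not part of any benchmark tooling.

```python
def percent_change(baseline: float, candidate: float) -> float:
    """Relative change of candidate versus baseline, in percent."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (candidate - baseline) / baseline * 100.0


# RSA 4096-bit signs per second, copied from the OpenSSL result.
clang_9 = 24964.9
clang_10 = 18960.4

change = percent_change(clang_9, clang_10)
print(f"Clang 10.0 vs Clang 9.0.1: {change:+.2f}%")
```

A negative value means the Clang 10.0 build signs fewer RSA operations per second than the Clang 9.0.1 build.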
PostgreSQL pgbench This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.
PostgreSQL pgbench 12.0 (TPS, more is better)
Scaling: Buffer Test - Test: Normal Load - Mode: Read Only: Clang 9.0.1 = 971995.19 (SE +/- 10204.64, N = 9); Clang 10.0 2020-01-13 = 944709.06 (SE +/- 11267.63, N = 6)
Scaling: Buffer Test - Test: Normal Load - Mode: Read Write: Clang 9.0.1 = 13644.51 (SE +/- 385.96, N = 9); Clang 10.0 2020-01-13 = 16181.68 (SE +/- 1023.29, N = 9)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=znver2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm
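The pgbench read/write figures come with comparatively large standard errors, so it helps to compare the gap between the two means with the run-to-run noise. The sketch below expresses the difference in units of the combined standard error. It assumes the two runs are independent and is only a rough heuristic, not a formal significance test; the function name is illustrative.

```python
import math


def difference_in_standard_errors(mean_a: float, se_a: float,
                                  mean_b: float, se_b: float) -> float:
    """Return (mean_b - mean_a) divided by the combined standard error.

    Assumes the two measurements are independent, so their standard
    errors combine as the square root of the sum of squares.
    """
    combined_se = math.hypot(se_a, se_b)
    if combined_se == 0:
        raise ValueError("at least one standard error must be non-zero")
    return (mean_b - mean_a) / combined_se


# Read Write mode, TPS and SE values copied from the pgbench result.
ratio = difference_in_standard_errors(13644.51, 385.96, 16181.68, 1023.29)
print(f"Difference is {ratio:.2f} combined standard errors")
```

A ratio only a little above 2 suggests the read/write difference should be read with some caution.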
Timed MrBayes Analysis This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
Timed MrBayes Analysis 3.2.7 (Seconds, fewer is better)
Primate Phylogeny Analysis: Clang 9.0.1 = 103.25 (SE +/- 0.15, N = 3); Clang 10.0 2020-01-13 = 112.68 (SE +/- 0.24, N = 3)
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=znver2 -lm
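From this point the results switch from "more is better" to "fewer is better", so a raw ratio of the two numbers changes meaning. One way to keep comparisons consistent is to normalize both kinds of result to a speedup factor. The helper `speedup` is an illustrative name, and the inputs are copied from the MrBayes and GraphicsMagick HWB Color Space results.

```python
def speedup(baseline: float, candidate: float, higher_is_better: bool) -> float:
    """Speedup of candidate over baseline; values above 1.0 mean candidate is faster."""
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    if higher_is_better:
        return candidate / baseline  # throughput: more is better
    return baseline / candidate      # elapsed time: fewer is better


# Timed MrBayes Analysis (seconds, fewer is better): Clang 9.0.1 -> Clang 10.0.
mrbayes = speedup(103.25, 112.68, higher_is_better=False)

# GraphicsMagick HWB Color Space (iterations per minute, more is better).
hwb = speedup(140, 209, higher_is_better=True)

print(f"MrBayes speedup: {mrbayes:.3f}x, HWB Color Space speedup: {hwb:.3f}x")
```

With this convention a value below 1.0 always indicates that the Clang 10.0 build was slower, regardless of the unit.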
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
C-Ray 1.1 (Seconds, fewer is better)
Total Time - 4K, 16 Rays Per Pixel: Clang 9.0.1 = 8.987 (SE +/- 0.128, N = 4); Clang 10.0 2020-01-13 = 8.893 (SE +/- 0.076, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=znver2
Tungsten Renderer Tungsten is a C++ physically based renderer that makes use of Intel's Embree ray tracing library. Learn more via the OpenBenchmarking.org test page.
Tungsten Renderer 0.2.2 (Seconds, fewer is better)
Scene: Hair: Clang 9.0.1 = 5.45661 (SE +/- 0.03119, N = 3); Clang 10.0 2020-01-13 = 5.08467 (SE +/- 0.05042, N = 3)
Scene: Water Caustic: Clang 9.0.1 = 23.69 (SE +/- 0.23, N = 15); Clang 10.0 2020-01-13 = 23.22 (SE +/- 0.31, N = 5)
Scene: Non-Exponential: Clang 9.0.1 = 1.48972 (SE +/- 0.02196, N = 3); Clang 10.0 2020-01-13 = 1.48624 (SE +/- 0.01796, N = 3)
Scene: Volumetric Caustic: Clang 9.0.1 = 4.14480 (SE +/- 0.02542, N = 3); Clang 10.0 2020-01-13 = 4.27817 (SE +/- 0.00977, N = 3)
1. (CXX) g++ options: -O3 -march=znver2 -std=c++0x -march=znver1 -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -mfma -mbmi2 -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -ljpeg -lpthread -ldl
Bullet Physics Engine 2.81 (Seconds, fewer is better)
Test: 3000 Fall: Clang 9.0.1 = 4.424567 (SE +/- 0.018141, N = 3); Clang 10.0 2020-01-13 = 4.420226 (SE +/- 0.018107, N = 3)
Test: 1000 Stack: Clang 9.0.1 = 5.294185 (SE +/- 0.010447, N = 3); Clang 10.0 2020-01-13 = 5.266326 (SE +/- 0.018525, N = 3)
Test: 1000 Convex: Clang 9.0.1 = 4.579873 (SE +/- 0.000603, N = 3); Clang 10.0 2020-01-13 = 4.534685 (SE +/- 0.004395, N = 3)
Test: 136 Ragdolls: Clang 9.0.1 = 2.906627 (SE +/- 0.000731, N = 3); Clang 10.0 2020-01-13 = 2.906147 (SE +/- 0.000256, N = 3)
Test: Prim Trimesh: Clang 9.0.1 = 1.019573 (SE +/- 0.000252, N = 3); Clang 10.0 2020-01-13 = 1.032531 (SE +/- 0.013795, N = 3)
Test: Convex Trimesh: Clang 9.0.1 = 1.182357 (SE +/- 0.000338, N = 3); Clang 10.0 2020-01-13 = 1.176975 (SE +/- 0.000514, N = 3)
1. (CXX) g++ options: -O3 -march=znver2 -rdynamic
XZ Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.
XZ Compression 5.2.4 (Seconds, fewer is better)
Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9: Clang 9.0.1 = 27.76 (SE +/- 0.31, N = 15); Clang 10.0 2020-01-13 = 27.24 (SE +/- 0.11, N = 3)
1. (CC) gcc options: -pthread -fvisibility=hidden -O3 -march=znver2
CppPerformanceBenchmarks 9 (Seconds, fewer is better)
Test: Math Library: Clang 9.0.1 = 332.30 (SE +/- 0.51, N = 3); Clang 10.0 2020-01-13 = 333.07 (SE +/- 0.34, N = 3)
Test: Random Numbers: Clang 9.0.1 = 1675.00 (SE +/- 0.64, N = 3); Clang 10.0 2020-01-13 = 1659.97 (SE +/- 1.23, N = 3)
Test: Stepanov Vector: Clang 9.0.1 = 86.28 (SE +/- 0.14, N = 3); Clang 10.0 2020-01-13 = 86.13 (SE +/- 0.04, N = 3)
Test: Stepanov Abstraction: Clang 9.0.1 = 33.51 (SE +/- 0.01, N = 3); Clang 10.0 2020-01-13 = 33.57 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -march=znver2 -std=c++11
Clang 10.0 2020-01-13: testing initiated at 13 January 2020 19:53 by user pts.
Clang 9.0.1: testing initiated at 14 January 2020 07:15 by user pts, on the same 2 x AMD EPYC 7742 system as the Clang 10.0 run.