POWER9 compiler benchmarking for a future article on Phoronix.
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: --enable-checking=release
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
GCC 9.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Clang 7.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Clang 8.0.0-rc OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Hierarchical INTegration This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.
Hierarchical INTegration 1.0, Test: DOUBLE (QUIPs, More Is Better)
  GCC 8.2.0: 535814531 (SE +/- 30254.60, N = 3)
  GCC 9.0.1: 552839732 (SE +/- 91261.04, N = 3)
  Clang 7.0.1: 377718193 (SE +/- 149978.94, N = 3)
  Clang 8.0.0-rc: 379561299 (SE +/- 147477.11, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
Hierarchical INTegration 1.0, Test: FLOAT (QUIPs, More Is Better)
  GCC 8.2.0: 220611266 (SE +/- 14738.26, N = 3)
  GCC 9.0.1: 227407921 (SE +/- 8234.46, N = 3)
  Clang 7.0.1: 174464419 (SE +/- 13561.84, N = 3)
  Clang 8.0.0-rc: 176740207 (SE +/- 13328.34, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
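To read the HINT numbers as relative differences rather than raw QUIPs, a small sketch like the following may help. It only re-expresses the means listed above as a percentage against GCC 8.2.0; the function name `percent_vs_baseline` and the choice of baseline are illustrative assumptions, not part of the Phoronix Test Suite.

```python
# Mean results copied from the two HINT charts (QUIPs, more is better).
hint_quips = {
    "DOUBLE": {
        "GCC 8.2.0": 535814531,
        "GCC 9.0.1": 552839732,
        "Clang 7.0.1": 377718193,
        "Clang 8.0.0-rc": 379561299,
    },
    "FLOAT": {
        "GCC 8.2.0": 220611266,
        "GCC 9.0.1": 227407921,
        "Clang 7.0.1": 174464419,
        "Clang 8.0.0-rc": 176740207,
    },
}


def percent_vs_baseline(results, baseline):
    """Return each result as a percentage change relative to the baseline entry."""
    base = results[baseline]
    return {name: 100.0 * (value - base) / base for name, value in results.items()}


# The baseline is an arbitrary choice made for this illustration.
for test, results in hint_quips.items():
    for name, delta in percent_vs_baseline(results, "GCC 8.2.0").items():
        print(f"{test:6s} {name:15s} {delta:+6.1f}%")
```

For a "more is better" metric such as QUIPs, a positive percentage means the compiler is ahead of the baseline.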
CacheBench This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance. Learn more via the OpenBenchmarking.org test page.
CacheBench, Test: Read / Modify / Write (MB/s, More Is Better)
  GCC 8.2.0: 21154 (SE +/- 0.03, N = 3)
  GCC 9.0.1: 19793 (SE +/- 9.96, N = 3)
  Clang 7.0.1: 58691 (SE +/- 13.10, N = 3)
  Clang 8.0.0-rc: 58871 (SE +/- 25.70, N = 3)
  1. (CC) gcc options: -lrt
CacheBench, Test: Write (MB/s, More Is Better)
  GCC 8.2.0: 11925 (SE +/- 0.52, N = 3)
  GCC 9.0.1: 11147 (SE +/- 14.98, N = 3)
  Clang 7.0.1: 44459 (SE +/- 18.02, N = 3)
  Clang 8.0.0-rc: 44754 (SE +/- 3.02, N = 3)
  1. (CC) gcc options: -lrt
CacheBench, Test: Read (MB/s, More Is Better)
  GCC 8.2.0: 4782 (SE +/- 0.37, N = 3)
  GCC 9.0.1: 4897 (SE +/- 0.00, N = 3)
  Clang 7.0.1: 4750 (SE +/- 0.05, N = 3)
  Clang 8.0.0-rc: 4750 (SE +/- 0.08, N = 3)
  1. (CC) gcc options: -lrt
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
dav1d 0.1, Video Input: Summer Nature 4K (Seconds, Fewer Is Better)
  GCC 8.2.0: 93.23 (SE +/- 1.78, N = 3)
  GCC 9.0.1: 84.58 (SE +/- 0.40, N = 3)
  Clang 7.0.1: 101.06 (SE +/- 0.92, N = 3)
  Clang 8.0.0-rc: 103.17 (SE +/- 0.44, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
FLAC Audio Encoding 1.3.2, WAV To FLAC (Seconds, Fewer Is Better)
  GCC 8.2.0: 44.39 (SE +/- 0.01, N = 5)
  GCC 9.0.1: 39.96 (SE +/- 0.01, N = 5)
  Clang 7.0.1: 47.71 (SE +/- 0.01, N = 5)
  Clang 8.0.0-rc: 47.84 (SE +/- 0.01, N = 5)
  Per-result extra options (as extracted): -fvisibility=hidden -fvisibility=hidden
  1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -logg -lm
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
AOBench, Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
  GCC 8.2.0: 58.83 (SE +/- 0.00, N = 3)
  GCC 9.0.1: 59.03 (SE +/- 0.01, N = 3)
  Clang 7.0.1: 74.95 (SE +/- 0.03, N = 3)
  Clang 8.0.0-rc: 75.60 (SE +/- 0.04, N = 3)
  1. (CC) gcc options: -lm -O3 -mtune=native -mcpu=native
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
GraphicsMagick 1.3.30, Operation: Enhanced (Iterations Per Minute, More Is Better)
  GCC 8.2.0: 154
  GCC 9.0.1: 154
  Clang 7.0.1: 23
  Clang 8.0.0-rc: 22
  Per-result extra options (as extracted): -fopenmp -ldl -fopenmp -ldl
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
GraphicsMagick 1.3.30, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
  GCC 8.2.0: 159
  GCC 9.0.1: 157
  Clang 7.0.1: 15
  Clang 8.0.0-rc: 15
  Reported SE +/- 0.67, N = 3 (result not identified in the extracted chart text)
  Per-result extra options (as extracted): -fopenmp -ldl -fopenmp -ldl
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
GraphicsMagick 1.3.30, Operation: Sharpen (Iterations Per Minute, More Is Better)
  GCC 8.2.0: 147
  GCC 9.0.1: 148
  Clang 7.0.1: 20
  Clang 8.0.0-rc: 19
  Per-result extra options (as extracted): -fopenmp -ldl -fopenmp -ldl
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
GraphicsMagick 1.3.30, Operation: Swirl (Iterations Per Minute, More Is Better)
  GCC 8.2.0: 164
  GCC 9.0.1: 165
  Clang 7.0.1: 43
  Clang 8.0.0-rc: 43
  Reported SE +/- 0.67, N = 3 (result not identified in the extracted chart text)
  Per-result extra options (as extracted): -fopenmp -ldl -fopenmp -ldl
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
GraphicsMagick 1.3.30, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
  GCC 8.2.0: 183
  GCC 9.0.1: 185
  Clang 7.0.1: 118
  Clang 8.0.0-rc: 118
  Reported SE +/- 1.20, N = 3 and SE +/- 0.33, N = 3 (results not identified in the extracted chart text)
  Per-result extra options (as extracted): -fopenmp -ldl -fopenmp -ldl
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
GraphicsMagick 1.3.30, Operation: Resizing (Iterations Per Minute, More Is Better)
  GCC 8.2.0: 178
  GCC 9.0.1: 177
  Clang 7.0.1: 89
  Clang 8.0.0-rc: 88
  Reported SE +/- 0.67, N = 3 and SE +/- 0.58, N = 3 (results not identified in the extracted chart text)
  Per-result extra options (as extracted): -fopenmp -ldl -fopenmp -ldl
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
GraphicsMagick 1.3.30, Operation: Rotate (Iterations Per Minute, More Is Better)
  GCC 8.2.0: 197
  GCC 9.0.1: 197
  Clang 7.0.1: 159
  Clang 8.0.0-rc: 159
  Per-result extra options (as extracted): -fopenmp -ldl -fopenmp -ldl
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
x265 3.0, H.265 1080p Video Encoding (Frames Per Second, More Is Better)
  GCC 8.2.0: 11.30 (SE +/- 0.09, N = 3)
  GCC 9.0.1: 11.25 (SE +/- 0.03, N = 3)
  1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic -lpthread -lrt -ldl -lnuma
lzbench 2017-08-08, Test: XZ 0 - Process: Compression (MB/s, More Is Better)
  GCC 8.2.0: 21
  GCC 9.0.1: 22
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
dav1d 0.1, Video Input: Summer Nature 1080p (Seconds, Fewer Is Better)
  GCC 8.2.0: 28.93 (SE +/- 0.09, N = 3)
  GCC 9.0.1: 27.69 (SE +/- 0.39, N = 12)
  Clang 7.0.1: 31.38 (SE +/- 0.19, N = 3)
  Clang 8.0.0-rc: 32.19 (SE +/- 0.18, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
ebizzy This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.
ebizzy 0.3 (Records/s, More Is Better)
  GCC 8.2.0: 1117793 (SE +/- 20488.27, N = 12)
  GCC 9.0.1: 1130592 (SE +/- 17339.39, N = 12)
  Clang 7.0.1: 1152232 (SE +/- 18117.64, N = 3)
  Clang 8.0.0-rc: 1071048 (SE +/- 13415.01, N = 3)
  1. (CC) gcc options: -pthread -lpthread -O3 -mtune=native -mcpu=native
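The ebizzy runs have comparatively large standard errors, so it can help to express the gap between two results in units of their combined SE before reading much into it. The sketch below is a rough heuristic under stated assumptions (the two SEs are treated as independent, and the helper name `gap_in_combined_se` is hypothetical); it is not a formal significance test, especially with N as small as 3.

```python
import math

# (mean Records/s, reported standard error) copied from the ebizzy chart.
ebizzy = {
    "GCC 8.2.0": (1117793, 20488.27),
    "GCC 9.0.1": (1130592, 17339.39),
    "Clang 7.0.1": (1152232, 18117.64),
    "Clang 8.0.0-rc": (1071048, 13415.01),
}


def gap_in_combined_se(a, b):
    """Absolute difference of two means divided by sqrt(se_a^2 + se_b^2)."""
    (mean_a, se_a), (mean_b, se_b) = a, b
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


pairs = [("GCC 8.2.0", "GCC 9.0.1"), ("Clang 7.0.1", "Clang 8.0.0-rc")]
for first, second in pairs:
    gap = gap_in_combined_se(ebizzy[first], ebizzy[second])
    print(f"{first} vs {second}: gap is {gap:.2f} combined standard errors")
```

A gap well below 1 suggests the two results are hard to tell apart from run-to-run noise; a gap of several combined SEs is more likely to reflect a real difference.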
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
Apache Benchmark 2.4.29, Static Web Page Serving (Requests Per Second, More Is Better)
  GCC 8.2.0: 21110 (SE +/- 24.13, N = 3)
  GCC 9.0.1: 21008 (SE +/- 41.67, N = 3)
  Clang 7.0.1: 21209 (SE +/- 125.28, N = 3)
  Clang 8.0.0-rc: 21444 (SE +/- 50.11, N = 3)
  1. (CC) gcc options: -shared -fPIC -pthread -O3 -mtune=native -mcpu=native
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
  GCC 8.2.0: 17.84 (SE +/- 0.01, N = 3)
  GCC 9.0.1: 17.87 (SE +/- 0.01, N = 3)
  Clang 7.0.1: 56.23 (SE +/- 0.05, N = 3)
  Clang 8.0.0-rc: 56.53 (SE +/- 0.04, N = 3)
  1. (CC) gcc options: -lm -lpthread -O3 -mtune=native -mcpu=native
lzbench 2017-08-08, Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
  GCC 8.2.0: 322
  GCC 9.0.1: 317
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
  GCC 8.2.0: 353
  GCC 9.0.1: 356
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
  GCC 8.2.0: 115
  GCC 9.0.1: 115
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
  GCC 8.2.0: 800
  GCC 9.0.1: 813
  Reported SE +/- 0.33, N = 3 (result not identified in the extracted chart text)
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
lzbench 2017-08-08, Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
  GCC 8.2.0: 286
  GCC 9.0.1: 280
  1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
Memcached mcperf 1.5.10, Method: Add (Operations Per Second, More Is Better)
  GCC 8.2.0: 49781 (SE +/- 791.56, N = 3)
  GCC 9.0.1: 37853 (SE +/- 62.05, N = 3)
  Clang 7.0.1: 49578 (SE +/- 165.31, N = 3)
  Clang 8.0.0-rc: 50110 (SE +/- 770.83, N = 4)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
  GCC 8.2.0: 1255 (SE +/- 0.23, N = 3)
  GCC 9.0.1: 1173 (SE +/- 0.67, N = 3)
  Clang 7.0.1: 1585 (SE +/- 6.10, N = 3)
  Clang 8.0.0-rc: 1593 (SE +/- 0.37, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
Memcached mcperf 1.5.10, Method: Append (Operations Per Second, More Is Better)
  GCC 8.2.0: 51756 (SE +/- 218.98, N = 3)
  GCC 9.0.1: 39666 (SE +/- 131.76, N = 3)
  Clang 7.0.1: 51981 (SE +/- 251.90, N = 3)
  Clang 8.0.0-rc: 52234 (SE +/- 79.39, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
Memcached mcperf 1.5.10, Method: Replace (Operations Per Second, More Is Better)
  GCC 8.2.0: 52559 (SE +/- 796.15, N = 3)
  GCC 9.0.1: 39540 (SE +/- 31.59, N = 3)
  Clang 7.0.1: 52200 (SE +/- 246.26, N = 3)
  Clang 8.0.0-rc: 52343 (SE +/- 509.02, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
Memcached mcperf 1.5.10, Method: Prepend (Operations Per Second, More Is Better)
  GCC 8.2.0: 51972 (SE +/- 241.03, N = 3)
  GCC 9.0.1: 39442 (SE +/- 154.01, N = 3)
  Clang 7.0.1: 52563 (SE +/- 710.17, N = 3)
  Clang 8.0.0-rc: 53049 (SE +/- 1070.25, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
Redis 4.0.8, Test: SADD (Requests Per Second, More Is Better)
  GCC 8.2.0: 1217931 (SE +/- 23389.59, N = 3)
  GCC 9.0.1: 1298341 (SE +/- 13393.71, N = 11)
  Clang 7.0.1: 1376795 (SE +/- 3347.83, N = 3)
  Clang 8.0.0-rc: 1211944 (SE +/- 13748.37, N = 12)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenSSL 1.1.1, RSA 4096-bit Performance (Signs Per Second, More Is Better)
  GCC 8.2.0: 7514 (SE +/- 44.37, N = 3)
  GCC 9.0.1: 7407 (SE +/- 14.69, N = 3)
  Clang 7.0.1: 6958 (SE +/- 34.93, N = 3)
  Clang 8.0.0-rc: 7063 (SE +/- 12.55, N = 3)
  Per-result extra options (as extracted): -Qunused-arguments -Qunused-arguments
  1. (CC) gcc options: -pthread -m64 -O3 -mtune=native -mcpu=native -lssl -lcrypto -ldl
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (Seconds, Fewer Is Better)
  GCC 8.2.0: 11.37 (SE +/- 0.24, N = 12)
  GCC 9.0.1: 11.59 (SE +/- 0.21, N = 3)
  Clang 7.0.1: 11.04 (SE +/- 0.16, N = 3)
  Clang 8.0.0-rc: 11.33 (SE +/- 0.09, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -lz -llzma
Redis 4.0.8, Test: GET (Requests Per Second, More Is Better)
  GCC 8.2.0: 1604180 (SE +/- 24484.83, N = 3)
  GCC 9.0.1: 1618223 (SE +/- 18962.05, N = 12)
  Clang 7.0.1: 1721346 (SE +/- 12276.86, N = 3)
  Clang 8.0.0-rc: 1666854 (SE +/- 12435.42, N = 3)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
t-test1 2017-01-13, Threads: 1 (Seconds, Fewer Is Better)
  GCC 8.2.0: 17.71 (SE +/- 0.10, N = 3)
  GCC 9.0.1: 17.80 (SE +/- 0.11, N = 3)
  Clang 7.0.1: 17.92 (SE +/- 0.06, N = 3)
  Clang 8.0.0-rc: 17.83 (SE +/- 0.09, N = 3)
  1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better)
  GCC 8.2.0: 15.73 (SE +/- 0.01, N = 3)
  GCC 9.0.1: 15.73 (SE +/- 0.01, N = 3)
  Clang 7.0.1: 18.61 (SE +/- 0.01, N = 3)
  Clang 8.0.0-rc: 18.86 (SE +/- 0.00, N = 3)
  Per-result extra options (as extracted): -pipe -pipe
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lncurses -lm
Redis 4.0.8, Test: LPOP (Requests Per Second, More Is Better)
  GCC 8.2.0: 1685935 (SE +/- 21388.79, N = 3)
  GCC 9.0.1: 1714327 (SE +/- 5953.84, N = 3)
  Clang 7.0.1: 1804132 (SE +/- 12153.33, N = 3)
  Clang 8.0.0-rc: 1729936 (SE +/- 18704.28, N = 10)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis 4.0.8, Test: SET (Requests Per Second, More Is Better)
  GCC 8.2.0: 1017131 (SE +/- 9563.35, N = 3)
  GCC 9.0.1: 1096671 (SE +/- 16643.13, N = 4)
  Clang 7.0.1: 1113384 (SE +/- 18665.14, N = 3)
  Clang 8.0.0-rc: 1033115 (SE +/- 15331.74, N = 5)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better)
  GCC 8.2.0: 53.75 (SE +/- 0.16, N = 3)
  GCC 9.0.1: 52.15 (SE +/- 0.57, N = 3)
  Clang 7.0.1: 52.34 (SE +/- 0.55, N = 3)
  Clang 8.0.0-rc: 51.98 (SE +/- 0.35, N = 3)
  1. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -mtune=native -mcpu=native -maltivec -mabi=altivec -mvsx -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
Redis 4.0.8, Test: LPUSH (Requests Per Second, More Is Better)
  GCC 8.2.0: 791788 (SE +/- 2969.35, N = 3)
  GCC 9.0.1: 820878 (SE +/- 11192.70, N = 3)
  Clang 7.0.1: 867234 (SE +/- 8833.10, N = 3)
  Clang 8.0.0-rc: 836662 (SE +/- 5628.72, N = 3)
  1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
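With five Redis tests (SADD, GET, LPOP, SET, LPUSH) reported separately, one common way to condense them into a single figure per compiler is a geometric mean. The sketch below is only an illustration of that arithmetic on the means listed in this result file; the summary itself is not part of the original results, and the helper name `geomean_by_compiler` is hypothetical.

```python
from statistics import geometric_mean  # available since Python 3.8

compilers = ["GCC 8.2.0", "GCC 9.0.1", "Clang 7.0.1", "Clang 8.0.0-rc"]

# Mean Requests Per Second per test, in the compiler order given above.
redis_rps = {
    "SADD": [1217931, 1298341, 1376795, 1211944],
    "GET": [1604180, 1618223, 1721346, 1666854],
    "LPOP": [1685935, 1714327, 1804132, 1729936],
    "SET": [1017131, 1096671, 1113384, 1033115],
    "LPUSH": [791788, 820878, 867234, 836662],
}


def geomean_by_compiler(results, names):
    """Geometric mean across all tests, computed separately for each compiler."""
    return {
        name: geometric_mean([row[i] for row in results.values()])
        for i, name in enumerate(names)
    }


summary = geomean_by_compiler(redis_rps, compilers)
for name, value in summary.items():
    print(f"{name:15s} {value:12.0f} requests/s (geometric mean of 5 tests)")
```

The geometric mean is used here rather than the arithmetic mean so that no single high-throughput test dominates the summary; it ignores the reported standard errors entirely.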
t-test1 2017-01-13, Threads: 2 (Seconds, Fewer Is Better)
  GCC 8.2.0: 6.65 (SE +/- 0.02, N = 3)
  GCC 9.0.1: 6.99 (SE +/- 0.03, N = 3)
  Clang 7.0.1: 6.88 (SE +/- 0.09, N = 3)
  Clang 8.0.0-rc: 7.18 (SE +/- 0.04, N = 3)
  1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
Bullet Physics Engine 2.81, Test: Prim Trimesh (Seconds, Fewer Is Better)
  GCC 8.2.0: 1.54 (SE +/- 0.00, N = 3)
  GCC 9.0.1: 1.54 (SE +/- 0.00, N = 3)
  Clang 7.0.1: 1.70 (SE +/- 0.00, N = 3)
  Clang 8.0.0-rc: 1.69 (SE +/- 0.00, N = 3)
  Per-result extra options (as extracted): -lglut -lGL -lGLU -lglut -lGL -lGLU
  1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
Bullet Physics Engine 2.81, Test: 136 Ragdolls (Seconds, Fewer Is Better)
  GCC 8.2.0: 4.15 (SE +/- 0.00, N = 3)
  GCC 9.0.1: 4.13 (SE +/- 0.00, N = 3)
  Clang 7.0.1: 4.84 (SE +/- 0.00, N = 3)
  Clang 8.0.0-rc: 4.82 (SE +/- 0.00, N = 3)
  Per-result extra options (as extracted): -lglut -lGL -lGLU -lglut -lGL -lGLU
  1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
Bullet Physics Engine 2.81, Test: 1000 Convex (Seconds, Fewer Is Better)
  GCC 8.2.0: 8.22 (SE +/- 0.01, N = 3)
  GCC 9.0.1: 8.45 (SE +/- 0.01, N = 3)
  Clang 7.0.1: 8.73 (SE +/- 0.01, N = 3)
  Clang 8.0.0-rc: 8.66 (SE +/- 0.01, N = 3)
  Per-result extra options (as extracted): -lglut -lGL -lGLU -lglut -lGL -lGLU
  1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
Bullet Physics Engine 2.81, Test: 1000 Stack (Seconds, Fewer Is Better)
  GCC 8.2.0: 7.69 (SE +/- 0.01, N = 3)
  GCC 9.0.1: 7.55 (SE +/- 0.00, N = 3)
  Clang 7.0.1: 8.32 (SE +/- 0.00, N = 3)
  Clang 8.0.0-rc: 8.35 (SE +/- 0.00, N = 3)
  Per-result extra options (as extracted): -lglut -lGL -lGLU -lglut -lGL -lGLU
  1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
Bullet Physics Engine 2.81, Test: 3000 Fall (Seconds, Fewer Is Better)
  GCC 8.2.0: 6.28 (SE +/- 0.01, N = 3)
  GCC 9.0.1: 6.26 (SE +/- 0.01, N = 3)
  Clang 7.0.1: 6.84 (SE +/- 0.01, N = 3)
  Clang 8.0.0-rc: 6.85 (SE +/- 0.01, N = 3)
  Per-result extra options (as extracted): -lglut -lGL -lGLU -lglut -lGL -lGLU
  1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
  GCC 8.2.0: 1248 (SE +/- 0.01, N = 3)
  GCC 9.0.1: 1248 (SE +/- 0.06, N = 3)
  Clang 7.0.1: 940 (SE +/- 0.01, N = 3)
  Clang 8.0.0-rc: 944 (SE +/- 0.01, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
  GCC 8.2.0: 3441 (SE +/- 0.98, N = 3)
  GCC 9.0.1: 3042 (SE +/- 3.55, N = 3)
  Clang 7.0.1: 4605 (SE +/- 30.81, N = 3)
  Clang 8.0.0-rc: 4509 (SE +/- 0.88, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
  GCC 8.2.0: 1119 (SE +/- 0.06, N = 3)
  GCC 9.0.1: 1116 (SE +/- 0.28, N = 3)
  Clang 7.0.1: 1724 (SE +/- 1.61, N = 3)
  Clang 8.0.0-rc: 1829 (SE +/- 0.79, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
  GCC 8.2.0: 307 (SE +/- 0.41, N = 3)
  GCC 9.0.1: 308 (SE +/- 0.49, N = 3)
  Clang 7.0.1: 305 (SE +/- 0.93, N = 3)
  Clang 8.0.0-rc: 305 (SE +/- 0.73, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
  GCC 8.2.0: 161 (SE +/- 0.05, N = 3)
  GCC 9.0.1: 153 (SE +/- 0.03, N = 3)
  Clang 7.0.1: 348 (SE +/- 0.01, N = 3)
  Clang 8.0.0-rc: 378 (SE +/- 0.00, N = 3)
  1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
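The SciMark Composite figures reported earlier in this result file appear to be consistent with a plain arithmetic mean of the five kernel scores. The sketch below checks that against the rounded means shown in the charts; small mismatches of up to 1 Mflops are expected because the displayed kernel values are already rounded. The variable and function names are illustrative only.

```python
compilers = ["GCC 8.2.0", "GCC 9.0.1", "Clang 7.0.1", "Clang 8.0.0-rc"]

# Mean Mflops per kernel as displayed in the charts, in the compiler order above.
kernels = {
    "Fast Fourier Transform": [307, 308, 305, 305],
    "Jacobi Successive Over-Relaxation": [1248, 1248, 940, 944],
    "Monte Carlo": [161, 153, 348, 378],
    "Sparse Matrix Multiply": [1119, 1116, 1724, 1829],
    "Dense LU Matrix Factorization": [3441, 3042, 4605, 4509],
}

# Composite values as displayed in the Composite chart.
reported_composite = [1255, 1173, 1585, 1593]


def kernel_mean(index):
    """Arithmetic mean of the five kernel scores for the compiler at this index."""
    scores = [values[index] for values in kernels.values()]
    return sum(scores) / len(scores)


for i, name in enumerate(compilers):
    print(f"{name:15s} mean of kernels = {kernel_mean(i):7.1f}, "
          f"reported composite = {reported_composite[i]}")
```

Read this way, the Clang builds' higher Composite comes mainly from the Dense LU, Sparse Matrix Multiply, and Monte Carlo kernels, while GCC leads on Jacobi Successive Over-Relaxation and the FFT scores are nearly identical.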
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: --enable-checking=release
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 5 February 2019 20:23 by user phoronix.
GCC 9.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: --enable-checking=release
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 05:10 by user phoronix.
Clang 7.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 08:23 by user phoronix.
Clang 8.0.0-rc Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 12:25 by user phoronix.