POWER9 compiler benchmarking for a future article on Phoronix.
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
GCC 9.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Clang 7.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Clang 8.0.0-rc OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Hierarchical INTegration This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: DOUBLE Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 120M 240M 360M 480M 600M SE +/- 149978.94, N = 3 SE +/- 147477.11, N = 3 SE +/- 30254.60, N = 3 SE +/- 91261.04, N = 3 377718193 379561299 535814531 552839732 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 50M 100M 150M 200M 250M SE +/- 13561.84, N = 3 SE +/- 13328.34, N = 3 SE +/- 14738.26, N = 3 SE +/- 8234.46, N = 3 174464419 176740207 220611266 227407921 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
CacheBench This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 13K 26K 39K 52K 65K SE +/- 13.10, N = 3 SE +/- 25.70, N = 3 SE +/- 0.03, N = 3 SE +/- 9.96, N = 3 58691 58871 21154 19793 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 10K 20K 30K 40K 50K SE +/- 18.02, N = 3 SE +/- 3.02, N = 3 SE +/- 0.52, N = 3 SE +/- 14.98, N = 3 44459 44754 11925 11147 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 1000 2000 3000 4000 5000 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.37, N = 3 SE +/- 0.00, N = 3 4750 4750 4782 4897 1. (CC) gcc options: -lrt
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.1 Video Input: Summer Nature 4K Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 20 40 60 80 100 SE +/- 0.92, N = 3 SE +/- 0.44, N = 3 SE +/- 1.78, N = 3 SE +/- 0.40, N = 3 101.06 103.17 93.23 84.58 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 11 22 33 44 55 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 47.71 47.84 44.39 39.96 -fvisibility=hidden -fvisibility=hidden 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -logg -lm
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 74.95 75.60 58.83 59.03 1. (CC) gcc options: -lm -O3 -mtune=native -mcpu=native
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Enhanced Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 30 60 90 120 150 23 22 154 154 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Noise-Gaussian Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 40 80 120 160 200 SE +/- 0.67, N = 3 15 15 159 157 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Sharpen Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 30 60 90 120 150 20 19 147 148 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Swirl Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 40 80 120 160 200 SE +/- 0.67, N = 3 43 43 164 165 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: HWB Color Space Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 40 80 120 160 200 SE +/- 1.20, N = 3 SE +/- 0.33, N = 3 118 118 183 185 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Resizing Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 40 80 120 160 200 SE +/- 0.67, N = 3 SE +/- 0.58, N = 3 89 88 178 177 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Rotate Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 40 80 120 160 200 159 159 197 197 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.0 H.265 1080p Video Encoding GCC 8.2.0 GCC 9.0.1 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 11.30 11.25 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Compression GCC 8.2.0 GCC 9.0.1 5 10 15 20 25 21 22 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.1 Video Input: Summer Nature 1080p Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 7 14 21 28 35 SE +/- 0.19, N = 3 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 SE +/- 0.39, N = 12 31.38 32.19 28.93 27.69 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
ebizzy This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 200K 400K 600K 800K 1000K SE +/- 18117.64, N = 3 SE +/- 13415.01, N = 3 SE +/- 20488.27, N = 12 SE +/- 17339.39, N = 12 1152232 1071048 1117793 1130592 1. (CC) gcc options: -pthread -lpthread -O3 -mtune=native -mcpu=native
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 5K 10K 15K 20K 25K SE +/- 125.28, N = 3 SE +/- 50.11, N = 3 SE +/- 24.13, N = 3 SE +/- 41.67, N = 3 21209 21444 21110 21008 1. (CC) gcc options: -shared -fPIC -pthread -O3 -mtune=native -mcpu=native
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 13 26 39 52 65 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 56.23 56.53 17.84 17.87 1. (CC) gcc options: -lm -lpthread -O3 -mtune=native -mcpu=native
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Compression GCC 8.2.0 GCC 9.0.1 70 140 210 280 350 322 317 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Decompression GCC 8.2.0 GCC 9.0.1 80 160 240 320 400 353 356 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Compression GCC 8.2.0 GCC 9.0.1 30 60 90 120 150 115 115 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Decompression GCC 8.2.0 GCC 9.0.1 200 400 600 800 1000 SE +/- 9.00, N = 3 SE +/- 0.33, N = 3 790 813 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Compression GCC 8.2.0 GCC 9.0.1 60 120 180 240 300 SE +/- 1.00, N = 3 285 280 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Add Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 11K 22K 33K 44K 55K SE +/- 165.31, N = 3 SE +/- 770.83, N = 4 SE +/- 791.56, N = 3 SE +/- 62.05, N = 3 49578 50110 49781 37853 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 300 600 900 1200 1500 SE +/- 6.10, N = 3 SE +/- 0.37, N = 3 SE +/- 0.23, N = 3 SE +/- 0.67, N = 3 1585 1593 1255 1173 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Append Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 11K 22K 33K 44K 55K SE +/- 251.90, N = 3 SE +/- 79.39, N = 3 SE +/- 218.98, N = 3 SE +/- 131.76, N = 3 51981 52234 51756 39666 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Replace Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 11K 22K 33K 44K 55K SE +/- 246.26, N = 3 SE +/- 509.02, N = 3 SE +/- 796.15, N = 3 SE +/- 31.59, N = 3 52200 52343 52559 39540 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Prepend Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 11K 22K 33K 44K 55K SE +/- 710.17, N = 3 SE +/- 1070.25, N = 3 SE +/- 241.03, N = 3 SE +/- 154.01, N = 3 52563 53049 51972 39442 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 300K 600K 900K 1200K 1500K SE +/- 3347.83, N = 3 SE +/- 13748.37, N = 12 SE +/- 23389.59, N = 3 SE +/- 13393.71, N = 11 1376795 1211944 1217931 1298341 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 1600 3200 4800 6400 8000 SE +/- 34.93, N = 3 SE +/- 12.55, N = 3 SE +/- 44.37, N = 3 SE +/- 14.69, N = 3 6958 7063 7514 7407 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -mtune=native -mcpu=native -lssl -lcrypto -ldl
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 3 6 9 12 15 SE +/- 0.16, N = 3 SE +/- 0.09, N = 3 SE +/- 0.24, N = 12 SE +/- 0.21, N = 3 11.04 11.33 11.37 11.59 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -lz -llzma
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 400K 800K 1200K 1600K 2000K SE +/- 12276.86, N = 3 SE +/- 12435.42, N = 3 SE +/- 24484.83, N = 3 SE +/- 18962.05, N = 12 1721346 1666854 1604180 1618223 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 1 Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 SE +/- 0.11, N = 3 17.92 17.83 17.71 17.80 1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 18.61 18.86 15.73 15.73 -pipe -pipe 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lncurses -lm
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPOP Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 400K 800K 1200K 1600K 2000K SE +/- 12153.33, N = 3 SE +/- 18704.28, N = 10 SE +/- 21388.79, N = 3 SE +/- 5953.84, N = 3 1804132 1729936 1685935 1714327 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 200K 400K 600K 800K 1000K SE +/- 18665.14, N = 3 SE +/- 15331.74, N = 5 SE +/- 9563.35, N = 3 SE +/- 16643.13, N = 4 1113384 1033115 1017131 1096671 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 12 24 36 48 60 SE +/- 0.55, N = 3 SE +/- 0.35, N = 3 SE +/- 0.16, N = 3 SE +/- 0.57, N = 3 52.34 51.98 53.75 52.15 1. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -mtune=native -mcpu=native -maltivec -mabi=altivec -mvsx -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 200K 400K 600K 800K 1000K SE +/- 8833.10, N = 3 SE +/- 5628.72, N = 3 SE +/- 2969.35, N = 3 SE +/- 11192.70, N = 3 867234 836662 791788 820878 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 2 4 6 8 10 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 6.88 7.18 6.65 6.99 1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 0.3825 0.765 1.1475 1.53 1.9125 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.70 1.69 1.54 1.54 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 1.089 2.178 3.267 4.356 5.445 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.84 4.82 4.15 4.13 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.73 8.66 8.22 8.45 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 8.32 8.35 7.69 7.55 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.84 6.85 6.28 6.26 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 300 600 900 1200 1500 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 940 944 1248 1248 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 1000 2000 3000 4000 5000 SE +/- 30.81, N = 3 SE +/- 0.88, N = 3 SE +/- 0.98, N = 3 SE +/- 3.55, N = 3 4605 4509 3441 3042 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 400 800 1200 1600 2000 SE +/- 1.61, N = 3 SE +/- 0.79, N = 3 SE +/- 0.06, N = 3 SE +/- 0.28, N = 3 1724 1829 1119 1116 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 70 140 210 280 350 SE +/- 0.93, N = 3 SE +/- 0.73, N = 3 SE +/- 0.41, N = 3 SE +/- 0.49, N = 3 305 305 307 308 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo Clang 7.0.1 Clang 8.0.0-rc GCC 8.2.0 GCC 9.0.1 80 160 240 320 400 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 348 378 161 153 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 5 February 2019 20:23 by user phoronix.
GCC 9.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 05:10 by user phoronix.
Clang 7.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 08:23 by user phoronix.
Clang 8.0.0-rc Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 12:25 by user phoronix.