POWER9 compiler benchmarking for a future article on Phoronix.
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
GCC 9.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Clang 7.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Clang 8.0.0-rc OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Hierarchical INTegration This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: DOUBLE Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 120M 240M 360M 480M 600M SE +/- 147477.11, N = 3 SE +/- 149978.94, N = 3 SE +/- 91261.04, N = 3 SE +/- 30254.60, N = 3 379561299 377718193 552839732 535814531 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 50M 100M 150M 200M 250M SE +/- 13328.34, N = 3 SE +/- 13561.84, N = 3 SE +/- 8234.46, N = 3 SE +/- 14738.26, N = 3 176740207 174464419 227407921 220611266 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
CacheBench This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 13K 26K 39K 52K 65K SE +/- 25.70, N = 3 SE +/- 13.10, N = 3 SE +/- 9.96, N = 3 SE +/- 0.03, N = 3 58871 58691 19793 21154 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 10K 20K 30K 40K 50K SE +/- 3.02, N = 3 SE +/- 18.02, N = 3 SE +/- 14.98, N = 3 SE +/- 0.52, N = 3 44754 44459 11147 11925 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 1000 2000 3000 4000 5000 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 SE +/- 0.37, N = 3 4750 4750 4897 4782 1. (CC) gcc options: -lrt
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.1 Video Input: Summer Nature 4K Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 20 40 60 80 100 SE +/- 0.44, N = 3 SE +/- 0.92, N = 3 SE +/- 0.40, N = 3 SE +/- 1.78, N = 3 103.17 101.06 84.58 93.23 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 11 22 33 44 55 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 47.84 47.71 39.96 44.39 -fvisibility=hidden -fvisibility=hidden 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -logg -lm
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 75.60 74.95 59.03 58.83 1. (CC) gcc options: -lm -O3 -mtune=native -mcpu=native
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Enhanced Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 30 60 90 120 150 22 23 154 154 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Noise-Gaussian Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 40 80 120 160 200 SE +/- 0.67, N = 3 15 15 157 159 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Sharpen Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 30 60 90 120 150 19 20 148 147 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Swirl Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 40 80 120 160 200 SE +/- 0.67, N = 3 43 43 165 164 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: HWB Color Space Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 1.20, N = 3 118 118 185 183 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Resizing Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 40 80 120 160 200 SE +/- 0.58, N = 3 SE +/- 0.67, N = 3 88 89 177 178 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Rotate Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 40 80 120 160 200 159 159 197 197 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.0 H.265 1080p Video Encoding GCC 9.0.1 GCC 8.2.0 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 11.25 11.30 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Compression GCC 9.0.1 GCC 8.2.0 5 10 15 20 25 22 21 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.1 Video Input: Summer Nature 1080p Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 7 14 21 28 35 SE +/- 0.18, N = 3 SE +/- 0.19, N = 3 SE +/- 0.39, N = 12 SE +/- 0.09, N = 3 32.19 31.38 27.69 28.93 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
ebizzy This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 200K 400K 600K 800K 1000K SE +/- 13415.01, N = 3 SE +/- 18117.64, N = 3 SE +/- 17339.39, N = 12 SE +/- 20488.27, N = 12 1071048 1152232 1130592 1117793 1. (CC) gcc options: -pthread -lpthread -O3 -mtune=native -mcpu=native
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 5K 10K 15K 20K 25K SE +/- 50.11, N = 3 SE +/- 125.28, N = 3 SE +/- 41.67, N = 3 SE +/- 24.13, N = 3 21444 21209 21008 21110 1. (CC) gcc options: -shared -fPIC -pthread -O3 -mtune=native -mcpu=native
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 13 26 39 52 65 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 56.53 56.23 17.87 17.84 1. (CC) gcc options: -lm -lpthread -O3 -mtune=native -mcpu=native
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Compression GCC 9.0.1 GCC 8.2.0 70 140 210 280 350 317 322 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Decompression GCC 9.0.1 GCC 8.2.0 80 160 240 320 400 356 353 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Compression GCC 9.0.1 GCC 8.2.0 30 60 90 120 150 115 115 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Decompression GCC 9.0.1 GCC 8.2.0 200 400 600 800 1000 SE +/- 0.33, N = 3 813 800 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Compression GCC 9.0.1 GCC 8.2.0 60 120 180 240 300 280 286 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Add Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 11K 22K 33K 44K 55K SE +/- 770.83, N = 4 SE +/- 165.31, N = 3 SE +/- 62.05, N = 3 SE +/- 791.56, N = 3 50110 49578 37853 49781 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 300 600 900 1200 1500 SE +/- 0.37, N = 3 SE +/- 6.10, N = 3 SE +/- 0.67, N = 3 SE +/- 0.23, N = 3 1593 1585 1173 1255 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Append Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 11K 22K 33K 44K 55K SE +/- 79.39, N = 3 SE +/- 251.90, N = 3 SE +/- 131.76, N = 3 SE +/- 218.98, N = 3 52234 51981 39666 51756 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Replace Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 11K 22K 33K 44K 55K SE +/- 509.02, N = 3 SE +/- 246.26, N = 3 SE +/- 31.59, N = 3 SE +/- 796.15, N = 3 52343 52200 39540 52559 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Prepend Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 11K 22K 33K 44K 55K SE +/- 1070.25, N = 3 SE +/- 710.17, N = 3 SE +/- 154.01, N = 3 SE +/- 241.03, N = 3 53049 52563 39442 51972 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 300K 600K 900K 1200K 1500K SE +/- 13748.37, N = 12 SE +/- 3347.83, N = 3 SE +/- 13393.71, N = 11 SE +/- 23389.59, N = 3 1211944 1376795 1298341 1217931 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 1600 3200 4800 6400 8000 SE +/- 12.55, N = 3 SE +/- 34.93, N = 3 SE +/- 14.69, N = 3 SE +/- 44.37, N = 3 7063 6958 7407 7514 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -mtune=native -mcpu=native -lssl -lcrypto -ldl
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 3 6 9 12 15 SE +/- 0.09, N = 3 SE +/- 0.16, N = 3 SE +/- 0.21, N = 3 SE +/- 0.24, N = 12 11.33 11.04 11.59 11.37 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -lz -llzma
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 400K 800K 1200K 1600K 2000K SE +/- 12435.42, N = 3 SE +/- 12276.86, N = 3 SE +/- 18962.05, N = 12 SE +/- 24484.83, N = 3 1666854 1721346 1618223 1604180 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 1 Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 17.83 17.92 17.80 17.71 1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 18.86 18.61 15.73 15.73 -pipe -pipe 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lncurses -lm
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPOP Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 400K 800K 1200K 1600K 2000K SE +/- 18704.28, N = 10 SE +/- 12153.33, N = 3 SE +/- 5953.84, N = 3 SE +/- 21388.79, N = 3 1729936 1804132 1714327 1685935 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 200K 400K 600K 800K 1000K SE +/- 15331.74, N = 5 SE +/- 18665.14, N = 3 SE +/- 16643.13, N = 4 SE +/- 9563.35, N = 3 1033115 1113384 1096671 1017131 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 12 24 36 48 60 SE +/- 0.35, N = 3 SE +/- 0.55, N = 3 SE +/- 0.57, N = 3 SE +/- 0.16, N = 3 51.98 52.34 52.15 53.75 1. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -mtune=native -mcpu=native -maltivec -mabi=altivec -mvsx -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 200K 400K 600K 800K 1000K SE +/- 5628.72, N = 3 SE +/- 8833.10, N = 3 SE +/- 11192.70, N = 3 SE +/- 2969.35, N = 3 836662 867234 820878 791788 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 2 4 6 8 10 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 7.18 6.88 6.99 6.65 1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 0.3825 0.765 1.1475 1.53 1.9125 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.69 1.70 1.54 1.54 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 1.089 2.178 3.267 4.356 5.445 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.82 4.84 4.13 4.15 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.66 8.73 8.45 8.22 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 8.35 8.32 7.55 7.69 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.85 6.84 6.26 6.28 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 300 600 900 1200 1500 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 944 940 1248 1248 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 1000 2000 3000 4000 5000 SE +/- 0.88, N = 3 SE +/- 30.81, N = 3 SE +/- 3.55, N = 3 SE +/- 0.98, N = 3 4509 4605 3042 3441 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 400 800 1200 1600 2000 SE +/- 0.79, N = 3 SE +/- 1.61, N = 3 SE +/- 0.28, N = 3 SE +/- 0.06, N = 3 1829 1724 1116 1119 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 70 140 210 280 350 SE +/- 0.73, N = 3 SE +/- 0.93, N = 3 SE +/- 0.49, N = 3 SE +/- 0.41, N = 3 305 305 308 307 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo Clang 8.0.0-rc Clang 7.0.1 GCC 9.0.1 GCC 8.2.0 80 160 240 320 400 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 378 348 153 161 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 5 February 2019 20:23 by user phoronix.
GCC 9.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 05:10 by user phoronix.
Clang 7.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 08:23 by user phoronix.
Clang 8.0.0-rc Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 12:25 by user phoronix.