POWER9 Talos II Compiler Benchmarks

POWER9 compiler benchmarking for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1902066-SP-POWER9TAL97&rdt&grs.

POWER9 Talos II Compiler BenchmarksProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen ResolutionGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rcPOWER9 @ 3.80GHz (44 Cores / 176 Threads)PowerNV T2P9D01 REV 1.0165536MBSamsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1ASPEED Family2 x Broadcom NetXtreme BCM5719 PCIeUbuntu 19.044.18.0-11-generic (ppc64le)GCC 8.2.0 + clang (GCC) 8.2.0ext41024x768GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental)Clang 7.0.1 + LLVM 7.0.1Clang 8.0.0 + LLVM 8.0.0OpenBenchmarking.orgEnvironment Details- CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Details- GCC 8.2.0: --enable-checking=release- GCC 9.0.1: --enable-checking=release- Clang 7.0.1: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9- Clang 8.0.0-rc: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Details- Scaling Governor: powernv-cpufreq ondemand

POWER9 Talos II Compiler Benchmarksgraphics-magick: Sharpengraphics-magick: Enhancedcachebench: Writegraphics-magick: Swirlc-ray: Total Time - 4K, 16 Rays Per Pixelcachebench: Read / Modify / Writescimark2: Monte Carlographics-magick: Resizinggraphics-magick: Noise-Gaussianscimark2: Sparse Matrix Multiplygraphics-magick: HWB Color Spacescimark2: Dense LU Matrix Factorizationhint: DOUBLEscimark2: Compositemcperf: Prependmcperf: Replacescimark2: Jacobi Successive Over-Relaxationmcperf: Getmcperf: Addmcperf: Appendmcperf: Deletemcperf: Sethint: FLOATaobench: 2048 x 2048 - Total Timegraphics-magick: Rotatedav1d: Summer Nature 4Kencode-mp3: WAV To MP3encode-flac: WAV To FLACtscp: AI Chess Performancebullet: 136 Ragdollsdav1d: Summer Nature 1080predis: SADDcpp-perf-bench: Ctypecpp-perf-bench: Stepanov Abstractionbullet: 1000 Stackbullet: Prim Trimeshbullet: Raytestsredis: LPUSHredis: SETbullet: 3000 Fallbullet: Convex Trimeshcpp-perf-bench: Stepanov Vectoropenssl: RSA 4096-bit Performancet-test1: 2redis: GETredis: LPOPbullet: 1000 Convexlzbench: XZ 0 - Compressioncompress-pbzip2: 256MB File Compressionx264: H.264 Video Encodingcachebench: Readlzbench: Zstd 1 - Decompressionlzbench: XZ 0 - Decompressiontjbench: Decompression Throughputcompress-7zip: Compress Speed Testcpp-perf-bench: Function Objectslzbench: Brotli 0 - Decompressionlzbench: Zstd 1 - Compressionapache: Static Web Page Servinglzbench: Brotli 0 - Compressionxsbench: t-test1: 1scimark2: Fast Fourier Transformlzbench: Libdeflate 1 - Decompressioncpp-perf-bench: Atolx265: H.265 1080p Video Encodingm-queens: Time To Solvelzbench: Libdeflate 1 - Compressioncompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19ebizzy: himeno: Poisson Pressure Solvermafft: Multiple Sequence AlignmentGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1471541192516417.842115416117815911191833441535814531125551972525591248740154978151756745154974422061126658.8319793.2315.7344.397143684.1528.93121793175.0259.987.691.544.9179178810171316.281.9514275146.65160418016859358.22212.3453.7547828007010615787020.0029228621110322556000717.7130735389.1711.3020.3611511.3711177936184.511481541114716517.871979315317715711161853042552839732117339442395401248562853785339666565993813522740792159.0319784.5815.7339.967402934.1327.69129834174.8656.897.551.544.9182087810966716.261.9814074076.99161822317143278.45222.2652.1548978137210615369219.7029928021008317548622017.8030835689.2411.2520.3511511.5911305926563.792023444594356.235869134889151724118460537771819315855256352200940740614957851981744094945317446441974.95159101.0618.6147.718357784.8431.38137679566.2553.048.321.705.3886723411133846.842.1213169586.88172134618041328.7352.34475010919.532120917.9230589.7511.0411522325944.391922447544356.535887137888151829118450937956129915935304952343944745475011052234739184968217674020775.60159103.1718.8647.848499664.8232.19121194466.3453.248.351.695.4183666210331156.852.1113270637.18166685417299368.6651.98475010719.502144417.8330589.4911.3310710486234.24OpenBenchmarking.org

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc3060901201501471482019-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc3060901201501541542322-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: WriteGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc10K20K30K40K50KSE +/- 0.52, N = 3SE +/- 14.98, N = 3SE +/- 18.02, N = 3SE +/- 3.02, N = 3119251114744459447541. (CC) gcc options: -lrt

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 0.67, N = 31641654343-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1326395265SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 317.8417.8756.2356.531. (CC) gcc options: -lm -lpthread -O3 -mtune=native -mcpu=native

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / WriteGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc13K26K39K52K65KSE +/- 0.03, N = 3SE +/- 9.96, N = 3SE +/- 13.10, N = 3SE +/- 25.70, N = 3211541979358691588711. (CC) gcc options: -lrt

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc80160240320400SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 31611533483781. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 0.67, N = 3SE +/- 0.58, N = 31781778988-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 0.67, N = 31591571515-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc400800120016002000SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 1.61, N = 3SE +/- 0.79, N = 311191116172418291. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 1.20, N = 3SE +/- 0.33, N = 3183185118118-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc10002000300040005000SE +/- 0.98, N = 3SE +/- 3.55, N = 3SE +/- 30.81, N = 3SE +/- 0.88, N = 334413042460545091. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

Hierarchical INTegration

Test: DOUBLE

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: DOUBLEGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc120M240M360M480M600MSE +/- 30254.60, N = 3SE +/- 91261.04, N = 3SE +/- 149978.94, N = 3SE +/- 147477.11, N = 35358145315528397323777181933795612991. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc30060090012001500SE +/- 0.23, N = 3SE +/- 0.67, N = 3SE +/- 6.10, N = 3SE +/- 0.37, N = 312551173158515931. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

Memcached mcperf

Method: Prepend

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: PrependGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 241.03, N = 3SE +/- 154.01, N = 3SE +/- 710.17, N = 3SE +/- 1070.25, N = 3519723944252563530491. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Memcached mcperf

Method: Replace

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: ReplaceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 796.15, N = 3SE +/- 31.59, N = 3SE +/- 246.26, N = 3SE +/- 509.02, N = 3525593954052200523431. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc30060090012001500SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3124812489409441. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

Memcached mcperf

Method: Get

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: GetGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc16K32K48K64K80KSE +/- 368.57, N = 3SE +/- 479.89, N = 3SE +/- 103.90, N = 3SE +/- 753.43, N = 11740155628574061745471. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Memcached mcperf

Method: Add

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: AddGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 791.56, N = 3SE +/- 62.05, N = 3SE +/- 165.31, N = 3SE +/- 770.83, N = 4497813785349578501101. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Memcached mcperf

Method: Append

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: AppendGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 218.98, N = 3SE +/- 131.76, N = 3SE +/- 251.90, N = 3SE +/- 79.39, N = 3517563966651981522341. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Memcached mcperf

Method: Delete

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: DeleteGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc16K32K48K64K80KSE +/- 838.42, N = 9SE +/- 202.38, N = 3SE +/- 902.55, N = 8SE +/- 113.11, N = 3745155659974409739181. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Memcached mcperf

Method: Set

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: SetGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 973.53, N = 3SE +/- 57.29, N = 3SE +/- 492.09, N = 3SE +/- 89.60, N = 3497443813549453496821. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc50M100M150M200M250MSE +/- 14738.26, N = 3SE +/- 8234.46, N = 3SE +/- 13561.84, N = 3SE +/- 13328.34, N = 32206112662274079211744644191767402071. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 358.8359.0374.9575.601. (CC) gcc options: -lm -O3 -mtune=native -mcpu=native

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200197197159159-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 4KGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 1.78, N = 3SE +/- 0.40, N = 3SE +/- 0.92, N = 3SE +/- 0.44, N = 393.2384.58101.06103.171. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 315.7315.7318.6118.86-pipe-pipe1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lncurses -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1122334455SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 544.3939.9647.7147.84-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -logg -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000K7143687402938357788499661. (CC) gcc options: -O3 -mtune=native -mcpu=native

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1.0892.1783.2674.3565.445SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.154.134.844.82-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 1080pGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc714212835SE +/- 0.09, N = 3SE +/- 0.39, N = 12SE +/- 0.19, N = 3SE +/- 0.18, N = 328.9327.6931.3832.191. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADDGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc300K600K900K1200K1500KSE +/- 23389.59, N = 3SE +/- 13393.71, N = 11SE +/- 3347.83, N = 3SE +/- 13748.37, N = 1212179311298341137679512119441. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

CppPerformanceBenchmarks

Test: Ctype

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 375.0274.8666.2566.341. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1326395265SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 359.9856.8953.0453.241. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 37.697.558.328.35-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc0.38250.7651.14751.531.9125SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.541.541.701.69-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1.21732.43463.65194.86926.0865SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.914.915.385.41-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSHGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000KSE +/- 2969.35, N = 3SE +/- 11192.70, N = 3SE +/- 8833.10, N = 3SE +/- 5628.72, N = 37917888208788672348366621. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SETGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000KSE +/- 9563.35, N = 3SE +/- 16643.13, N = 4SE +/- 18665.14, N = 3SE +/- 15331.74, N = 510171311096671111338410331151. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.286.266.846.85-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc0.4770.9541.4311.9082.385SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.951.982.122.11-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

CppPerformanceBenchmarks

Test: Stepanov Vector

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31421401311321. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc16003200480064008000SE +/- 44.37, N = 3SE +/- 14.69, N = 3SE +/- 34.93, N = 3SE +/- 12.55, N = 37514740769587063-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -mtune=native -mcpu=native -lssl -lcrypto -ldl

t-test1

Threads: 2

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 2GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 36.656.996.887.181. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GETGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc400K800K1200K1600K2000KSE +/- 24484.83, N = 3SE +/- 18962.05, N = 12SE +/- 12276.86, N = 3SE +/- 12435.42, N = 316041801618223172134616668541. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOPGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc400K800K1200K1600K2000KSE +/- 21388.79, N = 3SE +/- 5953.84, N = 3SE +/- 12153.33, N = 3SE +/- 18704.28, N = 1016859351714327180413217299361. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.228.458.738.66-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: CompressionGCC 8.2.0GCC 9.0.151015202521221. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Parallel BZIP2 Compression

256MB File Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterParallel BZIP2 Compression 1.1.12256MB File CompressionGCC 8.2.0GCC 9.0.10.52651.0531.57952.1062.6325SE +/- 0.03, N = 3SE +/- 0.03, N = 52.342.261. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1224364860SE +/- 0.16, N = 3SE +/- 0.57, N = 3SE +/- 0.55, N = 3SE +/- 0.35, N = 353.7552.1552.3451.981. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -mtune=native -mcpu=native -maltivec -mabi=altivec -mvsx -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: ReadGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc10002000300040005000SE +/- 0.37, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 347824897475047501. (CC) gcc options: -lrt

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: DecompressionGCC 8.2.0GCC 9.0.12004006008001000SE +/- 9.00, N = 3SE +/- 0.33, N = 37908131. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: DecompressionGCC 8.2.0GCC 9.0.1163248648070721. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression ThroughputGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3106106109107-lm-lm-lm1. (CC) gcc options: -O3 -mtune=native -mcpu=native

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 8.2.0GCC 9.0.130K60K90K120K150KSE +/- 2707.02, N = 4SE +/- 2561.27, N = 31578701536921. (CXX) g++ options: -pipe -lpthread

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 320.0019.7019.5319.501. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: DecompressionGCC 8.2.0GCC 9.0.170140210280350SE +/- 0.33, N = 32922991. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: CompressionGCC 8.2.0GCC 9.0.160120180240300SE +/- 1.00, N = 32852801. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc5K10K15K20K25KSE +/- 24.13, N = 3SE +/- 41.67, N = 3SE +/- 125.28, N = 3SE +/- 50.11, N = 3211102100821209214441. (CC) gcc options: -shared -fPIC -pthread -O3 -mtune=native -mcpu=native

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: CompressionGCC 8.2.0GCC 9.0.1701402102803503223171. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Xsbench

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06GCC 8.2.0GCC 9.0.11.2M2.4M3.6M4.8M6MSE +/- 58462.89, N = 3SE +/- 62432.57, N = 12556000754862201. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

t-test1

Threads: 1

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 1GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc48121620SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 317.7117.8017.9217.831. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc70140210280350SE +/- 0.41, N = 3SE +/- 0.49, N = 3SE +/- 0.93, N = 3SE +/- 0.73, N = 33073083053051. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: DecompressionGCC 8.2.0GCC 9.0.1801602403204003533561. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

CppPerformanceBenchmarks

Test: Atol

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 389.1789.2489.7589.491. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingGCC 8.2.0GCC 9.0.13691215SE +/- 0.09, N = 3SE +/- 0.03, N = 311.3011.251. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic -lpthread -lrt -ldl -lnuma

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveGCC 8.2.0GCC 9.0.1510152025SE +/- 0.05, N = 3SE +/- 0.01, N = 320.3620.351. (CXX) g++ options: -fopenmp -O3 -mtune=native -mcpu=native -O2

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: CompressionGCC 8.2.0GCC 9.0.13060901201501151151. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc3691215SE +/- 0.24, N = 12SE +/- 0.21, N = 3SE +/- 0.16, N = 3SE +/- 0.09, N = 311.3711.5911.0411.331. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -lz -llzma

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000KSE +/- 20488.27, N = 12SE +/- 17339.39, N = 12SE +/- 18117.64, N = 3SE +/- 13415.01, N = 311177931130592115223210710481. (CC) gcc options: -pthread -lpthread -O3 -mtune=native -mcpu=native

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc140280420560700SE +/- 14.49, N = 12SE +/- 8.80, N = 3SE +/- 7.79, N = 7SE +/- 7.21, N = 126186565946231. (CC) gcc options: -O3 -mtune=native -mcpu=native

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence AlignmentGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1.01482.02963.04444.05925.074SE +/- 0.22, N = 12SE +/- 0.25, N = 12SE +/- 0.37, N = 9SE +/- 0.30, N = 94.513.794.394.241. (CC) gcc options: -std=c99 -O3 -lm -lpthread


Phoronix Test Suite v10.8.5