Znver2 GCC9 Compiler Tests

AMD Zen 2 GCC compiler benchmarks on Ubuntu Linux. Tests by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1907117-HV-ZNVER2GCC44
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
-O3 -march=x86-64
July 10 2019
  1 Hour, 35 Minutes
-O3 -march=znver1
July 07 2019
  2 Hours, 6 Minutes
-O3 -march=znver2
July 07 2019
  1 Hour, 49 Minutes
Invert Behavior (Only Show Selected Data)
  1 Hour, 50 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Znver2 GCC9 Compiler TestsOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0066 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Baffin [Polaris11] 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.2.0-999-generic (x86_64) 20190703GNOME Shell 3.28.3X Server 1.20.1modesetting 1.20.14.5 Mesa 18.2.2 (LLVM 7.0.0)GCC 9.1.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionZnver2 GCC9 Compiler Tests PerformanceSystem Logs- -O3 -march=x86-64: CXXFLAGS=-O3-march=x86-64 CFLAGS=-O3-march=x86-64- -O3 -march=znver1: CXXFLAGS=-O3-march=znver1 CFLAGS=-O3-march=znver1- -O3 -march=znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2- --disable-multilib --enable-checking=release- Scaling Governor: acpi-cpufreq ondemand- Python 2.7.15+ + Python 3.6.8- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled

-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver2Result OverviewPhoronix Test Suite100%110%120%130%140%John The RipperFFTWSciMarkC-RayMemcached mcperfGraphicsMagickAOBenchFLAC Audio EncodingBullet Physics EngineTSCPLAME MP3 EncodingMKL-DNNCppPerformanceBenchmarksRedisTimed LLVM Compilationx264PostgreSQL pgbenchStockfishTimed PHP CompilationVP9 libvpx EncodingHimeno Benchmarkx265XZ Compression7-Zip Compression

Znver2 GCC9 Compiler Testscpp-perf-bench: Math Libraryfftw: Float + SSE - 2D FFT Size 4096pgbench: Buffer Test - Normal Load - Read Writefftw: Stock - 2D FFT Size 4096build-llvm: Time To Compilevpxenc: vpxenc VP9 1080p Video Encodemcperf: Setmkl-dnn: IP Batch 1D - f32mcperf: Getpgbench: Buffer Test - Normal Load - Read Onlystockfish: Total Timegraphics-magick: Sharpengraphics-magick: Rotategraphics-magick: Resizinghimeno: Poisson Pressure Solverbuild-php: Time To Compileredis: SETmkl-dnn: Deconvolution Batch deconv_1d - f32redis: GETc-ray: Total Time - 4K, 16 Rays Per Pixelaobench: 2048 x 2048 - Total Timecompress-7zip: Compress Speed Testjohn-the-ripper: Blowfishscimark2: Compositecompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9mkl-dnn: Convolution Batch conv_alexnet - f32cpp-perf-bench: Function Objectsencode-flac: WAV To FLACx265: H.265 1080p Video Encodingencode-mp3: WAV To MP3fftw: Float + SSE - 1D FFT Size 4096x264: H.264 Video Encodingfftw: Stock - 1D FFT Size 4096bullet: Rayteststscp: AI Chess Performancebullet: 136 Ragdollsbullet: 1000 Convexbullet: 1000 Stackbullet: 3000 Fallscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carlo-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver2312.0130143.807039.43281.90175.9960770.80152.36107814.85385510.95395379301792612701336.7752.892074827.24221.003026826.3243.2036.4978655284012786.3325.222512.0414.907.7553.337.16143.279534.702.0913339262.173.704.083.372145.576959.693762.43297.13766.81311.771940529380.507660.90284.24174.4060810.38159.95112447.77383329.30399087511932602791345.9553.442080013.12219.293126726.2339.4235.1478412282213128.6525.092562.8914.998.1553.156.9851757141.82114482.1313721482.133.733.983.362291.628631.933702.03260.12757.56309.021996030044.328001.67286.56175.3469121.64158.46110755.89382751.07394717261942762851347.9653.412089609.47216.703090850.6939.4634.6478562202323700.6425.212524.9314.568.1252.917.0456652140.96112312.0613216812.053.593.793.232422.1011431.933575.96274.11799.07OpenBenchmarking.org

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math Library-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6470140210280350SE +/- 1.69, N = 3SE +/- 5.00, N = 3SE +/- 4.58, N = 4309.02311.77312.01-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -std=c++11

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=znver2-O3 -march=znver14K8K12K16K20KSE +/- 71.31, N = 3SE +/- 83.01, N = 31996019405-march=znver2-march=znver11. (CC) gcc options: -pthread -O3 -lm

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver16K12K18K24K30KSE +/- 169.86, N = 3SE +/- 197.60, N = 3SE +/- 361.55, N = 1530143.8030044.3229380.50-march=x86-64-march=znver2-march=znver11. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-642K4K6K8K10KSE +/- 28.05, N = 3SE +/- 34.96, N = 3SE +/- 7.75, N = 38001.677660.907039.43-march=znver2-march=znver11. (CC) gcc options: -pthread -O3 -lm

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To Compile-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver260120180240300281.90284.24286.56

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver14080120160200SE +/- 0.86, N = 3SE +/- 0.55, N = 3SE +/- 0.98, N = 3175.99175.34174.40-march=x86-64-march=znver2-march=znver11. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

Memcached mcperf

This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Set-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6415K30K45K60K75KSE +/- 4868.10, N = 15SE +/- 1488.06, N = 15SE +/- 807.63, N = 469121.6460810.3860770.80-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm -rdynamic

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver14080120160200SE +/- 3.65, N = 15SE +/- 2.44, N = 15SE +/- 2.27, N = 15152.36158.46159.95-march=x86-64 - MIN: 109.43-march=znver2 - MIN: 112.01-march=znver1 - MIN: 108.371. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Memcached mcperf

This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Get-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-6420K40K60K80K100KSE +/- 1283.75, N = 15SE +/- 1084.23, N = 15SE +/- 879.11, N = 15112447.77110755.89107814.85-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -O3 -lm -rdynamic

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver280K160K240K320K400KSE +/- 837.27, N = 3SE +/- 539.80, N = 3SE +/- 738.26, N = 3385510.95383329.30382751.07-march=x86-64-march=znver1-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O3 -march=znver1-O3 -march=x86-64-O3 -march=znver29M18M27M36M45MSE +/- 208989.61, N = 3SE +/- 75524.60, N = 3SE +/- 265193.75, N = 3399087513953793039471726-march=znver1-march=x86-64-march=znver21. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-644080120160200SE +/- 0.58, N = 3194193179-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver160120180240300SE +/- 1.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3276261260-march=znver2-march=x86-64-march=znver11. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6460120180240300SE +/- 1.53, N = 3285279270-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6430060090012001500SE +/- 12.91, N = 3SE +/- 22.06, N = 3SE +/- 13.46, N = 31347.961345.951336.77-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -mavx2

Timed PHP Compilation

This test times how long it takes to build PHP 5 with the Zend engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To Compile-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver11224364860SE +/- 0.34, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 352.8953.4153.44-march=x86-64-march=znver2-march=znver11. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64400K800K1200K1600K2000KSE +/- 26820.33, N = 15SE +/- 26581.97, N = 15SE +/- 28039.49, N = 152089609.472080013.122074827.241. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6450100150200250SE +/- 2.63, N = 6SE +/- 1.89, N = 11SE +/- 2.51, N = 3216.70219.29221.00-march=znver2 - MIN: 202.77-march=znver1 - MIN: 203.58-march=x86-64 - MIN: 202.641. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-64700K1400K2100K2800K3500KSE +/- 62689.04, N = 13SE +/- 61045.92, N = 12SE +/- 58033.83, N = 153126726.233090850.693026826.321. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-641020304050SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 339.4239.4643.20-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -lm -lpthread -O3

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64816243240SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.32, N = 334.6435.1436.49-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -lm -O3

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver120K40K60K80K100KSE +/- 419.64, N = 3SE +/- 205.86, N = 3SE +/- 193.52, N = 37865578562784121. (CXX) g++ options: -pipe -lpthread

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: Blowfish-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver26K12K18K24K30KSE +/- 61.75, N = 3SE +/- 51.72, N = 3SE +/- 50.35, N = 32840128221202321. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-648001600240032004000SE +/- 7.82, N = 3SE +/- 7.70, N = 3SE +/- 27.49, N = 33700.643128.652786.33-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-64612182430SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 325.0925.2125.22-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -pthread -fvisibility=hidden -O3

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver15001000150020002500SE +/- 11.23, N = 3SE +/- 12.02, N = 3SE +/- 19.59, N = 32512.042524.932562.89-march=x86-64 - MIN: 2466.12-march=znver2 - MIN: 2478.96-march=znver1 - MIN: 2476.41. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function Objects-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver148121620SE +/- 0.16, N = 3SE +/- 0.16, N = 3SE +/- 0.01, N = 314.5614.9014.99-march=znver2-march=x86-64-march=znver11. (CXX) g++ options: -O3 -std=c++11

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver1246810SE +/- 0.04, N = 5SE +/- 0.06, N = 5SE +/- 0.04, N = 57.758.128.15-march=x86-64-march=znver2-march=znver11. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver21224364860SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 353.3353.1552.91-march=x86-64-march=znver1-march=znver21. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-64246810SE +/- 0.09, N = 4SE +/- 0.12, N = 3SE +/- 0.07, N = 36.987.047.16-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -O3 -lncurses -lm

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096-O3 -march=znver2-O3 -march=znver112K24K36K48K60KSE +/- 647.09, N = 3SE +/- 455.36, N = 35665251757-march=znver2-march=znver11. (CC) gcc options: -pthread -O3 -lm

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video Encoding-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver2306090120150SE +/- 0.64, N = 3SE +/- 0.82, N = 3SE +/- 0.93, N = 3143.27141.82140.96-march=x86-64-march=znver1-march=znver21. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-642K4K6K8K10KSE +/- 53.59, N = 3SE +/- 123.02, N = 3SE +/- 11.52, N = 311448.0011231.009534.70-march=znver1-march=znver21. (CC) gcc options: -pthread -O3 -lm

Bullet Physics Engine

This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver10.47930.95861.43791.91722.3965SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 32.062.092.13-march=znver2-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O3 -march=znver1-O3 -march=x86-64-O3 -march=znver2300K600K900K1200K1500KSE +/- 1369.78, N = 5SE +/- 10274.76, N = 5SE +/- 8868.66, N = 5137214813339261321681-march=znver1-march=x86-64-march=znver21. (CC) gcc options: -O3 -march=native

Bullet Physics Engine

This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-640.48830.97661.46491.95322.4415SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 32.052.132.17-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver10.83931.67862.51793.35724.1965SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 33.593.703.73-march=znver2-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-640.9181.8362.7543.6724.59SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 33.793.984.08-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-640.75831.51662.27493.03323.7915SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 33.233.363.37-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-645001000150020002500SE +/- 0.28, N = 3SE +/- 0.13, N = 3SE +/- 20.09, N = 32422.102291.622145.57-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-642K4K6K8K10KSE +/- 10.78, N = 3SE +/- 25.99, N = 3SE +/- 73.09, N = 311431.938631.936959.69-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver28001600240032004000SE +/- 48.28, N = 3SE +/- 48.59, N = 3SE +/- 48.59, N = 33762.433702.033575.96-march=x86-64-march=znver1-march=znver21. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver160120180240300SE +/- 3.16, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 3297.13274.11260.12-march=x86-64-march=znver2-march=znver11. (CC) gcc options: -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver12004006008001000SE +/- 1.53, N = 3SE +/- 7.83, N = 3SE +/- 0.16, N = 3799.07766.81757.56-march=znver2-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm