Znver2 GCC9 Compiler Tests

AMD Zen 2 GCC compiler benchmarks on Ubuntu Linux. Tests by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1907117-HV-ZNVER2GCC44&grr&sor.

Znver2 GCC9 Compiler TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver2AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0066 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Baffin [Polaris11] 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.2.0-999-generic (x86_64) 20190703GNOME Shell 3.28.3X Server 1.20.1modesetting 1.20.14.5 Mesa 18.2.2 (LLVM 7.0.0)GCC 9.1.0ext43840x2160OpenBenchmarking.orgEnvironment Details- -O3 -march=x86-64: CXXFLAGS=-O3-march=x86-64 CFLAGS=-O3-march=x86-64- -O3 -march=znver1: CXXFLAGS=-O3-march=znver1 CFLAGS=-O3-march=znver1- -O3 -march=znver2: CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2Compiler Details- --disable-multilib --enable-checking=releaseProcessor Details- Scaling Governor: acpi-cpufreq ondemandPython Details- Python 2.7.15+ + Python 3.6.8Security Details- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled

Znver2 GCC9 Compiler Testscpp-perf-bench: Math Libraryfftw: Float + SSE - 2D FFT Size 4096pgbench: Buffer Test - Normal Load - Read Writefftw: Stock - 2D FFT Size 4096build-llvm: Time To Compilevpxenc: vpxenc VP9 1080p Video Encodemcperf: Setmkl-dnn: IP Batch 1D - f32mcperf: Getpgbench: Buffer Test - Normal Load - Read Onlystockfish: Total Timegraphics-magick: Sharpengraphics-magick: Rotategraphics-magick: Resizinghimeno: Poisson Pressure Solverbuild-php: Time To Compileredis: SETmkl-dnn: Deconvolution Batch deconv_1d - f32redis: GETc-ray: Total Time - 4K, 16 Rays Per Pixelaobench: 2048 x 2048 - Total Timecompress-7zip: Compress Speed Testjohn-the-ripper: Blowfishscimark2: Compositecompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9mkl-dnn: Convolution Batch conv_alexnet - f32cpp-perf-bench: Function Objectsencode-flac: WAV To FLACx265: H.265 1080p Video Encodingencode-mp3: WAV To MP3fftw: Float + SSE - 1D FFT Size 4096x264: H.264 Video Encodingfftw: Stock - 1D FFT Size 4096bullet: Rayteststscp: AI Chess Performancebullet: 136 Ragdollsbullet: 1000 Convexbullet: 1000 Stackbullet: 3000 Fallscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carlo-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver2312.0130143.807039.43281.90175.9960770.80152.36107814.85385510.95395379301792612701336.7752.892074827.24221.003026826.3243.2036.4978655284012786.3325.222512.0414.907.7553.337.16143.279534.702.0913339262.173.704.083.372145.576959.693762.43297.13766.81311.771940529380.507660.90284.24174.4060810.38159.95112447.77383329.30399087511932602791345.9553.442080013.12219.293126726.2339.4235.1478412282213128.6525.092562.8914.998.1553.156.9851757141.82114482.1313721482.133.733.983.362291.628631.933702.03260.12757.56309.021996030044.328001.67286.56175.3469121.64158.46110755.89382751.07394717261942762851347.9653.412089609.47216.703090850.6939.4634.6478562202323700.6425.212524.9314.568.1252.917.0456652140.96112312.0613216812.053.593.793.232422.1011431.933575.96274.11799.07OpenBenchmarking.org

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math Library-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6470140210280350SE +/- 1.69, N = 3SE +/- 5.00, N = 3SE +/- 4.58, N = 4309.02311.77312.01-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -std=c++11

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096-O3 -march=znver2-O3 -march=znver14K8K12K16K20KSE +/- 71.31, N = 3SE +/- 83.01, N = 31996019405-march=znver2-march=znver11. (CC) gcc options: -pthread -O3 -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Write-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver16K12K18K24K30KSE +/- 169.86, N = 3SE +/- 197.60, N = 3SE +/- 361.55, N = 1530143.8030044.3229380.50-march=x86-64-march=znver2-march=znver11. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-642K4K6K8K10KSE +/- 28.05, N = 3SE +/- 34.96, N = 3SE +/- 7.75, N = 38001.677660.907039.43-march=znver2-march=znver11. (CC) gcc options: -pthread -O3 -lm

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To Compile-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver260120180240300281.90284.24286.56

VP9 libvpx Encoding

vpxenc VP9 1080p Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.8.0vpxenc VP9 1080p Video Encode-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver14080120160200SE +/- 0.86, N = 3SE +/- 0.55, N = 3SE +/- 0.98, N = 3175.99175.34174.40-march=x86-64-march=znver2-march=znver11. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=c++11

Memcached mcperf

Method: Set

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Set-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6415K30K45K60K75KSE +/- 4868.10, N = 15SE +/- 1488.06, N = 15SE +/- 807.63, N = 469121.6460810.3860770.80-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm -rdynamic

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver14080120160200SE +/- 3.65, N = 15SE +/- 2.44, N = 15SE +/- 2.27, N = 15152.36158.46159.95-march=x86-64 - MIN: 109.43-march=znver2 - MIN: 112.01-march=znver1 - MIN: 108.371. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Memcached mcperf

Method: Get

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: Get-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-6420K40K60K80K100KSE +/- 1283.75, N = 15SE +/- 1084.23, N = 15SE +/- 879.11, N = 15112447.77110755.89107814.85-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -O3 -lm -rdynamic

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver280K160K240K320K400KSE +/- 837.27, N = 3SE +/- 539.80, N = 3SE +/- 738.26, N = 3385510.95383329.30382751.07-march=x86-64-march=znver1-march=znver21. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time-O3 -march=znver1-O3 -march=x86-64-O3 -march=znver29M18M27M36M45MSE +/- 208989.61, N = 3SE +/- 75524.60, N = 3SE +/- 265193.75, N = 3399087513953793039471726-march=znver1-march=x86-64-march=znver21. (CXX) g++ options: -m64 -lpthread -O3 -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-644080120160200SE +/- 0.58, N = 3194193179-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Rotate-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver160120180240300SE +/- 1.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3276261260-march=znver2-march=x86-64-march=znver11. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Resizing-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6460120180240300SE +/- 1.53, N = 3285279270-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -fopenmp -O3 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -ldl -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6430060090012001500SE +/- 12.91, N = 3SE +/- 22.06, N = 3SE +/- 13.46, N = 31347.961345.951336.77-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -mavx2

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To Compile-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver11224364860SE +/- 0.34, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 352.8953.4153.44-march=x86-64-march=znver2-march=znver11. (CC) gcc options: -O3 -pedantic -ldl -lz -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SET-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64400K800K1200K1600K2000KSE +/- 26820.33, N = 15SE +/- 26581.97, N = 15SE +/- 28039.49, N = 152089609.472080013.122074827.241. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-6450100150200250SE +/- 2.63, N = 6SE +/- 1.89, N = 11SE +/- 2.51, N = 3216.70219.29221.00-march=znver2 - MIN: 202.77-march=znver1 - MIN: 203.58-march=x86-64 - MIN: 202.641. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GET-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-64700K1400K2100K2800K3500KSE +/- 62689.04, N = 13SE +/- 61045.92, N = 12SE +/- 58033.83, N = 153126726.233090850.693026826.321. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-641020304050SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 339.4239.4643.20-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -lm -lpthread -O3

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-64816243240SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.32, N = 334.6435.1436.49-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -lm -O3

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver120K40K60K80K100KSE +/- 419.64, N = 3SE +/- 205.86, N = 3SE +/- 193.52, N = 37865578562784121. (CXX) g++ options: -pipe -lpthread

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: Blowfish-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver26K12K18K24K30KSE +/- 61.75, N = 3SE +/- 51.72, N = 3SE +/- 50.35, N = 32840128221202321. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Composite-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-648001600240032004000SE +/- 7.82, N = 3SE +/- 7.70, N = 3SE +/- 27.49, N = 33700.643128.652786.33-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-64612182430SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 325.0925.2125.22-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -pthread -fvisibility=hidden -O3

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver15001000150020002500SE +/- 11.23, N = 3SE +/- 12.02, N = 3SE +/- 19.59, N = 32512.042524.932562.89-march=x86-64 - MIN: 2466.12-march=znver2 - MIN: 2478.96-march=znver1 - MIN: 2476.41. (CXX) g++ options: -O3 -std=c++11 -march=native -mtune=native -fPIC -fopenmp -pie -lmklml_intel -ldl

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function Objects-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver148121620SE +/- 0.16, N = 3SE +/- 0.16, N = 3SE +/- 0.01, N = 314.5614.9014.99-march=znver2-march=x86-64-march=znver11. (CXX) g++ options: -O3 -std=c++11

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver1246810SE +/- 0.04, N = 5SE +/- 0.06, N = 5SE +/- 0.04, N = 57.758.128.15-march=x86-64-march=znver2-march=znver11. (CXX) g++ options: -O3 -fvisibility=hidden -logg -lm

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video Encoding-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver21224364860SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 353.3353.1552.91-march=x86-64-march=znver1-march=znver21. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-64246810SE +/- 0.09, N = 4SE +/- 0.12, N = 3SE +/- 0.07, N = 36.987.047.16-march=znver1-march=znver2-march=x86-641. (CC) gcc options: -O3 -lncurses -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096-O3 -march=znver2-O3 -march=znver112K24K36K48K60KSE +/- 647.09, N = 3SE +/- 455.36, N = 35665251757-march=znver2-march=znver11. (CC) gcc options: -pthread -O3 -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video Encoding-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver2306090120150SE +/- 0.64, N = 3SE +/- 0.82, N = 3SE +/- 0.93, N = 3143.27141.82140.96-march=x86-64-march=znver1-march=znver21. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096-O3 -march=znver1-O3 -march=znver2-O3 -march=x86-642K4K6K8K10KSE +/- 53.59, N = 3SE +/- 123.02, N = 3SE +/- 11.52, N = 311448.0011231.009534.70-march=znver1-march=znver21. (CC) gcc options: -pthread -O3 -lm

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Raytests-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver10.47930.95861.43791.91722.3965SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 32.062.092.13-march=znver2-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance-O3 -march=znver1-O3 -march=x86-64-O3 -march=znver2300K600K900K1200K1500KSE +/- 1369.78, N = 5SE +/- 10274.76, N = 5SE +/- 8868.66, N = 5137214813339261321681-march=znver1-march=x86-64-march=znver21. (CC) gcc options: -O3 -march=native

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 Ragdolls-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-640.48830.97661.46491.95322.4415SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 32.052.132.17-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Convex-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver10.83931.67862.51793.35724.1965SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 33.593.703.73-march=znver2-march=x86-64-march=znver11. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 Stack-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-640.9181.8362.7543.6724.59SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 33.793.984.08-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 Fall-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-640.75831.51662.27493.03323.7915SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 33.233.363.37-march=znver2-march=znver1-march=x86-641. (CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-Relaxation-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-645001000150020002500SE +/- 0.28, N = 3SE +/- 0.13, N = 3SE +/- 20.09, N = 32422.102291.622145.57-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorization-O3 -march=znver2-O3 -march=znver1-O3 -march=x86-642K4K6K8K10KSE +/- 10.78, N = 3SE +/- 25.99, N = 3SE +/- 73.09, N = 311431.938631.936959.69-march=znver2-march=znver1-march=x86-641. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiply-O3 -march=x86-64-O3 -march=znver1-O3 -march=znver28001600240032004000SE +/- 48.28, N = 3SE +/- 48.59, N = 3SE +/- 48.59, N = 33762.433702.033575.96-march=x86-64-march=znver1-march=znver21. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transform-O3 -march=x86-64-O3 -march=znver2-O3 -march=znver160120180240300SE +/- 3.16, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 3297.13274.11260.12-march=x86-64-march=znver2-march=znver11. (CC) gcc options: -O3 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carlo-O3 -march=znver2-O3 -march=x86-64-O3 -march=znver12004006008001000SE +/- 1.53, N = 3SE +/- 7.83, N = 3SE +/- 0.16, N = 3799.07766.81757.56-march=znver2-march=x86-64-march=znver11. (CC) gcc options: -O3 -lm


Phoronix Test Suite v10.8.5