EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

patch testing by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2102055-HA-EPYC7F52L26&grr&sor.

EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

System configuration (runs: Ondemand, Schedutil, Performance)

Processor: AMD EPYC 7F52 16-Core @ 3.91GHz (16 Cores / 32 Threads)
Motherboard: Supermicro H11DSi-NT v2.00 (2.1 BIOS)
Chipset: AMD Starship/Matisse
Memory: 8 x 8192 MB DDR4-3200MT/s HMA81GR7CJR8N-XN
Disk: 280GB INTEL SSDPE21D280GA
Graphics: ASPEED
Monitor: VE228
OS: Ubuntu 20.04
Kernel: 5.11.0-rc6-phx (x86_64) 20210203
Desktop: GNOME Shell 3.36.1
Display Server: X Server 1.20.7
Display Driver: aspeed
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080, 1024x768

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Ondemand: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034
- Schedutil: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301034
- Performance: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x8301034

Python Details
- Python 2.7.18rc1 + Python 3.8.2

Security Details
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
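The three runs differ only in the CPU frequency scaling governor listed under Processor Details. As a minimal sketch, assuming a Linux system that exposes the standard cpufreq sysfs files, the active governor per CPU could be read as follows. The function name `read_governors` and its `sysfs_root` parameter are illustrative and not part of the Phoronix Test Suite.

```python
from pathlib import Path


def read_governors(sysfs_root="/sys/devices/system/cpu"):
    """Return {cpu_name: governor} for every CPU that exposes cpufreq.

    sysfs_root is a parameter only so the function can be pointed at a
    test directory; on a real system the default path is the usual one.
    """
    governors = {}
    pattern = "cpu[0-9]*/cpufreq/scaling_governor"
    for path in sorted(Path(sysfs_root).glob(pattern)):
        cpu_name = path.parent.parent.name  # e.g. "cpu0"
        governors[cpu_name] = path.read_text().strip()
    return governors


if __name__ == "__main__":
    for cpu, governor in read_governors().items():
        print(cpu, governor)
```

On a machine without cpufreq support the function simply returns an empty dictionary.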

EPYC Ondemand vs. Performance vs. Schedutil Linux 5.11

Test | Ondemand | Schedutil | Performance
blender: Barbershop - CPU-Only | 357.57 | 357.49 | 356.90
openvkl: vklBenchmark | 219 | 217 | 223
ospray: San Miguel - Path Tracer | 1.86 | 1.87 | 1.87
rodinia: OpenMP LavaMD | 126.389 | 125.977 | 125.881
dav1d: Chimera 1080p 10-bit | 119.13 | 92.21 | 131.74
rodinia: OpenMP HotSpot3D | 90.042 | 91.310 | 92.087
rodinia: OpenMP Leukocyte | 89.269 | 90.431 | 89.709
build-gdb: Time To Compile | 95.492 | 89.606 | 83.709
blender: BMW27 - CPU-Only | 83.66 | 83.57 | 83.73
build-godot: Time To Compile | 84.472 | 83.313 | 83.164
tensorflow-lite: Inception V4 | 1504413 | 1503093 | 1503297
compress-rar: Linux Source Tree Archiving To RAR | 74.983 | 76.921 | 67.676
tensorflow-lite: Inception ResNet V2 | 1351440 | 1350653 | 1350667
luxcorerender: DLSC | 3.15 | 3.19 | 3.18
luxcorerender: Rainbow Colors and Prism | 3.48 | 3.57 | 3.51
tensorflow-lite: SqueezeNet | 107260 | 107294 | 107390
tensorflow-lite: NASNet Mobile | 126878 | 125981 | 126203
tensorflow-lite: Mobilenet Quant | 70655.1 | 70760.6 | 70739.1
tensorflow-lite: Mobilenet Float | 69039.3 | 69012.9 | 68976.9
john-the-ripper: MD5 | 1719667 | 1721000 | 1721333
graphics-magick: Sharpen | 236 | 236 | 237
graphics-magick: Enhanced | 376 | 376 | 377
graphics-magick: Noise-Gaussian | 421 | 421 | 420
graphics-magick: Rotate | 638 | 639 | 629
graphics-magick: Resizing | 1582 | 1587 | 1609
graphics-magick: Swirl | 922 | 923 | 928
graphics-magick: HWB Color Space | 1271 | 1121 | 1239
hugin: Panorama Photo Assistant + Stitching Time | 50.654 | 50.464 | 50.408
clomp: Static OMP Speedup | 49.7 | 50.3 | 50.7
ospray: San Miguel - SciVis | 24.39 | 24.39 | 24.39
ospray: NASA Streamlines - Path Tracer | 7.09 | 7.11 | 7.11
john-the-ripper: Blowfish | 26298 | 26310 | 26323
x265: Bosphorus 4K | 21.65 | 21.05 | 21.50
pennant: sedovbig | 25.61074 | 25.59670 | 25.54257
kvazaar: Bosphorus 4K - Very Fast | 25.35 | 26.27 | 26.39
redis: SET | 1308179.83 | 1313860.03 | 1318380.42
ocrmypdf: Processing 60 Page PDF Document | 19.546 | 19.483 | 19.281
dav1d: Chimera 1080p | 625.43 | 536.65 | 713.18
svt-av1: Enc Mode 4 - 1080p | 5.244 | 5.379 | 5.431
pennant: leblancbig | 16.02967 | 15.99347 | 15.90980
redis: GET | 1545026.54 | 1599356.29 | 1604323.00
dav1d: Summer Nature 4K | 254.29 | 246.86 | 264.98
oidn: Memorial | 14.19 | 14.26 | 14.23
rodinia: OpenMP Streamcluster | 13.769 | 13.754 | 13.705
rodinia: OpenMP CFD Solver | 13.243 | 13.247 | 13.331
kvazaar: Bosphorus 4K - Ultra Fast | 46.86 | 47.63 | 47.80
x265: Bosphorus 1080p | 65.47 | 55.07 | 68.48
svt-av1: Enc Mode 8 - 1080p | 41.023 | 43.040 | 42.823
ospray: NASA Streamlines - SciVis | 33.33 | 33.33 | 33.33
kvazaar: Bosphorus 1080p - Very Fast | 87.16 | 86.89 | 92.22
dav1d: Summer Nature 1080p | 592.34 | 488.95 | 668.50
kvazaar: Bosphorus 1080p - Ultra Fast | 155.97 | 153.01 | 161.97
x264: H.264 Video Encoding | 172.64 | 172.60 | 176.54
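Because the table mixes "fewer is better" and "more is better" metrics, raw values are not directly comparable across tests. The sketch below shows one way to normalize a few rows against a baseline run; the row selection, the `RESULTS` layout, and the helper name `relative_to` are illustrative assumptions, while the numbers are copied from the table and the direction from each test's chart label.

```python
# Values copied from the results table above. The direction ("more" or
# "fewer") comes from each chart's "More/Fewer Is Better" label.
RESULTS = {
    "dav1d: Chimera 1080p 10-bit": (
        "more",
        {"Ondemand": 119.13, "Schedutil": 92.21, "Performance": 131.74},
    ),
    "build-gdb: Time To Compile": (
        "fewer",
        {"Ondemand": 95.492, "Schedutil": 89.606, "Performance": 83.709},
    ),
    "x265: Bosphorus 1080p": (
        "more",
        {"Ondemand": 65.47, "Schedutil": 55.07, "Performance": 68.48},
    ),
}


def relative_to(baseline, direction, values):
    """Return each run's score as a fraction of the baseline run.

    1.0 means equal to the baseline, below 1.0 means worse. For
    "fewer is better" metrics the ratio is inverted so that the
    interpretation stays the same.
    """
    base = values[baseline]
    if direction == "more":
        return {run: value / base for run, value in values.items()}
    return {run: base / value for run, value in values.items()}


if __name__ == "__main__":
    for test, (direction, values) in RESULTS.items():
        ratios = relative_to("Performance", direction, values)
        summary = ", ".join(f"{run}: {ratio:.1%}" for run, ratio in ratios.items())
        print(f"{test} -> {summary}")
```

With Performance as the baseline, Schedutil lands at roughly 70% on the dav1d Chimera 1080p 10-bit row, which is one of the largest gaps in this result set.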

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 2.90 (Seconds, Fewer Is Better)
Performance: 356.90 (SE +/- 0.68, N = 3)
Schedutil: 357.49 (SE +/- 0.45, N = 3)
Ondemand: 357.57 (SE +/- 0.51, N = 3)

OpenVKL

Benchmark: vklBenchmark

OpenVKL 0.9 (Items / Sec, More Is Better)
Performance: 223 (MIN: 1 / MAX: 785)
Ondemand: 219 (MIN: 1 / MAX: 776)
Schedutil: 217 (MIN: 1 / MAX: 786)
Reported SE values: +/- 0.58, N = 3; +/- 0.88, N = 3

OSPray

Demo: San Miguel - Renderer: Path Tracer

OSPray 1.8.5 (FPS, More Is Better)
Performance: 1.87 (SE +/- 0.00, N = 3; MIN: 1.86 / MAX: 1.88)
Schedutil: 1.87 (SE +/- 0.00, N = 3; MIN: 1.85 / MAX: 1.88)
Ondemand: 1.86 (SE +/- 0.00, N = 3; MIN: 1.85 / MAX: 1.88)

Rodinia

Test: OpenMP LavaMD

Rodinia 3.1 (Seconds, Fewer Is Better)
Performance: 125.88 (SE +/- 0.28, N = 3)
Schedutil: 125.98 (SE +/- 0.27, N = 3)
Ondemand: 126.39 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.1 (FPS, More Is Better)
Performance: 131.74 (SE +/- 0.09, N = 3; MIN: 85.18 / MAX: 277.62)
Ondemand: 119.13 (SE +/- 0.19, N = 3; MIN: 78.3 / MAX: 259.88)
Schedutil: 92.21 (SE +/- 0.08, N = 3; MIN: 59.88 / MAX: 204.44)
1. (CC) gcc options: -pthread

Rodinia

Test: OpenMP HotSpot3D

Rodinia 3.1 (Seconds, Fewer Is Better)
Ondemand: 90.04 (SE +/- 0.08, N = 3)
Schedutil: 91.31 (SE +/- 0.24, N = 3)
Performance: 92.09 (SE +/- 0.46, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Leukocyte

Rodinia 3.1 (Seconds, Fewer Is Better)
Ondemand: 89.27 (SE +/- 0.58, N = 3)
Performance: 89.71 (SE +/- 0.29, N = 3)
Schedutil: 90.43 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Timed GDB GNU Debugger Compilation

Time To Compile

Timed GDB GNU Debugger Compilation 9.1 (Seconds, Fewer Is Better)
Performance: 83.71 (SE +/- 0.08, N = 3)
Schedutil: 89.61 (SE +/- 0.10, N = 3)
Ondemand: 95.49 (SE +/- 0.04, N = 3)

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 2.90 (Seconds, Fewer Is Better)
Schedutil: 83.57 (SE +/- 0.25, N = 3)
Ondemand: 83.66 (SE +/- 0.38, N = 3)
Performance: 83.73 (SE +/- 0.50, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
Performance: 83.16 (SE +/- 0.07, N = 3)
Schedutil: 83.31 (SE +/- 0.05, N = 3)
Ondemand: 84.47 (SE +/- 0.04, N = 3)

TensorFlow Lite

Model: Inception V4

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Schedutil: 1503093 (SE +/- 262.95, N = 3)
Performance: 1503297 (SE +/- 236.24, N = 3)
Ondemand: 1504413 (SE +/- 385.24, N = 3)

RAR Compression

Linux Source Tree Archiving To RAR

RAR Compression 5.6.1 (Seconds, Fewer Is Better)
Performance: 67.68 (SE +/- 0.24, N = 3)
Ondemand: 74.98 (SE +/- 0.18, N = 3)
Schedutil: 76.92 (SE +/- 0.41, N = 3)

TensorFlow Lite

Model: Inception ResNet V2

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Schedutil: 1350653 (SE +/- 394.05, N = 3)
Performance: 1350667 (SE +/- 483.75, N = 3)
Ondemand: 1351440 (SE +/- 365.29, N = 3)

LuxCoreRender

Scene: DLSC

LuxCoreRender 2.3 (M samples/sec, More Is Better)
Schedutil: 3.19 (SE +/- 0.05, N = 3; MIN: 3.04 / MAX: 3.41)
Performance: 3.18 (SE +/- 0.04, N = 3; MIN: 3.05 / MAX: 3.39)
Ondemand: 3.15 (SE +/- 0.01, N = 3; MIN: 3.06 / MAX: 3.29)

LuxCoreRender

Scene: Rainbow Colors and Prism

LuxCoreRender 2.3 (M samples/sec, More Is Better)
Schedutil: 3.57 (SE +/- 0.01, N = 3; MIN: 3.48 / MAX: 3.6)
Performance: 3.51 (SE +/- 0.01, N = 3; MIN: 3.48 / MAX: 3.55)
Ondemand: 3.48 (SE +/- 0.01, N = 3; MIN: 3.44 / MAX: 3.5)

TensorFlow Lite

Model: SqueezeNet

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Ondemand: 107260 (SE +/- 84.90, N = 3)
Schedutil: 107294 (SE +/- 80.49, N = 3)
Performance: 107390 (SE +/- 43.47, N = 3)

TensorFlow Lite

Model: NASNet Mobile

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Schedutil: 125981 (SE +/- 40.55, N = 3)
Performance: 126203 (SE +/- 287.23, N = 3)
Ondemand: 126878 (SE +/- 136.25, N = 3)

TensorFlow Lite

Model: Mobilenet Quant

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Ondemand: 70655.1 (SE +/- 45.84, N = 3)
Performance: 70739.1 (SE +/- 30.62, N = 3)
Schedutil: 70760.6 (SE +/- 30.64, N = 3)

TensorFlow Lite

Model: Mobilenet Float

TensorFlow Lite 2020-08-23 (Microseconds, Fewer Is Better)
Performance: 68976.9 (SE +/- 36.63, N = 3)
Schedutil: 69012.9 (SE +/- 21.41, N = 3)
Ondemand: 69039.3 (SE +/- 42.21, N = 3)

John The Ripper

Test: MD5

John The Ripper 1.9.0-jumbo-1 (Real C/S, More Is Better)
Performance: 1721333 (SE +/- 2962.73, N = 3)
Schedutil: 1721000 (SE +/- 3000.00, N = 3)
Ondemand: 1719667 (SE +/- 3179.80, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Performance: 237
Schedutil: 236
Ondemand: 236
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Performance: 377
Schedutil: 376
Ondemand: 376
Reported SE values: +/- 0.33, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Schedutil: 421 (SE +/- 0.58, N = 3)
Ondemand: 421 (SE +/- 0.67, N = 3)
Performance: 420 (SE +/- 0.67, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Schedutil: 639 (SE +/- 4.18, N = 3)
Ondemand: 638 (SE +/- 4.70, N = 3)
Performance: 629 (SE +/- 4.18, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Performance: 1609 (SE +/- 1.20, N = 3)
Schedutil: 1587 (SE +/- 18.52, N = 3)
Ondemand: 1582 (SE +/- 5.49, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Performance: 928
Schedutil: 923
Ondemand: 922
Reported SE values: +/- 0.88, N = 3
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Ondemand: 1271 (SE +/- 2.52, N = 3)
Performance: 1239 (SE +/- 3.67, N = 3)
Schedutil: 1121 (SE +/- 2.40, N = 3)
1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Hugin

Panorama Photo Assistant + Stitching Time

Hugin (Seconds, Fewer Is Better)
Performance: 50.41 (SE +/- 0.29, N = 3)
Schedutil: 50.46 (SE +/- 0.17, N = 3)
Ondemand: 50.65 (SE +/- 0.33, N = 3)

CLOMP

Static OMP Speedup

CLOMP 1.2 (Speedup, More Is Better)
Performance: 50.7 (SE +/- 0.43, N = 3)
Schedutil: 50.3 (SE +/- 0.33, N = 3)
Ondemand: 49.7 (SE +/- 0.25, N = 3)
1. (CC) gcc options: -fopenmp -O3 -lm

OSPray

Demo: San Miguel - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
Performance: 24.39 (SE +/- 0.00, N = 3; MIN: 23.26 / MAX: 26.32)
Schedutil: 24.39 (SE +/- 0.00, N = 3; MIN: 23.26 / MAX: 26.32)
Ondemand: 24.39 (SE +/- 0.00, N = 3; MIN: 23.26 / MAX: 26.32)

OSPray

Demo: NASA Streamlines - Renderer: Path Tracer

OSPray 1.8.5 (FPS, More Is Better)
Performance: 7.11 (SE +/- 0.02, N = 3; MIN: 6.99 / MAX: 7.25)
Schedutil: 7.11 (SE +/- 0.02, N = 3; MIN: 6.99 / MAX: 7.19)
Ondemand: 7.09 (SE +/- 0.00, N = 3; MIN: 6.99 / MAX: 7.19)

John The Ripper

Test: Blowfish

John The Ripper 1.9.0-jumbo-1 (Real C/S, More Is Better)
Performance: 26323 (SE +/- 23.97, N = 3)
Schedutil: 26310 (SE +/- 22.45, N = 3)
Ondemand: 26298 (SE +/- 22.67, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

x265

Video Input: Bosphorus 4K

x265 3.4 (Frames Per Second, More Is Better)
Ondemand: 21.65 (SE +/- 0.12, N = 3)
Performance: 21.50 (SE +/- 0.15, N = 3)
Schedutil: 21.05 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Pennant

Test: sedovbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
Performance: 25.54 (SE +/- 0.02, N = 3)
Schedutil: 25.60 (SE +/- 0.06, N = 3)
Ondemand: 25.61 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
Performance: 26.39 (SE +/- 0.05, N = 3)
Schedutil: 26.27 (SE +/- 0.02, N = 3)
Ondemand: 25.35 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Redis

Test: SET

Redis 6.0.9 (Requests Per Second, More Is Better)
Performance: 1318380.42 (SE +/- 17244.90, N = 3)
Schedutil: 1313860.03 (SE +/- 16949.35, N = 4)
Ondemand: 1308179.83 (SE +/- 3347.62, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OCRMyPDF

Processing 60 Page PDF Document

OCRMyPDF 9.6.0+dfsg (Seconds, Fewer Is Better)
Performance: 19.28 (SE +/- 0.04, N = 3)
Schedutil: 19.48 (SE +/- 0.11, N = 3)
Ondemand: 19.55 (SE +/- 0.04, N = 3)

dav1d

Video Input: Chimera 1080p

dav1d 0.8.1 (FPS, More Is Better)
Performance: 713.18 (SE +/- 1.61, N = 3; MIN: 554.58 / MAX: 886.76)
Ondemand: 625.43 (SE +/- 1.15, N = 3; MIN: 491.48 / MAX: 770.18)
Schedutil: 536.65 (SE +/- 0.42, N = 3; MIN: 417.73 / MAX: 665.43)
1. (CC) gcc options: -pthread

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
Performance: 5.431 (SE +/- 0.023, N = 3)
Schedutil: 5.379 (SE +/- 0.025, N = 3)
Ondemand: 5.244 (SE +/- 0.049, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Pennant

Test: leblancbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
Performance: 15.91 (SE +/- 0.01, N = 3)
Schedutil: 15.99 (SE +/- 0.05, N = 3)
Ondemand: 16.03 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Redis

Test: GET

Redis 6.0.9 (Requests Per Second, More Is Better)
Performance: 1604323.00 (SE +/- 13020.90, N = 3)
Schedutil: 1599356.29 (SE +/- 26190.66, N = 3)
Ondemand: 1545026.54 (SE +/- 12901.14, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.1 (FPS, More Is Better)
Performance: 264.98 (SE +/- 0.38, N = 3; MIN: 222.23 / MAX: 302.43)
Ondemand: 254.29 (SE +/- 0.06, N = 3; MIN: 211.06 / MAX: 289.79)
Schedutil: 246.86 (SE +/- 0.18, N = 3; MIN: 205.59 / MAX: 285.26)
1. (CC) gcc options: -pthread

Intel Open Image Denoise

Scene: Memorial

Intel Open Image Denoise 1.2.0 (Images / Sec, More Is Better)
Schedutil: 14.26 (SE +/- 0.02, N = 3)
Performance: 14.23 (SE +/- 0.04, N = 3)
Ondemand: 14.19 (SE +/- 0.03, N = 3)

Rodinia

Test: OpenMP Streamcluster

Rodinia 3.1 (Seconds, Fewer Is Better)
Performance: 13.71 (SE +/- 0.02, N = 3)
Schedutil: 13.75 (SE +/- 0.04, N = 3)
Ondemand: 13.77 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

Rodinia 3.1 (Seconds, Fewer Is Better)
Ondemand: 13.24 (SE +/- 0.01, N = 3)
Schedutil: 13.25 (SE +/- 0.01, N = 3)
Performance: 13.33 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
Performance: 47.80 (SE +/- 0.02, N = 3)
Schedutil: 47.63 (SE +/- 0.05, N = 3)
Ondemand: 46.86 (SE +/- 0.14, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x265

Video Input: Bosphorus 1080p

x265 3.4 (Frames Per Second, More Is Better)
Performance: 68.48 (SE +/- 0.12, N = 3)
Ondemand: 65.47 (SE +/- 0.15, N = 3)
Schedutil: 55.07 (SE +/- 0.34, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
Schedutil: 43.04 (SE +/- 0.22, N = 3)
Performance: 42.82 (SE +/- 0.12, N = 3)
Ondemand: 41.02 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

OSPray

Demo: NASA Streamlines - Renderer: SciVis

OSPray 1.8.5 (FPS, More Is Better)
Performance: 33.33 (SE +/- 0.00, N = 3; MIN: 31.25 / MAX: 34.48)
Schedutil: 33.33 (SE +/- 0.00, N = 3; MIN: 31.25)
Ondemand: 33.33 (SE +/- 0.00, N = 3; MIN: 31.25 / MAX: 34.48)

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
Performance: 92.22 (SE +/- 0.08, N = 3)
Ondemand: 87.16 (SE +/- 0.29, N = 3)
Schedutil: 86.89 (SE +/- 0.21, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.1 (FPS, More Is Better)
Performance: 668.50 (SE +/- 0.74, N = 3; MIN: 502.68 / MAX: 730.8)
Ondemand: 592.34 (SE +/- 2.45, N = 3; MIN: 443.85 / MAX: 644.65)
Schedutil: 488.95 (SE +/- 1.06, N = 3; MIN: 390.53 / MAX: 532.25)
1. (CC) gcc options: -pthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

Kvazaar 2.0 (Frames Per Second, More Is Better)
Performance: 161.97 (SE +/- 0.32, N = 3)
Ondemand: 155.97 (SE +/- 0.36, N = 3)
Schedutil: 153.01 (SE +/- 0.28, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x264

H.264 Video Encoding

x264 2019-12-17 (Frames Per Second, More Is Better)
Performance: 176.54 (SE +/- 1.40, N = 3)
Ondemand: 172.64 (SE +/- 1.12, N = 3)
Schedutil: 172.60 (SE +/- 1.39, N = 3)
1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize


Phoronix Test Suite v10.8.5