3700x march AMD Ryzen 7 3700X 8-Core testing with a Gigabyte A320M-S2H-CF (F52a BIOS) and HIS AMD Radeon HD 7750/8740 / R7 250E 1GB on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2204019-NE-3700XMARC38&grr&sor .
3700x march Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution A B C AMD Ryzen 7 3700X 8-Core @ 3.60GHz (8 Cores / 16 Threads) Gigabyte A320M-S2H-CF (F52a BIOS) AMD Starship/Matisse 8GB 240GB TOSHIBA RC100 HIS AMD Radeon HD 7750/8740 / R7 250E 1GB AMD Oland/Hainan/Cape VA2431 Realtek RTL8111/8168/8411 Ubuntu 20.04 5.11.0-38-generic (x86_64) GNOME Shell 3.36.4 X Server 1.20.9 4.5 Mesa 20.0.8 (LLVM 10.0.0) GCC 9.4.0 ext4 1920x1080 4.5 Mesa 21.2.6 (LLVM 12.0.0) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8701021 Java Details - OpenJDK Runtime Environment (build 11.0.14.1+1-Ubuntu-0ubuntu1.20.04) Python Details - Python 3.8.10 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
3700x march build-gem5: Time To Compile java-jmh: Throughput toktx: UASTC 4 + Zstd Compression 19 ospray: particle_volume/pathtracer/real_time ospray: particle_volume/scivis/real_time rodinia: OpenMP LavaMD gcrypt: ospray: particle_volume/ao/real_time tachyon: Total Time ospray-studio: 3 - 1080p - 32 - Path Tracer libgav1: Chimera 1080p 10-bit ospray: gravity_spheres_volume/dim_512/pathtracer/real_time ospray-studio: 3 - 1080p - 1 - Path Tracer ospray-studio: 2 - 1080p - 32 - Path Tracer ospray-studio: 1 - 1080p - 1 - Path Tracer ospray-studio: 1 - 1080p - 32 - Path Tracer ospray-studio: 2 - 1080p - 1 - Path Tracer rodinia: OpenMP Leukocyte ospray: gravity_spheres_volume/dim_512/scivis/real_time ospray: gravity_spheres_volume/dim_512/ao/real_time aom-av1: Speed 6 Two-Pass - Bosphorus 4K build-linux-kernel: defconfig rodinia: OpenMP HotSpot3D astcenc: Exhaustive ospray-studio: 3 - 1080p - 16 - Path Tracer onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU kvazaar: Bosphorus 4K - Medium ospray-studio: 2 - 1080p - 16 - Path Tracer svt-av1: Preset 4 - Bosphorus 4K ospray-studio: 1 - 1080p - 16 - Path Tracer aom-av1: Speed 6 Realtime - Bosphorus 1080p rocksdb: Update Rand rocksdb: Read Rand Write Rand rocksdb: Rand Read rocksdb: Read While Writing libgav1: Summer Nature 4K libgav1: Chimera 1080p aom-av1: Speed 6 Realtime - Bosphorus 4K srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: OFDM_Test kvazaar: Bosphorus 4K - Very Fast srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM rodinia: OpenMP Streamcluster perf-bench: Epoll Wait perf-bench: Futex Hash perf-bench: Futex Lock-Pi aom-av1: Speed 6 Two-Pass - Bosphorus 1080p etcpak: ETC2 svt-av1: Preset 8 - Bosphorus 4K rodinia: OpenMP CFD Solver svt-av1: Preset 4 - Bosphorus 1080p srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM aom-av1: Speed 8 Realtime - Bosphorus 4K onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU kvazaar: Bosphorus 4K - Ultra Fast srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM toktx: Zstd Compression 19 perf-bench: Memcpy 1MB toktx: UASTC 3 + Zstd Compression 19 aom-av1: Speed 9 Realtime - Bosphorus 4K aom-av1: Speed 10 Realtime - Bosphorus 4K etcpak: ETC1 + Dithering srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM libgav1: Summer Nature 1080p etcpak: ETC1 kvazaar: Bosphorus 1080p - Medium onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU svt-av1: Preset 10 - Bosphorus 4K perf-bench: Sched Pipe astcenc: Thorough svt-av1: Preset 12 - Bosphorus 4K toktx: UASTC 3 onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU draco: Church Facade kvazaar: Bosphorus 1080p - Very Fast openjpeg: NASA Curiosity Panorama M34 perf-bench: Memset 1MB svt-av1: Preset 8 - Bosphorus 1080p draco: Lion onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU aom-av1: Speed 8 Realtime - Bosphorus 1080p astcenc: Medium aom-av1: Speed 9 Realtime - Bosphorus 1080p aom-av1: Speed 10 Realtime - Bosphorus 1080p perf-bench: Syscall Basic kvazaar: Bosphorus 1080p - Ultra Fast etcpak: DXT1 svt-av1: Preset 10 - Bosphorus 1080p onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU toktx: Zstd Compression 9 svt-av1: Preset 12 - Bosphorus 1080p onednn: IP Shapes 1D - bf16bf16bf16 - CPU A B C 527.795 15971827325.799 340.447 166.469 11.209 240.61 203.974 11.8457 164.5427 148288 57.63 2.02104 4472 128042 3721 125344 3815 124.859 1.25609 1.30538 8.29 96.139 88.953 83.4929 77460 4000.15 3960.3 3978.05 2793.39 2768.09 2780.14 7.52 67031 2.211 65883 8.38 438705 1451343 40413130 1685693 65.55 169.41 13.57 158.1 387.1 125000000 16.92 146.4 354.4 33.878 70515 4824413 1022 23.98 179.383 26.163 22.75 7.203 58.9 105 38.8 6.30423 3.63064 29.02 167.1 386.6 19.569 13.592719 18.338 51.95 54.8 288.656 153.6 358.9 223.9 305.018 37.71 6.67346 2.50697 42.919 344927 9.1383 48.365 12.56 4.95672 1.27343 10.248 2.52331 7531 70.07 62855 66.394612 94.87 4957 23.0464 21.3262 102.81 5.0545 127.73 122.26 20651833 126.81 1329.935 209.889 6.73567 4.61914 2.545 374.172 511.637 16016773575.675 340.135 167.46 11.2413 239.251 204.773 11.8107 164.4666 148470 57.95 2.01811 4461 128064 3715 125761 3810 124.749 1.25248 1.30382 8.31 95.675 87.572 83.4223 77272 3989.49 3969.06 3995.75 2829.23 2807.28 2785.84 7.53 67130 2.233 65777 8.47 441448 1456612 38836394 1694715 65.98 168.9 13.49 157.6 385.7 124200000 16.97 146 354.3 33.731 76990 5141654 1027 24.15 179.572 26.309 23.991 7.28 59.2 105.6 39.2 6.55418 3.64345 28.97 167.8 383.6 19.719 13.921405 18.396 54.5 55.72 285.141 159.4 359.5 224.06 306.517 37.92 5.38239 2.51867 43.585 349100 9.1454 47.78 12.78 4.96916 1.28282 10.4932 2.53097 7609 69.61 62242 69.370794 95.243 4980 23.1473 21.4534 103.63 5.401 123.79 132.23 20221808 127.71 1350.363 208.721 6.75419 4.63708 2.582 370.421 504.2 15985465912.414 340.007 167.683 11.2682 239.986 205.106 11.8137 164.2377 148448 57.56 2.02204 4459 128171 3718 125186 3809 124.244 1.25364 1.3133 8.29 96.405 90.817 83.402 77238 3976.52 3997.84 3977.63 2834.58 2814.1 2842.52 7.52 66999 2.235 65552 8.47 441787 1442100 40469857 1709393 65.69 167.87 13.53 158.2 385.3 124600000 16.97 141 354.9 31.155 78006 5121390 1026 24.18 179.367 26.438 22.699 7.21 59.1 105.4 39.27 6.35941 3.64286 29.03 168.8 387.7 19.411 13.589469 18.333 54.86 52.82 286.024 159 357.9 222.17 302.705 37.78 5.38075 2.51488 43.356 345671 9.1352 49.797 12.765 5.00068 1.26382 10.8503 2.65863 7677 69.96 61979 66.398269 95.449 4947 23.1748 21.7906 102.23 5.1039 124.53 127.1 20599176 126.62 1335.559 211.49 6.76793 4.61019 2.746 368.321 OpenBenchmarking.org
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 21.2 Time To Compile C B A 110 220 330 440 550 504.20 511.64 527.80
Java JMH Throughput OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput B C A 3000M 6000M 9000M 12000M 15000M 16016773575.68 15985465912.41 15971827325.80
KTX-Software toktx Settings: UASTC 4 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 4 + Zstd Compression 19 C B A 70 140 210 280 350 340.01 340.14 340.45
OSPray Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/pathtracer/real_time C B A 40 80 120 160 200 167.68 167.46 166.47
OSPray Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/scivis/real_time C B A 3 6 9 12 15 11.27 11.24 11.21
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD B C A 50 100 150 200 250 239.25 239.99 240.61 1. (CXX) g++ options: -O2 -lOpenCL
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 A B C 50 100 150 200 250 203.97 204.77 205.11 1. (CC) gcc options: -O2 -fvisibility=hidden
OSPray Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: particle_volume/ao/real_time A C B 3 6 9 12 15 11.85 11.81 11.81
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99.2 Total Time C B A 40 80 120 160 200 164.24 164.47 164.54 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A C B 30K 60K 90K 120K 150K 148288 148448 148470 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Chimera 1080p 10-bit B A C 13 26 39 52 65 57.95 57.63 57.56 1. (CXX) g++ options: -O3 -lpthread -lrt
OSPray Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time C A B 0.455 0.91 1.365 1.82 2.275 2.02204 2.02104 2.01811
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer C B A 1000 2000 3000 4000 5000 4459 4461 4472 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer A B C 30K 60K 90K 120K 150K 128042 128064 128171 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer B C A 800 1600 2400 3200 4000 3715 3718 3721 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer C A B 30K 60K 90K 120K 150K 125186 125344 125761 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer C B A 800 1600 2400 3200 4000 3809 3810 3815 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte C B A 30 60 90 120 150 124.24 124.75 124.86 1. (CXX) g++ options: -O2 -lOpenCL
OSPray Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time A C B 0.2826 0.5652 0.8478 1.1304 1.413 1.25609 1.25364 1.25248
OSPray Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPray 2.9 Benchmark: gravity_spheres_volume/dim_512/ao/real_time C A B 0.2955 0.591 0.8865 1.182 1.4775 1.31330 1.30538 1.30382
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K B C A 2 4 6 8 10 8.31 8.29 8.29 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.16 Build: defconfig B A C 20 40 60 80 100 95.68 96.14 96.41
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D B A C 20 40 60 80 100 87.57 88.95 90.82 1. (CXX) g++ options: -O2 -lOpenCL
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Exhaustive C B A 20 40 60 80 100 83.40 83.42 83.49 1. (CXX) g++ options: -O3 -flto -pthread
OSPray Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer C B A 17K 34K 51K 68K 85K 77238 77272 77460 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU C B A 900 1800 2700 3600 4500 3976.52 3989.49 4000.15 MIN: 3968.89 MIN: 3980 MIN: 3984.76 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU A B C 900 1800 2700 3600 4500 3960.30 3969.06 3997.84 MIN: 3946.69 MIN: 3963.33 MIN: 3986.46 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU C A B 900 1800 2700 3600 4500 3977.63 3978.05 3995.75 MIN: 3967.61 MIN: 3963.46 MIN: 3983.37 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU A B C 600 1200 1800 2400 3000 2793.39 2829.23 2834.58 MIN: 2780.33 MIN: 2801.68 MIN: 2818.46 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU A B C 600 1200 1800 2400 3000 2768.09 2807.28 2814.10 MIN: 2742.44 MIN: 2785.2 MIN: 2783.96 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU A B C 600 1200 1800 2400 3000 2780.14 2785.84 2842.52 MIN: 2760.3 MIN: 2768.25 MIN: 2827.85 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Medium B C A 2 4 6 8 10 7.53 7.52 7.52 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OSPray Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer C A B 14K 28K 42K 56K 70K 66999 67031 67130 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 4 - Input: Bosphorus 4K C B A 0.5029 1.0058 1.5087 2.0116 2.5145 2.235 2.233 2.211 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
OSPray Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer OpenBenchmarking.org ms, Fewer Is Better OSPray Studio 0.10 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer C B A 14K 28K 42K 56K 70K 65552 65777 65883 1. (CXX) g++ options: -O3 -lm -ldl -lpthread -pthread
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p C B A 2 4 6 8 10 8.47 8.47 8.38 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Facebook RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Update Random C B A 90K 180K 270K 360K 450K 441787 441448 438705 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read Random Write Random B A C 300K 600K 900K 1200K 1500K 1456612 1451343 1442100 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Random Read C A B 9M 18M 27M 36M 45M 40469857 40413130 38836394 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 7.0.1 Test: Read While Writing C B A 400K 800K 1200K 1600K 2000K 1709393 1694715 1685693 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Summer Nature 4K B C A 15 30 45 60 75 65.98 65.69 65.55 1. (CXX) g++ options: -O3 -lpthread -lrt
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Chimera 1080p A B C 40 80 120 160 200 169.41 168.90 167.87 1. (CXX) g++ options: -O3 -lpthread -lrt
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K A C B 3 6 9 12 15 13.57 13.53 13.49 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM C A B 30 60 90 120 150 158.2 158.1 157.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM A B C 80 160 240 320 400 387.1 385.7 385.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.10 Test: OFDM_Test A C B 30M 60M 90M 120M 150M 125000000 124600000 124200000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Very Fast C B A 4 8 12 16 20 16.97 16.97 16.92 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM A B C 30 60 90 120 150 146.4 146.0 141.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM C A B 80 160 240 320 400 354.9 354.4 354.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster C B A 8 16 24 32 40 31.16 33.73 33.88 1. (CXX) g++ options: -O2 -lOpenCL
perf-bench Benchmark: Epoll Wait OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Epoll Wait C B A 20K 40K 60K 80K 100K 78006 76990 70515 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
perf-bench Benchmark: Futex Hash OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Hash B C A 1.1M 2.2M 3.3M 4.4M 5.5M 5141654 5121390 4824413 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
perf-bench Benchmark: Futex Lock-Pi OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Futex Lock-Pi B C A 200 400 600 800 1000 1027 1026 1022 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p C B A 6 12 18 24 30 24.18 24.15 23.98 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 B A C 40 80 120 160 200 179.57 179.38 179.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 8 - Input: Bosphorus 4K C B A 6 12 18 24 30 26.44 26.31 26.16 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver C A B 6 12 18 24 30 22.70 22.75 23.99 1. (CXX) g++ options: -O2 -lOpenCL
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 4 - Input: Bosphorus 1080p B C A 2 4 6 8 10 7.280 7.210 7.203 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM B C A 13 26 39 52 65 59.2 59.1 58.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM B C A 20 40 60 80 100 105.6 105.4 105.0 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K C B A 9 18 27 36 45 39.27 39.20 38.80 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU A C B 2 4 6 8 10 6.30423 6.35941 6.55418 MIN: 5.99 MIN: 6.03 MIN: 6.02 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU A C B 0.8198 1.6396 2.4594 3.2792 4.099 3.63064 3.64286 3.64345 MIN: 3.47 MIN: 3.46 MIN: 3.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast C A B 7 14 21 28 35 29.03 29.02 28.97 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM C B A 40 80 120 160 200 168.8 167.8 167.1 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM C A B 80 160 240 320 400 387.7 386.6 383.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
KTX-Software toktx Settings: Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 19 C A B 5 10 15 20 25 19.41 19.57 19.72
perf-bench Benchmark: Memcpy 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memcpy 1MB B A C 4 8 12 16 20 13.92 13.59 13.59 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
KTX-Software toktx Settings: UASTC 3 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 + Zstd Compression 19 C A B 5 10 15 20 25 18.33 18.34 18.40
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K C B A 12 24 36 48 60 54.86 54.50 51.95 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K B A C 13 26 39 52 65 55.72 54.80 52.82 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering A C B 60 120 180 240 300 288.66 286.02 285.14 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM B C A 40 80 120 160 200 159.4 159.0 153.6 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.10 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM B A C 80 160 240 320 400 359.5 358.9 357.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.17 Video Input: Summer Nature 1080p B A C 50 100 150 200 250 224.06 223.90 222.17 1. (CXX) g++ options: -O3 -lpthread -lrt
Etcpak Configuration: ETC1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 B A C 70 140 210 280 350 306.52 305.02 302.71 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Medium B C A 9 18 27 36 45 37.92 37.78 37.71 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU C B A 2 4 6 8 10 5.38075 5.38239 6.67346 MIN: 5.25 MIN: 5.25 MIN: 5.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU A C B 0.5667 1.1334 1.7001 2.2668 2.8335 2.50697 2.51488 2.51867 MIN: 2.47 MIN: 2.48 MIN: 2.47 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 10 - Input: Bosphorus 4K B C A 10 20 30 40 50 43.59 43.36 42.92 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
perf-bench Benchmark: Sched Pipe OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Sched Pipe B C A 70K 140K 210K 280K 350K 349100 345671 344927 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Thorough C A B 3 6 9 12 15 9.1352 9.1383 9.1454 1. (CXX) g++ options: -O3 -flto -pthread
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 12 - Input: Bosphorus 4K C A B 11 22 33 44 55 49.80 48.37 47.78 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
KTX-Software toktx Settings: UASTC 3 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 A C B 3 6 9 12 15 12.56 12.77 12.78
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU A B C 1.1252 2.2504 3.3756 4.5008 5.626 4.95672 4.96916 5.00068 MIN: 4.84 MIN: 4.87 MIN: 4.92 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU C A B 0.2886 0.5772 0.8658 1.1544 1.443 1.26382 1.27343 1.28282 MIN: 1.19 MIN: 1.19 MIN: 1.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU A B C 3 6 9 12 15 10.25 10.49 10.85 MIN: 9.82 MIN: 10.24 MIN: 10.66 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU A B C 0.5982 1.1964 1.7946 2.3928 2.991 2.52331 2.53097 2.65863 MIN: 2.47 MIN: 2.48 MIN: 2.61 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.0 Model: Church Facade A B C 1600 3200 4800 6400 8000 7531 7609 7677 1. (CXX) g++ options: -O3
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Very Fast A C B 16 32 48 64 80 70.07 69.96 69.61 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenJPEG Encode: NASA Curiosity Panorama M34 OpenBenchmarking.org ms, Fewer Is Better OpenJPEG 2.4 Encode: NASA Curiosity Panorama M34 C B A 13K 26K 39K 52K 65K 61979 62242 62855 1. (CXX) g++ options: -rdynamic
perf-bench Benchmark: Memset 1MB OpenBenchmarking.org GB/sec, More Is Better perf-bench Benchmark: Memset 1MB B C A 15 30 45 60 75 69.37 66.40 66.39 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 8 - Input: Bosphorus 1080p C B A 20 40 60 80 100 95.45 95.24 94.87 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.5.0 Model: Lion C A B 1100 2200 3300 4400 5500 4947 4957 4980 1. (CXX) g++ options: -O3
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU A B C 6 12 18 24 30 23.05 23.15 23.17 MIN: 22.76 MIN: 22.87 MIN: 23 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU A B C 5 10 15 20 25 21.33 21.45 21.79 MIN: 21.17 MIN: 21.31 MIN: 21.31 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p B A C 20 40 60 80 100 103.63 102.81 102.23 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.2 Preset: Medium A C B 1.2152 2.4304 3.6456 4.8608 6.076 5.0545 5.1039 5.4010 1. (CXX) g++ options: -O3 -flto -pthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p A C B 30 60 90 120 150 127.73 124.53 123.79 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.3 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p B C A 30 60 90 120 150 132.23 127.10 122.26 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
perf-bench Benchmark: Syscall Basic OpenBenchmarking.org ops/sec, More Is Better perf-bench Benchmark: Syscall Basic A C B 4M 8M 12M 16M 20M 20651833 20599176 20221808 1. (CC) gcc options: -O6 -ggdb3 -funwind-tables -std=gnu99 -lunwind-x86_64 -lunwind -llzma -Xlinker -lpthread -lrt -lm -ldl -lelf -lcrypto -lslang -lz -lnuma
Kvazaar Video Input: Bosphorus 1080p - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast B A C 30 60 90 120 150 127.71 126.81 126.62 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 B C A 300 600 900 1200 1500 1350.36 1335.56 1329.94 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
SVT-AV1 Encoder Mode: Preset 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 10 - Input: Bosphorus 1080p C A B 50 100 150 200 250 211.49 209.89 208.72 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU A B C 2 4 6 8 10 6.73567 6.75419 6.76793 MIN: 6.64 MIN: 6.66 MIN: 6.63 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.6 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU C A B 1.0433 2.0866 3.1299 4.1732 5.2165 4.61019 4.61914 4.63708 MIN: 4.45 MIN: 4.43 MIN: 4.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -lpthread -ldl
KTX-Software toktx Settings: Zstd Compression 9 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 9 A B C 0.6179 1.2358 1.8537 2.4716 3.0895 2.545 2.582 2.746
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.9 Encoder Mode: Preset 12 - Input: Bosphorus 1080p A B C 80 160 240 320 400 374.17 370.42 368.32 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Phoronix Test Suite v10.8.5