aug11 AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS) and AMD Radeon RX 5700 8GB on Pop 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2408115-PTS-AUG1122881&rdt&grs .
aug11 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution a b c d e AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads) Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS) AMD Starship/Matisse 4 x 32GB DDR4-3000MT/s CMK64GX4M2D3000C16 Samsung SSD 970 EVO Plus 500GB AMD Radeon RX 5700 8GB AMD Navi 10 HDMI Audio DELL P2415Q Intel I211 + Intel Wi-Fi 6 AX200 Pop 22.04 6.8.0-76060800daily20240311-generic (x86_64) GNOME Shell 42.5 X Server 1.21.1.4 4.6 Mesa 24.0.3-1pop1~1711635559~22.04~7a9f319 (LLVM 15.0.7 DRM 3.57) 1.3.274 GCC 11.4.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830107a Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
aug11 xnnpack: FP16MobileNetV3Large xnnpack: FP16MobileNetV2 xnnpack: FP16MobileNetV3Small lczero: Eigen simdjson: TopTweet mnn: squeezenetv1.1 mnn: resnet-v2-50 xnnpack: FP32MobileNetV3Small xnnpack: FP32MobileNetV3Large mnn: inception-v3 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: mobilenetV3 build2: Time To Compile xnnpack: FP32MobileNetV2 simdjson: DistinctUserID simdjson: LargeRand simdjson: PartialTweets mnn: SqueezeNetV1.0 simdjson: Kostya mnn: nasnet y-cruncher: 500M y-cruncher: 1B xnnpack: QU8MobileNetV3Small xnnpack: QU8MobileNetV3Large xnnpack: QU8MobileNetV2 a b c d e 7166 6029 5205 63 4.33 3.923 33.188 5141 9402 32.083 4.617 4.093 2.425 81.745 7134 4.17 0.96 4.19 5.654 2.76 15.453 9.412 19.026 5432 8184 5137 7170 5977 5031 58 4.12 3.697 33.218 5218 9429 32.66 4.623 4.118 2.447 81.407 7140 4.19 0.95 4.23 5.668 2.77 15.536 9.434 19.04 5176 8010 5061 9308 7573 6398 59 4.32 3.8 32.518 5239 9560 32.127 4.714 4.007 2.411 82.054 7167 4.2 0.96 4.21 5.615 2.78 15.469 9.39 19.025 5345 8150 5124 7078 5849 5088 60 4.37 3.753 31.781 5015 9222 31.507 4.548 3.983 2.395 82.366 7056 4.18 0.95 4.21 5.639 2.76 15.553 9.407 18.989 5204 8142 5136 7138 5928 5113 59 4.38 3.806 32.955 5048 9254 31.979 4.635 4.020 2.421 82.968 7035 4.22 0.96 4.20 5.668 2.77 15.529 9.408 18.992 5818 9202 5818 OpenBenchmarking.org
XNNPACK Model: FP16MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Large a b c d e 2K 4K 6K 8K 10K SE +/- 44.77, N = 3 7166 7170 9308 7078 7138 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV2 a b c d e 1600 3200 4800 6400 8000 SE +/- 40.35, N = 3 6029 5977 7573 5849 5928 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Small a b c d e 1400 2800 4200 5600 7000 SE +/- 51.00, N = 3 5205 5031 6398 5088 5113 1. (CXX) g++ options: -O3 -lrt -lm
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.31.1 Backend: Eigen a b c d e 14 28 42 56 70 SE +/- 0.56, N = 6 63 58 59 60 59 1. (CXX) g++ options: -flto -pthread
simdjson Throughput Test: TopTweet OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: TopTweet a b c d e 0.9855 1.971 2.9565 3.942 4.9275 SE +/- 0.01, N = 3 4.33 4.12 4.32 4.37 4.38 1. (CXX) g++ options: -O3 -lrt
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: squeezenetv1.1 a b c d e 0.8827 1.7654 2.6481 3.5308 4.4135 SE +/- 0.014, N = 3 3.923 3.697 3.800 3.753 3.806 MIN: 3.78 / MAX: 4.37 MIN: 3.64 / MAX: 3.85 MIN: 3.7 / MAX: 4.39 MIN: 3.69 / MAX: 4.2 MIN: 3.67 / MAX: 4.73 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: resnet-v2-50 a b c d e 8 16 24 32 40 SE +/- 0.42, N = 3 33.19 33.22 32.52 31.78 32.96 MIN: 32.48 / MAX: 36.08 MIN: 32.82 / MAX: 35.72 MIN: 32.12 / MAX: 35.1 MIN: 31.33 / MAX: 33.59 MIN: 31.79 / MAX: 36.41 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
XNNPACK Model: FP32MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Small a b c d e 1100 2200 3300 4400 5500 SE +/- 33.35, N = 3 5141 5218 5239 5015 5048 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Large a b c d e 2K 4K 6K 8K 10K SE +/- 23.99, N = 3 9402 9429 9560 9222 9254 1. (CXX) g++ options: -O3 -lrt -lm
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: inception-v3 a b c d e 8 16 24 32 40 SE +/- 0.55, N = 3 32.08 32.66 32.13 31.51 31.98 MIN: 30.96 / MAX: 34.11 MIN: 31.62 / MAX: 39.68 MIN: 31.17 / MAX: 33.89 MIN: 30.22 / MAX: 35.17 MIN: 29.68 / MAX: 36.53 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: MobileNetV2_224 a b c d e 1.0607 2.1214 3.1821 4.2428 5.3035 SE +/- 0.036, N = 3 4.617 4.623 4.714 4.548 4.635 MIN: 4.5 / MAX: 4.8 MIN: 4.43 / MAX: 4.94 MIN: 4.52 / MAX: 5.18 MIN: 4.41 / MAX: 5.32 MIN: 4.42 / MAX: 5.41 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: mobilenet-v1-1.0 a b c d e 0.9266 1.8532 2.7798 3.7064 4.633 SE +/- 0.057, N = 3 4.093 4.118 4.007 3.983 4.020 MIN: 3.91 / MAX: 4.32 MIN: 3.92 / MAX: 4.32 MIN: 3.78 / MAX: 4.7 MIN: 3.81 / MAX: 4.19 MIN: 3.76 / MAX: 4.9 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: mobilenetV3 a b c d e 0.5506 1.1012 1.6518 2.2024 2.753 SE +/- 0.017, N = 3 2.425 2.447 2.411 2.395 2.421 MIN: 2.31 / MAX: 2.71 MIN: 2.35 / MAX: 2.91 MIN: 2.26 / MAX: 2.9 MIN: 2.3 / MAX: 3.02 MIN: 2.3 / MAX: 2.83 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.17 Time To Compile a b c d e 20 40 60 80 100 SE +/- 0.08, N = 3 81.75 81.41 82.05 82.37 82.97
XNNPACK Model: FP32MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV2 a b c d e 1500 3000 4500 6000 7500 SE +/- 38.18, N = 3 7134 7140 7167 7056 7035 1. (CXX) g++ options: -O3 -lrt -lm
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: DistinctUserID a b c d e 0.9495 1.899 2.8485 3.798 4.7475 SE +/- 0.00, N = 3 4.17 4.19 4.20 4.18 4.22 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: LargeRandom a b c d e 0.216 0.432 0.648 0.864 1.08 SE +/- 0.00, N = 3 0.96 0.95 0.96 0.95 0.96 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: PartialTweets a b c d e 0.9518 1.9036 2.8554 3.8072 4.759 SE +/- 0.01, N = 3 4.19 4.23 4.21 4.21 4.20 1. (CXX) g++ options: -O3 -lrt
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: SqueezeNetV1.0 a b c d e 1.2753 2.5506 3.8259 5.1012 6.3765 SE +/- 0.055, N = 3 5.654 5.668 5.615 5.639 5.668 MIN: 5.51 / MAX: 6.64 MIN: 5.52 / MAX: 5.97 MIN: 5.53 / MAX: 6.11 MIN: 5.55 / MAX: 6.13 MIN: 5.45 / MAX: 6.24 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: Kostya a b c d e 0.6255 1.251 1.8765 2.502 3.1275 SE +/- 0.01, N = 3 2.76 2.77 2.78 2.76 2.77 1. (CXX) g++ options: -O3 -lrt
Mobile Neural Network Model: nasnet OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: nasnet a b c d e 4 8 12 16 20 SE +/- 0.14, N = 3 15.45 15.54 15.47 15.55 15.53 MIN: 15.16 / MAX: 16.59 MIN: 15.18 / MAX: 16.54 MIN: 15.14 / MAX: 17.33 MIN: 15.17 / MAX: 17.07 MIN: 14.88 / MAX: 17.41 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Y-Cruncher Pi Digits To Calculate: 500M OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 500M a b c d e 3 6 9 12 15 SE +/- 0.004, N = 3 9.412 9.434 9.390 9.407 9.408
Y-Cruncher Pi Digits To Calculate: 1B OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 1B a b c d e 5 10 15 20 25 SE +/- 0.03, N = 3 19.03 19.04 19.03 18.99 18.99
XNNPACK Model: QU8MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Small a b c d e 1200 2400 3600 4800 6000 SE +/- 290.26, N = 3 5432 5176 5345 5204 5818 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QU8MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Large a b c d e 2K 4K 6K 8K 10K SE +/- 485.42, N = 3 8184 8010 8150 8142 9202 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QU8MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV2 a b c d e 1200 2400 3600 4800 6000 SE +/- 346.41, N = 3 5137 5061 5124 5136 5818 1. (CXX) g++ options: -O3 -lrt -lm
Phoronix Test Suite v10.8.5