aug11 AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS) and AMD Radeon RX 5700 8GB on Pop 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2408115-PTS-AUG1122881&sor&gru .
aug11 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution a b c d e AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads) Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS) AMD Starship/Matisse 4 x 32GB DDR4-3000MT/s CMK64GX4M2D3000C16 Samsung SSD 970 EVO Plus 500GB AMD Radeon RX 5700 8GB AMD Navi 10 HDMI Audio DELL P2415Q Intel I211 + Intel Wi-Fi 6 AX200 Pop 22.04 6.8.0-76060800daily20240311-generic (x86_64) GNOME Shell 42.5 X Server 1.21.1.4 4.6 Mesa 24.0.3-1pop1~1711635559~22.04~7a9f319 (LLVM 15.0.7 DRM 3.57) 1.3.274 GCC 11.4.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830107a Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
aug11 simdjson: Kostya simdjson: TopTweet simdjson: LargeRand simdjson: PartialTweets simdjson: DistinctUserID lczero: Eigen mnn: nasnet mnn: mobilenetV3 mnn: squeezenetv1.1 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 build2: Time To Compile y-cruncher: 1B y-cruncher: 500M xnnpack: FP32MobileNetV2 xnnpack: FP32MobileNetV3Large xnnpack: FP32MobileNetV3Small xnnpack: FP16MobileNetV2 xnnpack: FP16MobileNetV3Large xnnpack: FP16MobileNetV3Small xnnpack: QU8MobileNetV2 xnnpack: QU8MobileNetV3Large xnnpack: QU8MobileNetV3Small a b c d e 2.76 4.33 0.96 4.19 4.17 63 15.453 2.425 3.923 33.188 5.654 4.617 4.093 32.083 81.745 19.026 9.412 7134 9402 5141 6029 7166 5205 5137 8184 5432 2.77 4.12 0.95 4.23 4.19 58 15.536 2.447 3.697 33.218 5.668 4.623 4.118 32.66 81.407 19.04 9.434 7140 9429 5218 5977 7170 5031 5061 8010 5176 2.78 4.32 0.96 4.21 4.2 59 15.469 2.411 3.8 32.518 5.615 4.714 4.007 32.127 82.054 19.025 9.39 7167 9560 5239 7573 9308 6398 5124 8150 5345 2.76 4.37 0.95 4.21 4.18 60 15.553 2.395 3.753 31.781 5.639 4.548 3.983 31.507 82.366 18.989 9.407 7056 9222 5015 5849 7078 5088 5136 8142 5204 2.77 4.38 0.96 4.20 4.22 59 15.529 2.421 3.806 32.955 5.668 4.635 4.020 31.979 82.968 18.992 9.408 7035 9254 5048 5928 7138 5113 5818 9202 5818 OpenBenchmarking.org
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: Kostya c e b d a 0.6255 1.251 1.8765 2.502 3.1275 SE +/- 0.01, N = 3 2.78 2.77 2.77 2.76 2.76 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: TopTweet OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: TopTweet e d a c b 0.9855 1.971 2.9565 3.942 4.9275 SE +/- 0.01, N = 3 4.38 4.37 4.33 4.32 4.12 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: LargeRandom e c a d b 0.216 0.432 0.648 0.864 1.08 SE +/- 0.00, N = 3 0.96 0.96 0.96 0.95 0.95 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: PartialTweets b d c e a 0.9518 1.9036 2.8554 3.8072 4.759 SE +/- 0.01, N = 3 4.23 4.21 4.21 4.20 4.19 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: DistinctUserID e c b d a 0.9495 1.899 2.8485 3.798 4.7475 SE +/- 0.00, N = 3 4.22 4.20 4.19 4.18 4.17 1. (CXX) g++ options: -O3 -lrt
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.31.1 Backend: Eigen a d e c b 14 28 42 56 70 SE +/- 0.56, N = 6 63 60 59 59 58 1. (CXX) g++ options: -flto -pthread
Mobile Neural Network Model: nasnet OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: nasnet a c e b d 4 8 12 16 20 SE +/- 0.14, N = 3 15.45 15.47 15.53 15.54 15.55 MIN: 15.16 / MAX: 16.59 MIN: 15.14 / MAX: 17.33 MIN: 14.88 / MAX: 17.41 MIN: 15.18 / MAX: 16.54 MIN: 15.17 / MAX: 17.07 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: mobilenetV3 d c e a b 0.5506 1.1012 1.6518 2.2024 2.753 SE +/- 0.017, N = 3 2.395 2.411 2.421 2.425 2.447 MIN: 2.3 / MAX: 3.02 MIN: 2.26 / MAX: 2.9 MIN: 2.3 / MAX: 2.83 MIN: 2.31 / MAX: 2.71 MIN: 2.35 / MAX: 2.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: squeezenetv1.1 b d c e a 0.8827 1.7654 2.6481 3.5308 4.4135 SE +/- 0.014, N = 3 3.697 3.753 3.800 3.806 3.923 MIN: 3.64 / MAX: 3.85 MIN: 3.69 / MAX: 4.2 MIN: 3.7 / MAX: 4.39 MIN: 3.67 / MAX: 4.73 MIN: 3.78 / MAX: 4.37 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: resnet-v2-50 d c e a b 8 16 24 32 40 SE +/- 0.42, N = 3 31.78 32.52 32.96 33.19 33.22 MIN: 31.33 / MAX: 33.59 MIN: 32.12 / MAX: 35.1 MIN: 31.79 / MAX: 36.41 MIN: 32.48 / MAX: 36.08 MIN: 32.82 / MAX: 35.72 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: SqueezeNetV1.0 c d a b e 1.2753 2.5506 3.8259 5.1012 6.3765 SE +/- 0.055, N = 3 5.615 5.639 5.654 5.668 5.668 MIN: 5.53 / MAX: 6.11 MIN: 5.55 / MAX: 6.13 MIN: 5.51 / MAX: 6.64 MIN: 5.52 / MAX: 5.97 MIN: 5.45 / MAX: 6.24 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: MobileNetV2_224 d a b e c 1.0607 2.1214 3.1821 4.2428 5.3035 SE +/- 0.036, N = 3 4.548 4.617 4.623 4.635 4.714 MIN: 4.41 / MAX: 5.32 MIN: 4.5 / MAX: 4.8 MIN: 4.43 / MAX: 4.94 MIN: 4.42 / MAX: 5.41 MIN: 4.52 / MAX: 5.18 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: mobilenet-v1-1.0 d c e a b 0.9266 1.8532 2.7798 3.7064 4.633 SE +/- 0.057, N = 3 3.983 4.007 4.020 4.093 4.118 MIN: 3.81 / MAX: 4.19 MIN: 3.78 / MAX: 4.7 MIN: 3.76 / MAX: 4.9 MIN: 3.91 / MAX: 4.32 MIN: 3.92 / MAX: 4.32 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: inception-v3 d e a c b 8 16 24 32 40 SE +/- 0.55, N = 3 31.51 31.98 32.08 32.13 32.66 MIN: 30.22 / MAX: 35.17 MIN: 29.68 / MAX: 36.53 MIN: 30.96 / MAX: 34.11 MIN: 31.17 / MAX: 33.89 MIN: 31.62 / MAX: 39.68 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.17 Time To Compile b a c d e 20 40 60 80 100 SE +/- 0.08, N = 3 81.41 81.75 82.05 82.37 82.97
Y-Cruncher Pi Digits To Calculate: 1B OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 1B d e c a b 5 10 15 20 25 SE +/- 0.03, N = 3 18.99 18.99 19.03 19.03 19.04
Y-Cruncher Pi Digits To Calculate: 500M OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 500M c d e a b 3 6 9 12 15 SE +/- 0.004, N = 3 9.390 9.407 9.408 9.412 9.434
XNNPACK Model: FP32MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV2 e d a b c 1500 3000 4500 6000 7500 SE +/- 38.18, N = 3 7035 7056 7134 7140 7167 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Large d e a b c 2K 4K 6K 8K 10K SE +/- 23.99, N = 3 9222 9254 9402 9429 9560 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Small d e a b c 1100 2200 3300 4400 5500 SE +/- 33.35, N = 3 5015 5048 5141 5218 5239 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV2 d e b a c 1600 3200 4800 6400 8000 SE +/- 40.35, N = 3 5849 5928 5977 6029 7573 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Large d e a b c 2K 4K 6K 8K 10K SE +/- 44.77, N = 3 7078 7138 7166 7170 9308 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Small b d e a c 1400 2800 4200 5600 7000 SE +/- 51.00, N = 3 5031 5088 5113 5205 6398 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QU8MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV2 b c d a e 1200 2400 3600 4800 6000 SE +/- 346.41, N = 3 5061 5124 5136 5137 5818 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QU8MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Large b d c a e 2K 4K 6K 8K 10K SE +/- 485.42, N = 3 8010 8142 8150 8184 9202 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QU8MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Small b d c a e 1200 2400 3600 4800 6000 SE +/- 290.26, N = 3 5176 5204 5345 5432 5818 1. (CXX) g++ options: -O3 -lrt -lm
Phoronix Test Suite v10.8.5