new rn tr

AMD Ryzen Threadripper 7980X 64-Cores testing with a System76 Thelio Major (FA Z5 BIOS) and AMD Radeon RX 6700 XT 12GB on Ubuntu 24.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2410161-PTS-NEWRNTR258&sor&grr.

new rn trProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionabcdAMD Ryzen Threadripper 7980X 64-Cores @ 5.37GHz (64 Cores / 128 Threads)System76 Thelio Major (FA Z5 BIOS)AMD Device 14a44 x 32GB DDR5-4800MT/s Micron MTC20F1045S1RC48BA21000GB CT1000T700SSD5AMD Radeon RX 6700 XT 12GBAMD Device 14ccDELL P2415QAquantia AQC113C NBase-T/IEEE + Realtek RTL8125 2.5GbE + Intel Wi-Fi 6EUbuntu 24.106.11.0-8-generic (x86_64)GNOME Shell 47.0X Server + Wayland4.6 Mesa 24.2.3-1ubuntu1 (LLVM 19.1.0 DRM 3.58)GCC 14.2.0ext41920x1200OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate-epp powersave (Boost: Enabled EPP: balance_performance) - CPU Microcode: 0xa108105 Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

new rn trxnnpack: QS8MobileNetV2xnnpack: FP16MobileNetV3Smallxnnpack: FP16MobileNetV3Largexnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV1xnnpack: FP32MobileNetV3Smallxnnpack: FP32MobileNetV3Largexnnpack: FP32MobileNetV2xnnpack: FP32MobileNetV1litert: Quantized COCO SSD MobileNet v1litert: Mobilenet Quantlitert: Inception ResNet V2litert: DeepLab V3litert: Inception V4litert: NASNet Mobileonednn: Recurrent Neural Network Training - CPUonednn: Recurrent Neural Network Inference - CPUlitert: SqueezeNetlitert: Mobilenet Floatonednn: Deconvolution Batch shapes_1d - CPUonednn: IP Shapes 1D - CPUonednn: IP Shapes 3D - CPUonednn: Convolution Batch Shapes Auto - CPUonednn: Deconvolution Batch shapes_3d - CPUabcd4031416456243743203142385962403221197924.8615789.834923.913513.826612.8173455546.398326.2363266.422102.997.460410.5820290.3351410.5576761.029763988430157483804205242716005405321567935.0316068.833651.912117.326574.6156703548.732326.7883303.962141.967.426100.5732700.3378900.5543511.026724025425558273825207042696047405521567672.1316501.234821.711896.526433.2142541548.888325.4483264.062112.357.504050.5773030.3330220.5603901.027433985429458063762205242855932400721597531.4315994.034742.112363.426390.1151275550.540325.7783282.502132.457.489710.5727390.3373840.5544051.02498OpenBenchmarking.org

XNNPACK

Model: QS8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: QS8MobileNetV2dbca9001800270036004500SE +/- 27.10, N = 3SE +/- 32.71, N = 3SE +/- 32.13, N = 339853988402540311. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3Smallacdb9001800270036004500SE +/- 19.50, N = 3SE +/- 72.75, N = 3SE +/- 51.08, N = 341644255429443011. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3Largeabdc12002400360048006000SE +/- 67.87, N = 3SE +/- 39.03, N = 3SE +/- 63.97, N = 356245748580658271. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV2adbc8001600240032004000SE +/- 23.68, N = 3SE +/- 37.65, N = 3SE +/- 48.96, N = 337433762380438251. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV1abdc400800120016002000SE +/- 11.84, N = 3SE +/- 17.89, N = 3SE +/- 19.34, N = 320312052205220701. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3Smallacbd9001800270036004500SE +/- 20.51, N = 3SE +/- 28.17, N = 3SE +/- 34.96, N = 342384269427142851. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3Largedabc13002600390052006500SE +/- 38.74, N = 3SE +/- 43.59, N = 3SE +/- 3.71, N = 359325962600560471. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV2dabc9001800270036004500SE +/- 4.98, N = 3SE +/- 40.70, N = 3SE +/- 22.36, N = 340074032405340551. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV1abcd5001000150020002500SE +/- 28.29, N = 3SE +/- 15.37, N = 3SE +/- 9.40, N = 321192156215621591. (CXX) g++ options: -O3 -lrt -lm

LiteRT

Model: Quantized COCO SSD MobileNet v1

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Quantized COCO SSD MobileNet v1dcab2K4K6K8K10KSE +/- 184.40, N = 12SE +/- 83.25, N = 15SE +/- 199.12, N = 127531.437672.137924.867935.03

LiteRT

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Mobilenet Quantadbc4K8K12K16K20KSE +/- 187.77, N = 3SE +/- 321.23, N = 12SE +/- 268.08, N = 1315789.815994.016068.816501.2

LiteRT

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Inception ResNet V2bdca7K14K21K28K35KSE +/- 242.28, N = 3SE +/- 375.95, N = 3SE +/- 330.78, N = 1533651.934742.134821.734923.9

LiteRT

Model: DeepLab V3

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: DeepLab V3cbda3K6K9K12K15KSE +/- 118.42, N = 3SE +/- 244.20, N = 15SE +/- 50.22, N = 311896.512117.312363.413513.8

LiteRT

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Inception V4dcba6K12K18K24K30KSE +/- 113.50, N = 3SE +/- 209.28, N = 3SE +/- 227.17, N = 1426390.126433.226574.626612.8

LiteRT

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: NASNet Mobilecdba40K80K120K160K200KSE +/- 1575.26, N = 3SE +/- 2554.81, N = 12SE +/- 1880.66, N = 3142541151275156703173455

oneDNN

Harness: Recurrent Neural Network Training - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Recurrent Neural Network Training - Engine: CPUabcd120240360480600SE +/- 0.35, N = 3SE +/- 0.64, N = 3SE +/- 1.00, N = 3546.40548.73548.89550.54MIN: 540.95MIN: 542.36MIN: 542.07MIN: 542.261. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Recurrent Neural Network Inference - Engine: CPUcdab70140210280350SE +/- 0.28, N = 3SE +/- 0.91, N = 3SE +/- 1.83, N = 3325.45325.78326.24326.79MIN: 321.39MIN: 320.56MIN: 322.1MIN: 319.811. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

LiteRT

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: SqueezeNetcadb7001400210028003500SE +/- 34.80, N = 4SE +/- 2.50, N = 3SE +/- 24.67, N = 33264.063266.423282.503303.96

LiteRT

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterLiteRT 2024-10-15Model: Mobilenet Floatacdb5001000150020002500SE +/- 12.05, N = 3SE +/- 14.67, N = 3SE +/- 20.11, N = 32102.992112.352132.452141.96

oneDNN

Harness: Deconvolution Batch shapes_1d - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Deconvolution Batch shapes_1d - Engine: CPUbadc246810SE +/- 0.00222, N = 3SE +/- 0.07017, N = 3SE +/- 0.04488, N = 37.426107.460417.489717.50405MIN: 6.55MIN: 6.52MIN: 4.6MIN: 4.631. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: IP Shapes 1D - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: IP Shapes 1D - Engine: CPUdbca0.1310.2620.3930.5240.655SE +/- 0.004087, N = 3SE +/- 0.003697, N = 3SE +/- 0.000734, N = 30.5727390.5732700.5773030.582029MIN: 0.54MIN: 0.54MIN: 0.54MIN: 0.551. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: IP Shapes 3D - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: IP Shapes 3D - Engine: CPUcadb0.0760.1520.2280.3040.38SE +/- 0.002114, N = 3SE +/- 0.001618, N = 3SE +/- 0.001917, N = 30.3330220.3351410.3373840.337890MIN: 0.31MIN: 0.31MIN: 0.31MIN: 0.311. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Convolution Batch Shapes Auto - Engine: CPUbdac0.12610.25220.37830.50440.6305SE +/- 0.004992, N = 3SE +/- 0.003803, N = 3SE +/- 0.000955, N = 30.5543510.5544050.5576760.560390MIN: 0.51MIN: 0.52MIN: 0.52MIN: 0.521. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.6Harness: Deconvolution Batch shapes_3d - Engine: CPUdbca0.23170.46340.69510.92681.1585SE +/- 0.00347, N = 3SE +/- 0.00042, N = 3SE +/- 0.00238, N = 31.024981.026721.027431.02976MIN: 0.96MIN: 0.97MIN: 0.96MIN: 0.961. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl


Phoronix Test Suite v10.8.5