odpdfd Tests for a future article. Intel Core Ultra 7 155H testing with a MTL Swift SFG14-72T Coral_MTH (V1.01 BIOS) and Intel Arc MTL 8GB on Ubuntu 24.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2408220-NE-ODPDFD40936&rdt&grs .
odpdfd Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution a b Intel Core Ultra 7 155H @ 4.80GHz (16 Cores / 22 Threads) MTL Swift SFG14-72T Coral_MTH (V1.01 BIOS) Intel Device 7e7f 8 x 2GB LPDDR5-6400MT/s Micron MT62F1G32D2DS-026 1024GB Micron_2550_MTFDKBA1T0TGE Intel Arc MTL 8GB Intel Meteor Lake-P HD Audio Intel Meteor Lake PCH CNVi WiFi Ubuntu 24.04 6.10.0-061000rc4daily20240621-generic (x86_64) GNOME Shell 46.0 X Server 1.21.1.11 + Wayland 4.6 Mesa 24.2~git2406250600.5cb15a~oibaf~n (git-5cb15a6 2024-06-25 noble-oibaf-ppa) GCC 13.2.0 ext4 1920x1200 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x1c - Thermald 2.5.6 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
odpdfd xnnpack: FP32MobileNetV3Large xnnpack: QU8MobileNetV3Small svt-av1: Preset 13 - Beauty 4K 10-bit xnnpack: QU8MobileNetV3Large xnnpack: FP32MobileNetV3Small xnnpack: QU8MobileNetV2 xnnpack: FP16MobileNetV3Small y-cruncher: 500M xnnpack: FP16MobileNetV2 mnn: inception-v3 xnnpack: FP32MobileNetV2 build2: Time To Compile svt-av1: Preset 3 - Beauty 4K 10-bit xnnpack: FP16MobileNetV3Large mnn: squeezenetv1.1 mnn: mobilenetV3 mnn: mobilenet-v1-1.0 y-cruncher: 1B mnn: nasnet mnn: MobileNetV2_224 mnn: resnet-v2-50 svt-av1: Preset 5 - Bosphorus 4K svt-av1: Preset 5 - Beauty 4K 10-bit svt-av1: Preset 3 - Bosphorus 1080p svt-av1: Preset 5 - Bosphorus 1080p whisperfile: Tiny mnn: SqueezeNetV1.0 svt-av1: Preset 13 - Bosphorus 4K simdjson: DistinctUserID svt-av1: Preset 8 - Bosphorus 4K svt-av1: Preset 8 - Beauty 4K 10-bit svt-av1: Preset 8 - Bosphorus 1080p whisperfile: Medium svt-av1: Preset 3 - Bosphorus 4K whisperfile: Small simdjson: PartialTweets etcpak: Multi-Threaded - ETC2 svt-av1: Preset 13 - Bosphorus 1080p simdjson: LargeRand simdjson: TopTweet simdjson: Kostya a b 3459 1525 5.306 2849 1636 2859 1672 27.616 3273 50.032 3096 253.404 0.514 3494 5.391 2.361 5.591 60.743 17.502 4.876 31.777 9.717 2.283 9.193 32.421 94.06066 7.185 91.281 6.77 21.62 3.147 74.535 1273.5925 2.608 443.99641 6.53 319.522 376.407 1.47 6.76 4.18 4497 1881 4.493 3357 1898 3306 1917 25.082 3593 54.737 3382 233.764 0.555 3743 5.763 2.503 5.92 57.405 18.468 5.131 33.42 9.249 2.398 9.621 33.736 97.87499 7.443 88.238 6.57 22.099 3.211 75.897 1255.47275 2.643 438.38991 6.6 316.678 373.828 1.48 6.8 4.17 OpenBenchmarking.org
XNNPACK Model: FP32MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Large a b 1000 2000 3000 4000 5000 3459 4497 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QU8MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Small a b 400 800 1200 1600 2000 1525 1881 1. (CXX) g++ options: -O3 -lrt -lm
SVT-AV1 Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit a b 1.1939 2.3878 3.5817 4.7756 5.9695 5.306 4.493 1. (CXX) g++ options: -march=native -mno-avx
XNNPACK Model: QU8MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Large a b 700 1400 2100 2800 3500 2849 3357 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Small a b 400 800 1200 1600 2000 1636 1898 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: QU8MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV2 a b 700 1400 2100 2800 3500 2859 3306 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Small a b 400 800 1200 1600 2000 1672 1917 1. (CXX) g++ options: -O3 -lrt -lm
Y-Cruncher Pi Digits To Calculate: 500M OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 500M a b 6 12 18 24 30 27.62 25.08
XNNPACK Model: FP16MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV2 a b 800 1600 2400 3200 4000 3273 3593 1. (CXX) g++ options: -O3 -lrt -lm
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: inception-v3 a b 12 24 36 48 60 50.03 54.74 MIN: 41.95 / MAX: 67.9 MIN: 43.91 / MAX: 257.36 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
XNNPACK Model: FP32MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV2 a b 700 1400 2100 2800 3500 3096 3382 1. (CXX) g++ options: -O3 -lrt -lm
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.17 Time To Compile a b 60 120 180 240 300 253.40 233.76
SVT-AV1 Encoder Mode: Preset 3 - Input: Beauty 4K 10-bit OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 3 - Input: Beauty 4K 10-bit a b 0.1249 0.2498 0.3747 0.4996 0.6245 0.514 0.555 1. (CXX) g++ options: -march=native -mno-avx
XNNPACK Model: FP16MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Large a b 800 1600 2400 3200 4000 3494 3743 1. (CXX) g++ options: -O3 -lrt -lm
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: squeezenetv1.1 a b 1.2967 2.5934 3.8901 5.1868 6.4835 5.391 5.763 MIN: 2.96 / MAX: 26.05 MIN: 2.87 / MAX: 33.67 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: mobilenetV3 a b 0.5632 1.1264 1.6896 2.2528 2.816 2.361 2.503 MIN: 2.28 / MAX: 3.14 MIN: 2.17 / MAX: 4.88 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: mobilenet-v1-1.0 a b 1.332 2.664 3.996 5.328 6.66 5.591 5.920 MIN: 5.19 / MAX: 11.13 MIN: 4.67 / MAX: 27.91 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Y-Cruncher Pi Digits To Calculate: 1B OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 1B a b 14 28 42 56 70 60.74 57.41
Mobile Neural Network Model: nasnet OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: nasnet a b 5 10 15 20 25 17.50 18.47 MIN: 16.4 / MAX: 38.49 MIN: 15.72 / MAX: 42.55 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: MobileNetV2_224 a b 1.1545 2.309 3.4635 4.618 5.7725 4.876 5.131 MIN: 4.39 / MAX: 24.98 MIN: 4.03 / MAX: 15.71 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: resnet-v2-50 a b 8 16 24 32 40 31.78 33.42 MIN: 29.12 / MAX: 48.94 MIN: 27.43 / MAX: 67.42 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
SVT-AV1 Encoder Mode: Preset 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 5 - Input: Bosphorus 4K a b 3 6 9 12 15 9.717 9.249 1. (CXX) g++ options: -march=native -mno-avx
SVT-AV1 Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit a b 0.5396 1.0792 1.6188 2.1584 2.698 2.283 2.398 1. (CXX) g++ options: -march=native -mno-avx
SVT-AV1 Encoder Mode: Preset 3 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 3 - Input: Bosphorus 1080p a b 3 6 9 12 15 9.193 9.621 1. (CXX) g++ options: -march=native -mno-avx
SVT-AV1 Encoder Mode: Preset 5 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 5 - Input: Bosphorus 1080p a b 8 16 24 32 40 32.42 33.74 1. (CXX) g++ options: -march=native -mno-avx
Whisperfile Model Size: Tiny OpenBenchmarking.org Seconds, Fewer Is Better Whisperfile 20Aug24 Model Size: Tiny a b 20 40 60 80 100 94.06 97.87
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 2.9.b11b7037d Model: SqueezeNetV1.0 a b 2 4 6 8 10 7.185 7.443 MIN: 6.93 / MAX: 28.57 MIN: 5.98 / MAX: 29.85 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 13 - Input: Bosphorus 4K a b 20 40 60 80 100 91.28 88.24 1. (CXX) g++ options: -march=native -mno-avx
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: DistinctUserID a b 2 4 6 8 10 6.77 6.57 1. (CXX) g++ options: -O3 -lrt
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 8 - Input: Bosphorus 4K a b 5 10 15 20 25 21.62 22.10 1. (CXX) g++ options: -march=native -mno-avx
SVT-AV1 Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit a b 0.7225 1.445 2.1675 2.89 3.6125 3.147 3.211 1. (CXX) g++ options: -march=native -mno-avx
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 8 - Input: Bosphorus 1080p a b 20 40 60 80 100 74.54 75.90 1. (CXX) g++ options: -march=native -mno-avx
Whisperfile Model Size: Medium OpenBenchmarking.org Seconds, Fewer Is Better Whisperfile 20Aug24 Model Size: Medium a b 300 600 900 1200 1500 1273.59 1255.47
SVT-AV1 Encoder Mode: Preset 3 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 3 - Input: Bosphorus 4K a b 0.5947 1.1894 1.7841 2.3788 2.9735 2.608 2.643 1. (CXX) g++ options: -march=native -mno-avx
Whisperfile Model Size: Small OpenBenchmarking.org Seconds, Fewer Is Better Whisperfile 20Aug24 Model Size: Small a b 100 200 300 400 500 444.00 438.39
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: PartialTweets a b 2 4 6 8 10 6.53 6.60 1. (CXX) g++ options: -O3 -lrt
Etcpak Benchmark: Multi-Threaded - Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 2.0 Benchmark: Multi-Threaded - Configuration: ETC2 a b 70 140 210 280 350 319.52 316.68 1. (CXX) g++ options: -flto -pthread
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 13 - Input: Bosphorus 1080p a b 80 160 240 320 400 376.41 373.83 1. (CXX) g++ options: -march=native -mno-avx
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: LargeRandom a b 0.333 0.666 0.999 1.332 1.665 1.47 1.48 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: TopTweet OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: TopTweet a b 2 4 6 8 10 6.76 6.80 1. (CXX) g++ options: -O3 -lrt
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: Kostya a b 0.9405 1.881 2.8215 3.762 4.7025 4.18 4.17 1. (CXX) g++ options: -O3 -lrt
Phoronix Test Suite v10.8.5