TNN

TNN is an open-source deep learning reasoning framework developed by Tencent.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark tnn.

Project Site

github.com

Test Created

24 September 2020

Last Updated

18 June 2021

Test Maintainer

Michael Larabel

Test Type

System

Average Install Time

37 Seconds

Average Run Time

1 Minute, 11 Seconds

Test Dependencies

CMake + C/C++ Compiler Toolchain

Accolades

10k+ Downloads

Supported Platforms

* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 9 November 2024.

Revision History

pts/tnn-1.1.0 [View Source] Fri, 18 Jun 2021 07:29:44 GMT
Update against TNN 0.3 upstream release.

pts/tnn-1.0.1 [View Source] Mon, 11 Jan 2021 13:15:50 GMT
Update download mirror as the GitHub URL changed its checksums...

pts/tnn-1.0.0 [View Source] Thu, 24 Sep 2020 18:33:29 GMT
Initial commit of Tencent TNN framework.

Suites Using This Test

Machine Learning

HPC - High Performance Computing

Performance Metrics

Analyze Test Configuration:

TNN 0.3

Target: CPU - Model: DenseNet

OpenBenchmarking.org metrics for this test profile configuration based on 1,025 public results since 18 June 2021 with the latest data as of 6 September 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component

Percentile Rank

# Compatible Public Results

ms (Average)

AMD Ryzen 9 7950X3D 16-Core

95th

2050 ^{+/- 41}

AMD Ryzen 9 7900X 12-Core

94th

2055 ^{+/- 10}

AMD Ryzen 7 7700X 8-Core

94th

2062 ^{+/- 92}

AMD Ryzen 7 7700 8-Core

94th

2068 ^{+/- 2}

AMD Ryzen 9 7900 12-Core

93rd

2108 ^{+/- 4}

AMD Ryzen 5 7600X 6-Core

92nd

2138 ^{+/- 19}

AMD Ryzen 5 7600 6-Core

92nd

2234 ^{+/- 4}

AMD Ryzen 9 5950X 16-Core

85th

2431 ^{+/- 37}

Intel Core i5-12400

83rd

2460 ^{+/- 283}

AMD Ryzen 9 5900X 12-Core

81st

2531 ^{+/- 24}

AMD Ryzen Threadripper 3960X 24-Core

78th

2589 ^{+/- 82}

AMD Ryzen 7 7840U

76th

2598

AMD Ryzen 7 5800X3D 8-Core

76th

2602 ^{+/- 18}

Mid-Tier

75th

> 2603

AMD Ryzen 9 3950X 16-Core

75th

2609 ^{+/- 56}

AMD Ryzen 9 5900HX

75th

2618 ^{+/- 58}

AMD Ryzen 5 5600X 6-Core

75th

2634 ^{+/- 36}

AMD Ryzen Threadripper 3970X 32-Core

73rd

2655 ^{+/- 4}

AMD Ryzen 7 5800X 8-Core

72nd

2674 ^{+/- 256}

Intel Core i9-11900K

71st

2707 ^{+/- 21}

AMD Ryzen 7 5700G

68th

2741 ^{+/- 27}

AMD Ryzen 9 3900X 12-Core

66th

2787 ^{+/- 6}

AMD EPYC 74F3 24-Core

66th

2805 ^{+/- 46}

AMD Ryzen Threadripper 3990X 64-Core

66th

2824 ^{+/- 93}

2 x AMD EPYC 75F3 32-Core

65th

2836 ^{+/- 62}

AMD Ryzen Threadripper 2950X 16-Core

65th

2841 ^{+/- 112}

Intel Core i9-10900K

64th

2847 ^{+/- 43}

AMD Ryzen 9 PRO 3900 12-Core

64th

2849

AMD Ryzen 5 5600G

64th

2849 ^{+/- 20}

AMD EPYC 75F3 32-Core

62nd

2868 ^{+/- 3}

Ampere Altra ARMv8 Neoverse-N1 160-Core

61st

2877 ^{+/- 165}

AMD Ryzen 7 3800XT 8-Core

61st

2883 ^{+/- 6}

AMD Ryzen 7 PRO 3700 8-Core

60th

2884

AMD Ryzen Threadripper 2990WX 32-Core

60th

2893 ^{+/- 33}

Intel Xeon E-2388G

60th

2900

AMD Ryzen 7 PRO 6850U

58th

2922 ^{+/- 113}

2 x AMD EPYC 7373X 16-Core

56th

2976 ^{+/- 4}

AMD EPYC 7543 32-Core

56th

2984 ^{+/- 94}

AMD Ryzen 7 3700X 8-Core

56th

2987 ^{+/- 39}

2 x AMD EPYC 7742 64-Core

55th

2995 ^{+/- 15}

AMD EPYC 7373X 16-Core

55th

3009 ^{+/- 2}

AMD Ryzen 9 3900XT 12-Core

54th

3014 ^{+/- 16}

AMD Ryzen Threadripper 2970WX 24-Core

53rd

3018 ^{+/- 32}

AMD EPYC 7742 64-Core

51st

3058 ^{+/- 1}

Median

50th

3066

Intel Xeon E-2278G

50th

3069

AMD Ryzen 5 5500

49th

3101 ^{+/- 271}

Intel Core i5-13400

49th

3101 ^{+/- 8}

Intel Core i9-9900KS

49th

3103 ^{+/- 17}

AMD Ryzen 7 PRO 5850U

49th

3104 ^{+/- 110}

AMD Ryzen 5 3600XT 6-Core

48th

3120 ^{+/- 11}

AMD EPYC 7F52 16-Core

48th

3152 ^{+/- 6}

2 x AMD EPYC 7763 64-Core

47th

3165 ^{+/- 10}

Intel Core i5-11600K

47th

3207 ^{+/- 226}

Intel Xeon E-2288G

47th

3221 ^{+/- 79}

Intel Core i5-10600K

44th

3313 ^{+/- 58}

2 x AMD EPYC 7601 32-Core

44th

3331 ^{+/- 8}

AMD Ryzen 3 3300X 4-Core

42nd

3407 ^{+/- 31}

Intel Core i7-8086K

42nd

3412 ^{+/- 13}

Intel Core i7-8700K

39th

3475 ^{+/- 12}

2 x Intel Xeon Platinum 8362

39th

3477 ^{+/- 5}

Intel Core i9-10980XE

39th

3490 ^{+/- 35}

AMD EPYC 7F32 8-Core

37th

3516 ^{+/- 3}

AMD Ryzen 7 2700X Eight-Core

37th

3519 ^{+/- 31}

2 x Intel Xeon Gold 6346

36th

3558 ^{+/- 9}

2 x Intel Xeon Gold 6342

35th

3590 ^{+/- 2}

AMD Ryzen 7 2700 Eight-Core

34th

3655 ^{+/- 3}

AMD EPYC 7551 32-Core

33rd

3674 ^{+/- 4}

Intel Xeon Gold 6346

33rd

3677 ^{+/- 3}

Intel Xeon Gold 6342

33rd

3687 ^{+/- 1}

Intel Xeon Platinum 8380

32nd

3693 ^{+/- 7}

AMD EPYC 72F3 8-Core

32nd

3703 ^{+/- 296}

Intel Core i7-7740K

32nd

3707 ^{+/- 10}

AMD Ryzen 5 3400G

32nd

3728 ^{+/- 9}

2 x Intel Xeon Gold 5220R

30th

3806 ^{+/- 18}

Intel Core i7-6800K

30th

3837 ^{+/- 3}

2 x Intel Xeon Platinum 8280

29th

3845 ^{+/- 265}

Intel Xeon Gold 6226R

28th

3956 ^{+/- 1}

Intel Core i7-5960X

26th

4008 ^{+/- 5}

AMD Ryzen 7 4700U

26th

4027 ^{+/- 59}

Low-Tier

25th

> 4035

Intel Core i3-10100

25th

4049 ^{+/- 109}

2 x Intel Xeon Platinum 8380

25th

4063 ^{+/- 404}

Intel Xeon E3-1231 v3

25th

4081 ^{+/- 12}

Intel Core i7-4770K

25th

4085 ^{+/- 80}

Intel Core i5-9400F

23rd

4201 ^{+/- 49}

Intel Core i7-1165G7

23rd

4210 ^{+/- 219}

Intel Xeon E3-1275 v6

23rd

4257 ^{+/- 5}

Intel Core i5-8400

23rd

4258 ^{+/- 90}

Intel Core i7-10700T

22nd

4275 ^{+/- 50}

Intel Core i5-7600K

22nd

4290 ^{+/- 7}

Intel Core i7-5775C

21st

4319 ^{+/- 19}

AMD Ryzen 5 2400G

21st

4365 ^{+/- 483}

Intel Xeon E3-1270 v5

18th

4467 ^{+/- 26}

AMD Ryzen 5 4500U

18th

4503 ^{+/- 67}

Intel Xeon E3-1280 v5

17th

4523 ^{+/- 5}

Intel Core i7-3770K

17th

4565 ^{+/- 5}

Intel Celeron G6900

16th

4612 ^{+/- 494}

Intel Xeon Silver 4216

15th

4738 ^{+/- 8}

Intel Xeon E3-1245 v5

15th

4743 ^{+/- 12}

Intel Core i7-1185G7

15th

4864 ^{+/- 445}

Intel Core i7-2700K

15th

4873 ^{+/- 44}

Intel Core i3-8100

14th

4897 ^{+/- 26}

Intel Core i7-8550U

14th

4999 ^{+/- 7}

Intel Core i7-8565U

13th

5058 ^{+/- 84}

Intel Core i5-6500

13th

5128 ^{+/- 3}

Apple M2

12th

5201 ^{+/- 268}

Intel Core i7-1065G7

11th

5275 ^{+/- 160}

Apple M1

9th

5520 ^{+/- 27}

AMD Athlon 3000G

8th

5818 ^{+/- 65}

Intel Core i3-7100

8th

5886 ^{+/- 88}

Intel Xeon E3-1235L v5

7th

6179 ^{+/- 5}

AMD Ryzen 3 3200U

7th

6188 ^{+/- 41}

ARMv8 Cortex-A76 4-Core

7th

6358 ^{+/- 949}

Rockchip ARMv8 Cortex-A76 4-Core

6th

7343 ^{+/- 28}

AMD FX-8150 Eight-Core

5th

7748 ^{+/- 8}

Intel Core i7-3517U

5th

8347 ^{+/- 9}

Intel Core i5-6300U

4th

8548 ^{+/- 10}

ARMv8 Cortex-A72 4-Core

2nd

20666 ^{+/- 1638}

Based on OpenBenchmarking.org data, the selected test / test configuration (TNN 0.3 - Target: CPU - Model: DenseNet) has an average run-time of 15 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.1%.

Does It Scale Well With Increasing Cores?

No, based on the automated analysis of the collected public benchmark data, this test / test settings does not generally scale well with increasing CPU core counts. Data based on publicly available results for this test / test settings, separated by vendor, result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor for each CPU having a sufficient number of test samples and statistically significant data.

Notable Instruction Set Usage

Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.

Instruction Set

Support

Instructions Detected

SSE2 (SSE2)

Used by default on supported hardware.

PUNPCKLQDQ MOVDQA MOVDQU PSRLDQ CVTSS2SD CVTSD2SS MOVAPD ADDSD UCOMISD SQRTSD MULSD DIVSD COMISD MOVD UNPCKLPD CVTPD2PS PSHUFD PMULUDQ PADDQ CVTDQ2PS CVTSI2SD CVTTSD2SI MOVUPD MAXSD XORPD SUBSD CVTTPS2DQ CVTPS2PD UNPCKHPD MULPD ANDPD CMPNLESD ANDNPD ORPD ADDPD SUBPD CVTDQ2PD CMPLEPD CVTTPD2DQ

Advanced Vector Extensions (AVX)

Requires passing a supported compiler/build flag (verified with targets: sandybridge, skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Sandy Bridge (2011).
Found on AMD processors since Bulldozer (2011).

VZEROUPPER VEXTRACTF128 VINSERTF128 VPERMILPS VPERM2F128 VBROADCASTSS VMASKMOVPS VBROADCASTSD

Advanced Vector Extensions 2 (AVX2)

Requires passing a supported compiler/build flag (verified with targets: skylake, tigerlake, cascadelake, sapphirerapids, alderlake, znver2, znver3).
Found on Intel processors since Haswell (2013).
Found on AMD processors since Excavator (2016).

VEXTRACTI128 VPERMD VPBROADCASTQ VPERM2I128 VPBROADCASTD VPERMQ VINSERTI128 VPGATHERDD VPMASKMOVD VPBROADCASTW VPBROADCASTB

FMA (FMA)

VFMADD132PS VFNMADD132SS VFMADD231SS VFMADD231PS VFNMADD132PS VFMADD132SS VFMADD132SD VFMSUB231SD VFNMSUB231SD VFNMSUB132SD VFMADD132PD VFNMADD132PD VFMADD213PS VFMADD213SS VFNMADD231SD VFNMADD132SD VFMADD231PD VFMSUB231PS VFMSUB231SS VFNMADD231PD VFMADD231SD VFMSUB132SS VFMADD213SD

Advanced Vector Extensions 512 (AVX512)

Requires passing a supported compiler/build flag (verified with targets: cascadelake, sapphirerapids).

(ZMM REGISTER USE)

The test / benchmark does honor compiler flag changes.

Last automated analysis: 18 January 2022

This test profile binary relies on the shared libraries libTNN.so.0, libm.so.6, libc.so.6, libgomp.so.1.

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture

Kernel Identifier

Verified On

Intel / AMD x86 64-bit

x86_64

(Many Processors)

RISC-V 64-bit

riscv64

SiFive RISC-V

Loongson LoongArch 64-bit

loongarch64

Loongson-3A5000LL, Loongson-3C5000

ARMv7 32-bit

armv7l

ARMv7 Cortex-A72 4-Core, ARMv7 rev 1 4-Core

ARMv8 64-bit

aarch64

ARMv8 Cortex-A57 4-Core, ARMv8 Cortex-A72 16-Core, ARMv8 Cortex-A72 4-Core, ARMv8 Cortex-A76 4-Core, ARMv8 Cortex-A78E 6-Core, ARMv8 Neoverse-N1, ARMv8 Neoverse-N1 32-Core, ARMv8 Neoverse-N1 64-Core, ARMv8 Neoverse-V1, Ampere ARMv8 Neoverse-N1 256-Core, Ampere Altra ARMv8 Neoverse-N1 160-Core, Apple M1, Apple M2, HiSilicon TSV110, Phytium D2000, Rockchip ARMv8 Cortex-A76 4-Core, Snapdragon

102424machinelearningtest 1 System - 342 Benchmark Results	Intel Core i9-12900K - ASUS PRIME Z790-V AX - Intel Raptor Lake-S PCH Ubuntu 24.04 - 6.8.0-47-generic - GNOME Shell 46.0
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2 2 Systems - 195 Benchmark Results
tnn 0905 1 System - 4 Benchmark Results	Unknown - LOONGSON Dabieshan Loongson-TC542F0 - Loongson LLC Hyper Transport Bridge Anolis OS 23.1 - 6.6.25-2.1.an23.loongarch64 - X Server 1.20.14
tnn 0905 1 System - 4 Benchmark Results	HiSilicon TSV110 - Huawei TaiShan 200 - Huawei HiSilicon Ubuntu 22.04 - 5.15.0-119-generic - 1.3.255
H610-i312100-1 1 System - 334 Benchmark Results	Intel Core i3-12100 - ASRock H610M-HDV/M.2 R2.0 - Intel Device 7aa7 Ubuntu 20.04 - 5.15.0-89-generic - GNOME Shell 3.36.9
core-i5-12500 1 System - 275 Benchmark Results	Intel Core i5-12500 - ASUS PRIME Z690M-HZ - Intel Alder Lake-S PCH Debian 12 - 6.1.0-21-amd64 - GCC 12.2.0

TNN

Project Site

Test Created

Last Updated

Test Maintainer

Test Type

Average Install Time

Average Run Time

Test Dependencies

Accolades

Supported Platforms

Revision History

Suites Using This Test

Machine Learning

HPC - High Performance Computing

Performance Metrics

TNN 0.3

Target: CPU - Model: DenseNet

Does It Scale Well With Increasing Cores?

Notable Instruction Set Usage

Tested CPU Architectures

Recent Test Results

Compare

102424machinelearningtest

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

m600_7940hs-96gb-5600mhz-16gb-igpu-performance-tpd-2tb-sn850x-2023-09-01-2

tnn 0905

tnn 0905

H610-i312100-1

core-i5-12500