haeglnano4

ARMv8 Cortex-A78E testing with a EDK II 4.1-33958178 and NVIDIA Tegra Orin on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2406157-NE-HAEGLNANO21
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
stress-ng
June 14
  13 Minutes
stream
June 14
  4 Minutes
storage
June 14
 
M.2
June 14
  52 Minutes
stream-2
June 15
  4 Minutes
cuda
June 15
  59 Minutes
Invert Behavior (Only Show Selected Data)
  22 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


haeglnano4ProcessorMotherboardMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverVulkanCompilerFile-SystemScreen ResolutionOpenGLstress-ngstreamstorageM.2stream-2cudaARMv8 Cortex-A78E @ 1.51GHz (6 Cores)EDK II 4.1-339581783584MB256GB TS256GMTE712A-I + 32GB USB EDC H 3SE3simple27B3HMIntel Device 0d9f + Realtek RTL8111/8168/8411 + Intel-AC 9260Ubuntu 20.045.10.120-tegra (aarch64)GNOME Shell 3.36.9X Server 1.20.13NVIDIA1.3.212GCC 9.4.0 + CUDA 11.4ext41920x10804.6.0256GB TS256GMTE712A-I + 128GB DX2NVIDIA Tegra OrinVA2732-FHDNVIDIA 35.4.1OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- Scaling Governor: tegra194 schedutilSecurity Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 but not BHB + srbds: Not affected + tsx_async_abort: Not affected Disk Details- storage, M.2: NONE / relatime,rw / Block Size: 4096

haeglnano4cuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To Zerofio: Seq Read - Linux AIO - Yes - 8MB - 1 - Default Test Directoryfio: Seq Read - Linux AIO - Yes - 8MB - 1 - Default Test Directoryfio: Seq Write - Linux AIO - Yes - 8MB - 1 - Default Test Directoryfio: Seq Write - Linux AIO - Yes - 8MB - 1 - Default Test Directorystream: Copystream: Scalestream: Triadstream: Addstress-ng: Cryptostress-ng: Forkingstress-ng: CPU Stressstress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Memory Copyingstress-ng: Glibc C String Functionsstress-ng: Glibc Qsort Data Sortingstress-ngstreamstorageM.2stream-2cuda7458.356270.38874.5415502.089649.49907.161460639.0662.3722005.021677.721480.821283.32222276130316122378.321907.222187.921881.712.83119.34119.99214.55815.138OpenBenchmarking.org

CUDA Mini-Nbody

The CUDA version of Harrism's mini-nbody tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Originalcuda3691215SE +/- 0.01, N = 312.83

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache Blockingcuda510152025SE +/- 0.00, N = 319.34

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop Unrollingcuda510152025SE +/- 0.00, N = 319.99

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data Layoutcuda48121620SE +/- 0.00, N = 314.56

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To Zerocuda48121620SE +/- 0.01, N = 315.14

Flexible IO Tester

FIO, the Flexible I/O Tester, is an advanced Linux disk benchmark supporting multiple I/O engines and a wealth of options. FIO was written by Jens Axboe for testing of the Linux I/O subsystem and schedulers. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Sequential Read - Engine: Linux AIO - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: Default Test DirectoryM.25001000150020002500SE +/- 18.21, N = 322221. (CC) gcc options: -rdynamic -lrt -lz -lpthread -lm -laio -ldl -std=gnu99 -ffast-math -include -O3 -fcommon

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Sequential Read - Engine: Linux AIO - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: Default Test DirectoryM.2601201802403002761. (CC) gcc options: -rdynamic -lrt -lz -lpthread -lm -laio -ldl -std=gnu99 -ffast-math -include -O3 -fcommon

OpenBenchmarking.orgMB/s, More Is BetterFlexible IO Tester 3.36Type: Sequential Write - Engine: Linux AIO - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: Default Test DirectoryM.230060090012001500SE +/- 113.91, N = 1513031. (CC) gcc options: -rdynamic -lrt -lz -lpthread -lm -laio -ldl -std=gnu99 -ffast-math -include -O3 -fcommon

OpenBenchmarking.orgIOPS, More Is BetterFlexible IO Tester 3.36Type: Sequential Write - Engine: Linux AIO - Direct: Yes - Block Size: 8MB - Job Count: 1 - Disk Target: Default Test DirectoryM.24080120160200SE +/- 14.21, N = 151611. (CC) gcc options: -rdynamic -lrt -lz -lpthread -lm -laio -ldl -std=gnu99 -ffast-math -include -O3 -fcommon

Stream

This is a benchmark of Stream, the popular system memory (RAM) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copystreamstream-25K10K15K20K25KSE +/- 145.70, N = 5SE +/- 80.31, N = 522005.022378.31. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scalestreamstream-25K10K15K20K25KSE +/- 104.68, N = 5SE +/- 146.89, N = 521677.721907.21. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triadstreamstream-25K10K15K20K25KSE +/- 117.43, N = 5SE +/- 198.22, N = 521480.822187.91. (CC) gcc options: -O3 -march=native -fopenmp

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Addstreamstream-25K10K15K20K25KSE +/- 147.62, N = 5SE +/- 146.16, N = 521283.321881.71. (CC) gcc options: -O3 -march=native -fopenmp

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Cryptostress-ng16003200480064008000SE +/- 4.34, N = 37458.351. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Forkingstress-ng13002600390052006500SE +/- 85.52, N = 36270.381. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: CPU Stressstress-ng2004006008001000SE +/- 1.05, N = 3874.541. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Matrix Mathstress-ng3K6K9K12K15KSE +/- 7.52, N = 315502.081. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Vector Mathstress-ng2K4K6K8K10KSE +/- 3.50, N = 39649.491. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory Copyingstress-ng2004006008001000SE +/- 0.46, N = 3907.161. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Glibc C String Functionsstress-ng300K600K900K1200K1500KSE +/- 507.51, N = 31460639.061. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Glibc Qsort Data Sortingstress-ng1428425670SE +/- 0.02, N = 362.371. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

21 Results Shown

CUDA Mini-Nbody:
  Original
  Cache Blocking
  Loop Unrolling
  SOA Data Layout
  Flush Denormals To Zero
Flexible IO Tester:
  Seq Read - Linux AIO - Yes - 8MB - 1 - Default Test Directory:
    MB/s
    IOPS
  Seq Write - Linux AIO - Yes - 8MB - 1 - Default Test Directory:
    MB/s
    IOPS
Stream:
  Copy
  Scale
  Triad
  Add
Stress-NG:
  Crypto
  Forking
  CPU Stress
  Matrix Math
  Vector Math
  Memory Copying
  Glibc C String Functions
  Glibc Qsort Data Sorting