luajit-1.1.0run

2 x Intel Xeon E5-2620 v2 testing with a ASUS Z9PE-D8 WS (5503 BIOS) and ASPEED on CentOS Stream 9 via the Phoronix Test Suite.

debug:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-467.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

NoGVNO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-467.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

NoGVNO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-467.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptNoSimplO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-474.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptNoSimplO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-474.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptSimplO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-474.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptSimplO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-474.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

2 x Intel Xeon E5-2620 v2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-474.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptRedO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-474.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptRedO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-474.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptPREO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-480.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

GVNO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-480.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

NewGVNO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-480.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

OptPREO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 2000GB Western Digital WD20EARX-00P + 256GB Samsung SSD 850, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-480.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.4.1 20231218 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

NewGVNO333:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-496.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

GVNO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-496.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

PessimisticO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-503.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

PessimisticO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-503.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

PessimisticNewGVNO2:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-503.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

PessimisticNewGVNO3:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-503.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

NewGVNO2-debug:

  Processor: 2 x Intel Xeon E5-2620 v2 @ 2.60GHz (12 Cores / 24 Threads), Motherboard: ASUS Z9PE-D8 WS (5503 BIOS), Chipset: Intel Xeon E7 v2/Xeon, Memory: 32GB, Disk: 256GB Samsung SSD 850 + 2000GB Western Digital WD20EARX-00P, Graphics: ASPEED, Audio: Realtek ALC898, Monitor: ASUS VW190, Network: 2 x Intel 82574L

  OS: CentOS Stream 9, Kernel: 5.14.0-514.el9.x86_64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, Compiler: GCC 11.5.0 20240719 + PGI Compiler 16.10-0 + LLVM 3.1 + CUDA 11.2, File-System: ext4, Screen Resolution: 1024x768

LuaJIT 2.1-git
Test: Fast Fourier Transform
Mflops > Higher Is Better
debug ..................... 229.20 |===========================================
NoGVNO2 ................... 227.63 |==========================================
NoGVNO3 ................... 227.45 |==========================================
OptNoSimplO2 .............. 225.26 |==========================================
OptNoSimplO3 .............. 230.90 |===========================================
OptSimplO2 ................ 231.05 |===========================================
OptSimplO3 ................ 229.77 |===========================================
2 x Intel Xeon E5-2620 v2 . 228.45 |===========================================
OptRedO2 .................. 227.21 |==========================================
OptRedO3 .................. 227.58 |==========================================
GVNO2 ..................... 229.33 |===========================================
NewGVNO2 .................. 229.25 |===========================================
NewGVNO333 ................ 229.18 |===========================================
GVNO3 ..................... 227.90 |==========================================
PessimisticO3 ............. 225.93 |==========================================
PessimisticO2 ............. 228.73 |===========================================
PessimisticNewGVNO2 ....... 229.66 |===========================================
PessimisticNewGVNO3 ....... 216.88 |========================================
NewGVNO2-debug ............ 228.87 |===========================================

LuaJIT 2.1-git
Test: Monte Carlo
Mflops > Higher Is Better
debug ..................... 297.98 |==========================================
NoGVNO2 ................... 302.55 |===========================================
NoGVNO3 ................... 302.09 |===========================================
OptNoSimplO2 .............. 295.62 |==========================================
OptNoSimplO3 .............. 302.96 |===========================================
OptSimplO2 ................ 305.49 |===========================================
OptSimplO3 ................ 299.86 |==========================================
2 x Intel Xeon E5-2620 v2 . 301.23 |==========================================
OptRedO2 .................. 302.90 |===========================================
OptRedO3 .................. 304.24 |===========================================
GVNO2 ..................... 297.40 |==========================================
NewGVNO2 .................. 297.86 |==========================================
NewGVNO333 ................ 296.82 |==========================================
GVNO3 ..................... 298.48 |==========================================
PessimisticO3 ............. 303.78 |===========================================
PessimisticO2 ............. 296.47 |==========================================
PessimisticNewGVNO2 ....... 291.04 |=========================================
PessimisticNewGVNO3 ....... 291.41 |=========================================
NewGVNO2-debug ............ 301.00 |==========================================

LuaJIT 2.1-git
Test: Dense LU Matrix Factorization
Mflops > Higher Is Better
debug ..................... 1555.47 |==========================================
NoGVNO2 ................... 1555.95 |==========================================
NoGVNO3 ................... 1553.37 |=========================================
OptNoSimplO2 .............. 1531.41 |=========================================
OptNoSimplO3 .............. 1550.45 |=========================================
OptSimplO2 ................ 1562.30 |==========================================
OptSimplO3 ................ 1543.77 |=========================================
2 x Intel Xeon E5-2620 v2 . 1572.78 |==========================================
OptRedO2 .................. 1558.36 |==========================================
OptRedO3 .................. 1558.40 |==========================================
GVNO2 ..................... 1556.06 |==========================================
NewGVNO2 .................. 1561.37 |==========================================
NewGVNO333 ................ 1552.92 |=========================================
GVNO3 ..................... 1555.55 |==========================================
PessimisticO3 ............. 1554.55 |==========================================
PessimisticO2 ............. 1559.52 |==========================================
PessimisticNewGVNO2 ....... 1539.78 |=========================================
PessimisticNewGVNO3 ....... 1549.74 |=========================================
NewGVNO2-debug ............ 1546.60 |=========================================

LuaJIT 2.1-git
Test: Composite
Mflops > Higher Is Better
debug ..................... 795.47 |===========================================
NoGVNO2 ................... 796.27 |===========================================
NoGVNO3 ................... 794.58 |===========================================
OptNoSimplO2 .............. 784.78 |==========================================
OptNoSimplO3 .............. 794.40 |===========================================
OptSimplO2 ................ 800.65 |===========================================
OptSimplO3 ................ 789.74 |==========================================
2 x Intel Xeon E5-2620 v2 . 800.61 |===========================================
OptRedO2 .................. 796.71 |===========================================
OptRedO3 .................. 797.24 |===========================================
GVNO2 ..................... 795.78 |===========================================
NewGVNO2 .................. 797.41 |===========================================
NewGVNO333 ................ 792.36 |===========================================
GVNO3 ..................... 795.47 |===========================================
PessimisticO3 ............. 795.92 |===========================================
PessimisticO2 ............. 797.15 |===========================================
PessimisticNewGVNO2 ....... 787.70 |==========================================
PessimisticNewGVNO3 ....... 788.42 |==========================================
NewGVNO2-debug ............ 790.26 |==========================================

LuaJIT 2.1-git
Test: Jacobi Successive Over-Relaxation
Mflops > Higher Is Better
debug ..................... 1106.10 |==========================================
NoGVNO2 ................... 1104.44 |==========================================
NoGVNO3 ................... 1102.40 |==========================================
OptNoSimplO2 .............. 1094.35 |=========================================
OptNoSimplO3 .............. 1103.36 |==========================================
OptSimplO2 ................ 1114.37 |==========================================
OptSimplO3 ................ 1097.86 |=========================================
2 x Intel Xeon E5-2620 v2 . 1111.43 |==========================================
OptRedO2 .................. 1107.42 |==========================================
OptRedO3 .................. 1106.92 |==========================================
GVNO2 ..................... 1107.79 |==========================================
NewGVNO2 .................. 1108.74 |==========================================
NewGVNO333 ................ 1100.63 |=========================================
GVNO3 ..................... 1107.95 |==========================================
PessimisticO3 ............. 1109.16 |==========================================
PessimisticO2 ............. 1112.69 |==========================================
PessimisticNewGVNO2 ....... 1098.67 |=========================================
PessimisticNewGVNO3 ....... 1102.33 |==========================================
NewGVNO2-debug ............ 1096.08 |=========================================

LuaJIT 2.1-git
Test: Sparse Matrix Multiply
Mflops > Higher Is Better
debug ..................... 788.60 |===========================================
NoGVNO2 ................... 790.79 |===========================================
NoGVNO3 ................... 787.58 |===========================================
OptNoSimplO2 .............. 777.26 |==========================================
OptNoSimplO3 .............. 784.30 |===========================================
OptSimplO2 ................ 790.05 |===========================================
OptSimplO3 ................ 777.44 |==========================================
2 x Intel Xeon E5-2620 v2 . 789.19 |===========================================
OptRedO2 .................. 787.64 |===========================================
OptRedO3 .................. 789.07 |===========================================
GVNO2 ..................... 788.31 |===========================================
NewGVNO2 .................. 789.83 |===========================================
NewGVNO333 ................ 782.25 |===========================================
GVNO3 ..................... 787.46 |===========================================
PessimisticO3 ............. 786.20 |===========================================
PessimisticO2 ............. 788.34 |===========================================
PessimisticNewGVNO2 ....... 779.35 |==========================================
PessimisticNewGVNO3 ....... 781.71 |===========================================
NewGVNO2-debug ............ 778.74 |==========================================

LuaJIT 2.1-git
Test Install Size
Bytes < Lower Is Better
NewGVNO2-debug . 10456 |=======================================================
NewGVNO2-debug . 10444 |=======================================================