169 lines
8.1 KiB
Plaintext
169 lines
8.1 KiB
Plaintext
--------------------------------------------------------------------------------
|
|
CPU name: Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz
|
|
CPU type: Intel Cascadelake SP processor
|
|
CPU clock: 2.49 GHz
|
|
--------------------------------------------------------------------------------
|
|
Parameters:
|
|
Force field: lj
|
|
Kernel: plain-C
|
|
Data layout: AoS
|
|
Floating-point precision: double
|
|
Unit cells (nx, ny, nz): 32, 32, 32
|
|
Domain box sizes (x, y, z): 5.374708e+01, 5.374708e+01, 5.374708e+01
|
|
Periodic (x, y, z): 1, 1, 1
|
|
Lattice size: 1.679596e+00
|
|
Epsilon: 1.000000e+00
|
|
Sigma: 1.000000e+00
|
|
Spring constant: 1.000000e+00
|
|
Damping constant: 1.000000e+00
|
|
Temperature: 1.440000e+00
|
|
RHO: 8.442000e-01
|
|
Mass: 1.000000e+00
|
|
Number of types: 4
|
|
Number of timesteps: 200
|
|
Report stats every (timesteps): 100
|
|
Reneighbor every (timesteps): 20
|
|
Prune every (timesteps): 1000
|
|
Output positions every (timesteps): 20
|
|
Output velocities every (timesteps): 5
|
|
Delta time (dt): 5.000000e-03
|
|
Cutoff radius: 2.500000e+00
|
|
Skin: 3.000000e-01
|
|
Half neighbor lists: 0
|
|
Processor frequency (GHz): 2.0000
|
|
----------------------------------------------------------------------------
|
|
step temp pressure
|
|
0 1.440000e+00 1.215639e+00
|
|
100 8.200895e-01 6.923143e-01
|
|
200 7.961495e-01 6.721043e-01
|
|
----------------------------------------------------------------------------
|
|
System: 131072 atoms 47265 ghost atoms, Steps: 200
|
|
TOTAL 11.50s FORCE 5.28s NEIGH 5.91s REST 0.31s
|
|
----------------------------------------------------------------------------
|
|
Performance: 2.28 million atom updates per second
|
|
Statistics:
|
|
Vector width: 8, Processor frequency: 2.0000 GHz
|
|
Average neighbors per atom: 76.0352
|
|
Average SIMD iterations per atom: 9.9181
|
|
Total number of computed pair interactions: 2003182862
|
|
Total number of SIMD iterations: 261297661
|
|
Useful read data volume for force computation: 57.46GB
|
|
Cycles/SIMD iteration: 40.4432
|
|
--------------------------------------------------------------------------------
|
|
Region force, Group 1: MEM_DP
|
|
+-------------------+------------+
|
|
| Region Info | HWThread 0 |
|
|
+-------------------+------------+
|
|
| RDTSC Runtime [s] | 5.115807 |
|
|
| call count | 201 |
|
|
+-------------------+------------+
|
|
|
|
+------------------------------------------+---------+-------------+
|
|
| Event | Counter | HWThread 0 |
|
|
+------------------------------------------+---------+-------------+
|
|
| INSTR_RETIRED_ANY | FIXC0 | 12592470000 |
|
|
| CPU_CLK_UNHALTED_CORE | FIXC1 | 10196910000 |
|
|
| CPU_CLK_UNHALTED_REF | FIXC2 | 12746120000 |
|
|
| PWR_PKG_ENERGY | PWR0 | 307.9429 |
|
|
| PWR_DRAM_ENERGY | PWR3 | 0 |
|
|
| FP_ARITH_INST_RETIRED_128B_PACKED_DOUBLE | PMC0 | 0 |
|
|
| FP_ARITH_INST_RETIRED_SCALAR_DOUBLE | PMC1 | 79042240 |
|
|
| FP_ARITH_INST_RETIRED_256B_PACKED_DOUBLE | PMC2 | 0 |
|
|
| FP_ARITH_INST_RETIRED_512B_PACKED_DOUBLE | PMC3 | 8076039000 |
|
|
| CAS_COUNT_RD | MBOX0C0 | 22734550 |
|
|
| CAS_COUNT_WR | MBOX0C1 | 1147714 |
|
|
| CAS_COUNT_RD | MBOX1C0 | 22755180 |
|
|
| CAS_COUNT_WR | MBOX1C1 | 1144415 |
|
|
| CAS_COUNT_RD | MBOX2C0 | 22762780 |
|
|
| CAS_COUNT_WR | MBOX2C1 | 1129051 |
|
|
| CAS_COUNT_RD | MBOX3C0 | 22905660 |
|
|
| CAS_COUNT_WR | MBOX3C1 | 1143324 |
|
|
| CAS_COUNT_RD | MBOX4C0 | 22914860 |
|
|
| CAS_COUNT_WR | MBOX4C1 | 1169116 |
|
|
| CAS_COUNT_RD | MBOX5C0 | 22890220 |
|
|
| CAS_COUNT_WR | MBOX5C1 | 1180739 |
|
|
+------------------------------------------+---------+-------------+
|
|
|
|
+-----------------------------------+------------+
|
|
| Metric | HWThread 0 |
|
|
+-----------------------------------+------------+
|
|
| Runtime (RDTSC) [s] | 5.1158 |
|
|
| Runtime unhalted [s] | 4.0885 |
|
|
| Clock [MHz] | 1995.2508 |
|
|
| CPI | 0.8098 |
|
|
| Energy [J] | 307.9429 |
|
|
| Power [W] | 60.1944 |
|
|
| Energy DRAM [J] | 0 |
|
|
| Power DRAM [W] | 0 |
|
|
| DP [MFLOP/s] | 12644.6041 |
|
|
| AVX DP [MFLOP/s] | 12629.1535 |
|
|
| Packed [MUOPS/s] | 1578.6442 |
|
|
| Scalar [MUOPS/s] | 15.4506 |
|
|
| Memory read bandwidth [MBytes/s] | 1713.4438 |
|
|
| Memory read data volume [GBytes] | 8.7656 |
|
|
| Memory write bandwidth [MBytes/s] | 86.5003 |
|
|
| Memory write data volume [GBytes] | 0.4425 |
|
|
| Memory bandwidth [MBytes/s] | 1799.9442 |
|
|
| Memory data volume [GBytes] | 9.2082 |
|
|
| Operational intensity | 7.0250 |
|
|
+-----------------------------------+------------+
|
|
|
|
Region reneighbour, Group 1: MEM_DP
|
|
+-------------------+------------+
|
|
| Region Info | HWThread 0 |
|
|
+-------------------+------------+
|
|
| RDTSC Runtime [s] | 5.897385 |
|
|
| call count | 10 |
|
|
+-------------------+------------+
|
|
|
|
+------------------------------------------+---------+-------------+
|
|
| Event | Counter | HWThread 0 |
|
|
+------------------------------------------+---------+-------------+
|
|
| INSTR_RETIRED_ANY | FIXC0 | 18212540000 |
|
|
| CPU_CLK_UNHALTED_CORE | FIXC1 | 11728500000 |
|
|
| CPU_CLK_UNHALTED_REF | FIXC2 | 14660630000 |
|
|
| PWR_PKG_ENERGY | PWR0 | 338.9000 |
|
|
| PWR_DRAM_ENERGY | PWR3 | 0 |
|
|
| FP_ARITH_INST_RETIRED_128B_PACKED_DOUBLE | PMC0 | 0 |
|
|
| FP_ARITH_INST_RETIRED_SCALAR_DOUBLE | PMC1 | 6240402000 |
|
|
| FP_ARITH_INST_RETIRED_256B_PACKED_DOUBLE | PMC2 | 0 |
|
|
| FP_ARITH_INST_RETIRED_512B_PACKED_DOUBLE | PMC3 | 983040 |
|
|
| CAS_COUNT_RD | MBOX0C0 | 2086787 |
|
|
| CAS_COUNT_WR | MBOX0C1 | 1115626 |
|
|
| CAS_COUNT_RD | MBOX1C0 | 2089964 |
|
|
| CAS_COUNT_WR | MBOX1C1 | 1117021 |
|
|
| CAS_COUNT_RD | MBOX2C0 | 2103832 |
|
|
| CAS_COUNT_WR | MBOX2C1 | 1117965 |
|
|
| CAS_COUNT_RD | MBOX3C0 | 2086930 |
|
|
| CAS_COUNT_WR | MBOX3C1 | 1102471 |
|
|
| CAS_COUNT_RD | MBOX4C0 | 2094688 |
|
|
| CAS_COUNT_WR | MBOX4C1 | 1103018 |
|
|
| CAS_COUNT_RD | MBOX5C0 | 2097438 |
|
|
| CAS_COUNT_WR | MBOX5C1 | 1102525 |
|
|
+------------------------------------------+---------+-------------+
|
|
|
|
+-----------------------------------+------------+
|
|
| Metric | HWThread 0 |
|
|
+-----------------------------------+------------+
|
|
| Runtime (RDTSC) [s] | 5.8974 |
|
|
| Runtime unhalted [s] | 4.7026 |
|
|
| Clock [MHz] | 1995.2473 |
|
|
| CPI | 0.6440 |
|
|
| Energy [J] | 338.9000 |
|
|
| Power [W] | 57.4661 |
|
|
| Energy DRAM [J] | 0 |
|
|
| Power DRAM [W] | 0 |
|
|
| DP [MFLOP/s] | 1059.4978 |
|
|
| AVX DP [MFLOP/s] | 1.3335 |
|
|
| Packed [MUOPS/s] | 0.1667 |
|
|
| Scalar [MUOPS/s] | 1058.1643 |
|
|
| Memory read bandwidth [MBytes/s] | 136.3006 |
|
|
| Memory read data volume [GBytes] | 0.8038 |
|
|
| Memory write bandwidth [MBytes/s] | 72.2612 |
|
|
| Memory write data volume [GBytes] | 0.4262 |
|
|
| Memory bandwidth [MBytes/s] | 208.5618 |
|
|
| Memory data volume [GBytes] | 1.2300 |
|
|
| Operational intensity | 5.0800 |
|
|
+-----------------------------------+------------+
|
|
|