--------------------------------------------------------------------------------
CPU name: Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz
CPU type: Intel Cascadelake SP processor
CPU clock: 2.49 GHz
--------------------------------------------------------------------------------
Parameters:
Force field: lj
Kernel: plain-C
Data layout: AoS
Floating-point precision: single
Unit cells (nx, ny, nz): 32, 32, 32
Domain box sizes (x, y, z): 5.374708e+01, 5.374708e+01, 5.374708e+01
Periodic (x, y, z): 1, 1, 1
Lattice size: 1.679596e+00
Epsilon: 1.000000e+00
Sigma: 1.000000e+00
Spring constant: 1.000000e+00
Damping constant: 1.000000e+00
Temperature: 1.440000e+00
RHO: 8.442000e-01
Mass: 1.000000e+00
Number of types: 4
Number of timesteps: 200
Report stats every (timesteps): 100
Reneighbor every (timesteps): 20
Prune every (timesteps): 1000
Output positions every (timesteps): 20
Output velocities every (timesteps): 5
Delta time (dt): 5.000000e-03
Cutoff radius: 2.500000e+00
Skin: 3.000000e-01
Half neighbor lists: 0
Processor frequency (GHz): 2.0000
----------------------------------------------------------------------------
step    temp            pressure
0       1.440000e+00    1.215639e+00
100     8.200897e-01    6.923144e-01
200     7.961481e-01    6.721031e-01
----------------------------------------------------------------------------
System: 131072 atoms 47265 ghost atoms, Steps: 200
TOTAL 10.83s FORCE 4.62s NEIGH 5.94s REST 0.26s
----------------------------------------------------------------------------
Performance: 2.42 million atom updates per second
Statistics:
Vector width: 16, Processor frequency: 2.0000 GHz
Average neighbors per atom: 76.0351
Average SIMD iterations per atom: 5.0875
Total number of computed pair interactions: 2003181259
Total number of SIMD iterations: 134032075
Useful read data volume for force computation: 32.79GB
Cycles/SIMD iteration: 68.9511
--------------------------------------------------------------------------------
Region force, Group 1: MEM_SP
+-------------------+------------+
| Region Info | HWThread 0 |
+-------------------+------------+
| RDTSC Runtime [s] | 4.452877 |
| call count | 201 |
+-------------------+------------+

+------------------------------------------+---------+-------------+
| Event | Counter | HWThread 0 |
+------------------------------------------+---------+-------------+
| INSTR_RETIRED_ANY | FIXC0 | 7428719000 |
| CPU_CLK_UNHALTED_CORE | FIXC1 | 8875251000 |
| CPU_CLK_UNHALTED_REF | FIXC2 | 11094050000 |
| PWR_PKG_ENERGY | PWR0 | 265.5057 |
| PWR_DRAM_ENERGY | PWR3 | 0 |
| FP_ARITH_INST_RETIRED_128B_PACKED_SINGLE | PMC0 | 0 |
| FP_ARITH_INST_RETIRED_SCALAR_SINGLE | PMC1 | 79036820 |
| FP_ARITH_INST_RETIRED_256B_PACKED_SINGLE | PMC2 | 0 |
| FP_ARITH_INST_RETIRED_512B_PACKED_SINGLE | PMC3 | 3935012000 |
| CAS_COUNT_RD | MBOX0C0 | 19716700 |
| CAS_COUNT_WR | MBOX0C1 | 595747 |
| CAS_COUNT_RD | MBOX1C0 | 19734880 |
| CAS_COUNT_WR | MBOX1C1 | 597090 |
| CAS_COUNT_RD | MBOX2C0 | 19732800 |
| CAS_COUNT_WR | MBOX2C1 | 595219 |
| CAS_COUNT_RD | MBOX3C0 | 19886430 |
| CAS_COUNT_WR | MBOX3C1 | 632443 |
| CAS_COUNT_RD | MBOX4C0 | 19887210 |
| CAS_COUNT_WR | MBOX4C1 | 633169 |
| CAS_COUNT_RD | MBOX5C0 | 19935560 |
| CAS_COUNT_WR | MBOX5C1 | 634112 |
+------------------------------------------+---------+-------------+

+-----------------------------------+------------+
| Metric | HWThread 0 |
+-----------------------------------+------------+
| Runtime (RDTSC) [s] | 4.4529 |
| Runtime unhalted [s] | 3.5585 |
| Clock [MHz] | 1995.2693 |
| CPI | 1.1947 |
| Energy [J] | 265.5057 |
| Power [W] | 59.6257 |
| Energy DRAM [J] | 0 |
| Power DRAM [W] | 0 |
| SP [MFLOP/s] | 14156.9661 |
| AVX SP [MFLOP/s] | 14139.2165 |
| Packed [MUOPS/s] | 883.7010 |
| Scalar [MUOPS/s] | 17.7496 |
| Memory read bandwidth [MBytes/s] | 1708.8254 |
| Memory read data volume [GBytes] | 7.6092 |
| Memory write bandwidth [MBytes/s] | 53.0035 |
| Memory write data volume [GBytes] | 0.2360 |
| Memory bandwidth [MBytes/s] | 1761.8288 |
| Memory data volume [GBytes] | 7.8452 |
| Operational intensity | 8.0354 |
+-----------------------------------+------------+

Region reneighbour, Group 1: MEM_SP
+-------------------+------------+
| Region Info | HWThread 0 |
+-------------------+------------+
| RDTSC Runtime [s] | 5.935627 |
| call count | 10 |
+-------------------+------------+

+------------------------------------------+---------+-------------+
| Event | Counter | HWThread 0 |
+------------------------------------------+---------+-------------+
| INSTR_RETIRED_ANY | FIXC0 | 18208530000 |
| CPU_CLK_UNHALTED_CORE | FIXC1 | 11805500000 |
| CPU_CLK_UNHALTED_REF | FIXC2 | 14756870000 |
| PWR_PKG_ENERGY | PWR0 | 340.7903 |
| PWR_DRAM_ENERGY | PWR3 | 0 |
| FP_ARITH_INST_RETIRED_128B_PACKED_SINGLE | PMC0 | 0 |
| FP_ARITH_INST_RETIRED_SCALAR_SINGLE | PMC1 | 6240406000 |
| FP_ARITH_INST_RETIRED_256B_PACKED_SINGLE | PMC2 | 0 |
| FP_ARITH_INST_RETIRED_512B_PACKED_SINGLE | PMC3 | 491520 |
| CAS_COUNT_RD | MBOX0C0 | 1772377 |
| CAS_COUNT_WR | MBOX0C1 | 975760 |
| CAS_COUNT_RD | MBOX1C0 | 1770611 |
| CAS_COUNT_WR | MBOX1C1 | 977433 |
| CAS_COUNT_RD | MBOX2C0 | 1771722 |
| CAS_COUNT_WR | MBOX2C1 | 979122 |
| CAS_COUNT_RD | MBOX3C0 | 1782901 |
| CAS_COUNT_WR | MBOX3C1 | 967621 |
| CAS_COUNT_RD | MBOX4C0 | 1780789 |
| CAS_COUNT_WR | MBOX4C1 | 967179 |
| CAS_COUNT_RD | MBOX5C0 | 1784733 |
| CAS_COUNT_WR | MBOX5C1 | 969349 |
+------------------------------------------+---------+-------------+

+-----------------------------------+------------+
| Metric | HWThread 0 |
+-----------------------------------+------------+
| Runtime (RDTSC) [s] | 5.9356 |
| Runtime unhalted [s] | 4.7334 |
| Clock [MHz] | 1995.2675 |
| CPI | 0.6483 |
| Energy [J] | 340.7903 |
| Power [W] | 57.4144 |
| Energy DRAM [J] | 0 |
| Power DRAM [W] | 0 |
| SP [MFLOP/s] | 1052.6723 |
| AVX SP [MFLOP/s] | 1.3249 |
| Packed [MUOPS/s] | 0.0828 |
| Scalar [MUOPS/s] | 1051.3474 |
| Memory read bandwidth [MBytes/s] | 114.9736 |
| Memory read data volume [GBytes] | 0.6824 |
| Memory write bandwidth [MBytes/s] | 62.9308 |
| Memory write data volume [GBytes] | 0.3735 |
| Memory bandwidth [MBytes/s] | 177.9044 |
| Memory data volume [GBytes] | 1.0560 |
| Operational intensity | 5.9171 |
+-----------------------------------+------------+