Rafael Ravedutti
|
d97fc577b0
|
Add first version of index and distance tracer
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-07-09 23:49:14 +02:00 |
|
Rafael Ravedutti
|
e6062e8f79
|
Update script to plot stubbed data and add results for casclakesp2 with AoS
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-07-03 03:07:35 +02:00 |
|
Rafael Ravedutti
|
eef44e97d7
|
Update cache settings for casclakesp2 and results
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-07-02 02:57:31 +02:00 |
|
Rafael Ravedutti
|
4bde1944cf
|
Update script for plotting gather data and AOS results for casclakesp2
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-07-01 20:02:59 +02:00 |
|
Rafael Ravedutti
|
5534f1b195
|
Modify gather script to include raw data and update results without prefetchers
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-30 16:09:34 +02:00 |
|
Rafael Ravedutti
|
ecb5ccf6ff
|
Adjust likwid markers for force region
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-30 13:44:02 +02:00 |
|
Rafael Ravedutti
|
06aae5593b
|
Add script to plot gather data
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-25 02:02:08 +02:00 |
|
Rafael Ravedutti
|
7ae22a5695
|
Add forces reading operation on memory tracer and include ampersand in TRACER_PRINT
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-21 22:56:44 +02:00 |
|
Rafael Ravedutti
|
0bb7e3c61f
|
Add cache simulator script and first results
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-17 03:07:34 +02:00 |
|
Rafael Ravedutti
|
0a2ec6376c
|
Add memory tracer and update config.mk with all options
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-16 00:56:00 +02:00 |
|
Rafael Ravedutti
|
933f7c7bba
|
Update README.md to fix lists
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-11 16:42:23 +02:00 |
|
Rafael Ravedutti
|
977bc68699
|
Add README.md for utilities
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-11 16:38:53 +02:00 |
|
Rafael Ravedutti
|
91661a79e6
|
Add plot script and move scripts to util directory
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-11 15:34:19 +02:00 |
|
Jan Eitzinger
|
1a18341d84
|
Merge pull request #3 from RRZE-HPC/stub
Stub force kernel
|
2021-06-11 09:50:05 +02:00 |
|
Jan Eitzinger
|
c6f3f9afa1
|
Set ICC as default
|
2021-06-11 09:48:41 +02:00 |
|
Jan Eitzinger
|
b6d4753c2a
|
Add LIKWID Option. Allow to overwrite with asm variant.
|
2021-06-11 09:38:34 +02:00 |
|
Rafael Ravedutti
|
b8b364d265
|
Move analysis and result files to log directory
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-06-11 02:16:10 +02:00 |
|
Rafael Ravedutti
|
0482e4f09a
|
Avoid resize messages on run_stub output and build objects from assembly files
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-22 03:26:56 +02:00 |
|
Rafael Ravedutti
|
abc844947d
|
Add results with small number of unit cells and higher number of neighbors
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-20 20:38:53 +02:00 |
|
Rafael Ravedutti
|
f8e5415195
|
Add results for casclakesp2 and skylakesp2 with iln=100,1000
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-20 00:47:30 +02:00 |
|
Rafael Ravedutti
|
56ad09156b
|
Fix explicit types for stubbed version
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-20 00:08:10 +02:00 |
|
Rafael Ravedutti
|
4496e91125
|
Add version with explicit types for atoms
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-19 23:51:02 +02:00 |
|
Rafael Ravedutti
|
f7f7ae2002
|
Update soa_broadep2_iln1000 results
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-16 23:58:52 +02:00 |
|
Rafael Ravedutti
|
6704089f7d
|
Update .gitignore
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-16 23:52:59 +02:00 |
|
Rafael Ravedutti
|
6c4168fdb5
|
Add OSACA analysis and ASM code for AVX2 with AoS, lt600 variant
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-08 01:03:44 +02:00 |
|
Rafael Ravedutti
|
6c03ea3f3c
|
Adjust output when computing invalid values
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-07 02:31:53 +02:00 |
|
Rafael Ravedutti
|
9c28ff1e9e
|
Add arch_analysis directory and first AVX2 results
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-06 23:18:28 +02:00 |
|
Rafael Ravedutti
|
327cc302b8
|
Create avx512 directory for analysis
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-06 14:00:08 +02:00 |
|
Rafael Ravedutti
|
e53d9961ef
|
Fix compilation when INTERNAL_LOOP_NTIMES is not set and create avx512 directory
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-06 13:59:02 +02:00 |
|
moebiusband73
|
79e9de019f
|
Update README.md
|
2021-05-05 09:58:46 +02:00 |
|
moebiusband73
|
59886dba77
|
Update README.md
|
2021-05-05 09:55:22 +02:00 |
|
Rafael Ravedutti
|
15de65303e
|
Add version iterating most internal loop multiple times
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-05-05 03:04:41 +02:00 |
|
Rafael Ravedutti
|
faf1e2ae85
|
Update results and assembly for SoA
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-30 15:58:54 +02:00 |
|
Rafael Ravedutti
|
0a81407948
|
Fix invalid values for cycles per neighbor
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-29 20:29:00 +02:00 |
|
Rafael Ravedutti
|
11b2d4bcc1
|
Update results for arch_analysis and stub script
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-29 17:57:37 +02:00 |
|
Rafael Ravedutti
|
02ff7de18f
|
Add markers for all kernel variants
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-28 23:19:06 +02:00 |
|
Rafael Ravedutti
|
10dacc5a4e
|
Add force for AoS data layout with markers
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-28 17:48:40 +02:00 |
|
Rafael Ravedutti
|
5cb341ab1f
|
Add IACA output for SOA
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-27 01:04:31 +02:00 |
|
Rafael Ravedutti
|
d0d2bf8a0c
|
Add OSACA output for SOA
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-27 00:36:34 +02:00 |
|
Rafael Ravedutti
|
1a195a30e2
|
Add script to get stub results for several configurations
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-22 20:24:50 +02:00 |
|
Rafael Ravedutti
|
c356336dbd
|
Show cycles per atom and neighbor
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-22 16:50:22 +02:00 |
|
Rafael Ravedutti
|
4c53519c73
|
Add output as CSV
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-22 01:22:18 +02:00 |
|
Rafael Ravedutti
|
fd108d97d8
|
Fix problem when atoms_per_unit_cell is less or equal than 4
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-22 00:07:42 +02:00 |
|
Rafael Ravedutti
|
3c7dbc833a
|
Allow any values for atoms_per_unit_cell
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-21 13:01:06 +02:00 |
|
Rafael Ravedutti
|
d3121ee08f
|
Adjust computeForce parameters for stub
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-21 11:28:02 +02:00 |
|
Rafael Ravedutti
|
5131b7bcaa
|
Add comments for second kernel variant on Intel AOS assembly
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-16 16:31:27 +02:00 |
|
Rafael Ravedutti
|
e656490a38
|
Add annotated assembly files
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-16 12:44:18 +02:00 |
|
Rafael Ravedutti
|
78e6e5c773
|
Merge master branch into stub
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2021-04-15 20:12:36 +02:00 |
|
Jan Eitzinger
|
06ba3b2726
|
Restructure timing and instrumentation. Add performance metric.
|
2021-04-15 14:55:02 +02:00 |
|
Rafael Ravedutti
|
a0699dde4c
|
Merge branch 'master' into stub
|
2021-04-12 23:10:36 +02:00 |
|