Rafael Ravedutti
0a2ec6376c
Add memory tracer and update config.mk with all options
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-06-16 00:56:00 +02:00
Rafael Ravedutti
933f7c7bba
Update README.md to fix lists
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-06-11 16:42:23 +02:00
Rafael Ravedutti
977bc68699
Add README.md for utilities
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-06-11 16:38:53 +02:00
Rafael Ravedutti
91661a79e6
Add plot script and move scripts to util directory
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-06-11 15:34:19 +02:00
Jan Eitzinger
1a18341d84
Merge pull request #3 from RRZE-HPC/stub
...
Stub force kernel
2021-06-11 09:50:05 +02:00
Jan Eitzinger
c6f3f9afa1
Set ICC as default
2021-06-11 09:48:41 +02:00
Jan Eitzinger
b6d4753c2a
Add LIKWID Option. Allow to overwrite with asm variant.
2021-06-11 09:38:34 +02:00
Rafael Ravedutti
b8b364d265
Move analysis and result files to log directory
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-06-11 02:16:10 +02:00
Rafael Ravedutti
0482e4f09a
Avoid resize messages on run_stub output and build objects from assembly files
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-22 03:26:56 +02:00
Rafael Ravedutti
abc844947d
Add results with small number of unit cells and higher number of neighbors
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-20 20:38:53 +02:00
Rafael Ravedutti
f8e5415195
Add results for casclakesp2 and skylakesp2 with iln=100,1000
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-20 00:47:30 +02:00
Rafael Ravedutti
56ad09156b
Fix explicit types for stubbed version
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-20 00:08:10 +02:00
Rafael Ravedutti
4496e91125
Add version with explicit types for atoms
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-19 23:51:02 +02:00
Rafael Ravedutti
f7f7ae2002
Update soa_broadep2_iln1000 results
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-16 23:58:52 +02:00
Rafael Ravedutti
6704089f7d
Update .gitignore
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-16 23:52:59 +02:00
Rafael Ravedutti
6c4168fdb5
Add OSACA analysis and ASM code for AVX2 with AoS, lt600 variant
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-08 01:03:44 +02:00
Rafael Ravedutti
6c03ea3f3c
Adjust output when computing invalid values
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-07 02:31:53 +02:00
Rafael Ravedutti
9c28ff1e9e
Add arch_analysis directory and first AVX2 results
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-06 23:18:28 +02:00
Rafael Ravedutti
327cc302b8
Create avx512 directory for analysis
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-06 14:00:08 +02:00
Rafael Ravedutti
e53d9961ef
Fix compilation when INTERNAL_LOOP_NTIMES is not set and create avx512 directory
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-06 13:59:02 +02:00
moebiusband73
79e9de019f
Update README.md
2021-05-05 09:58:46 +02:00
moebiusband73
59886dba77
Update README.md
2021-05-05 09:55:22 +02:00
Rafael Ravedutti
15de65303e
Add version iterating most internal loop multiple times
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-05-05 03:04:41 +02:00
Rafael Ravedutti
faf1e2ae85
Update results and assembly for SoA
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-30 15:58:54 +02:00
Rafael Ravedutti
0a81407948
Fix invalid values for cycles per neighbor
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-29 20:29:00 +02:00
Rafael Ravedutti
11b2d4bcc1
Update results for arch_analysis and stub script
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-29 17:57:37 +02:00
Rafael Ravedutti
02ff7de18f
Add markers for all kernel variants
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-28 23:19:06 +02:00
Rafael Ravedutti
10dacc5a4e
Add force for AoS data layout with markers
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-28 17:48:40 +02:00
Rafael Ravedutti
5cb341ab1f
Add IACA output for SOA
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-27 01:04:31 +02:00
Rafael Ravedutti
d0d2bf8a0c
Add OSACA output for SOA
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-27 00:36:34 +02:00
Rafael Ravedutti
1a195a30e2
Add script to get stub results for several configurations
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-22 20:24:50 +02:00
Rafael Ravedutti
c356336dbd
Show cycles per atom and neighbor
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-22 16:50:22 +02:00
Rafael Ravedutti
4c53519c73
Add output as CSV
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-22 01:22:18 +02:00
Rafael Ravedutti
fd108d97d8
Fix problem when atoms_per_unit_cell is less or equal than 4
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-22 00:07:42 +02:00
Rafael Ravedutti
3c7dbc833a
Allow any values for atoms_per_unit_cell
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-21 13:01:06 +02:00
Rafael Ravedutti
d3121ee08f
Adjust computeForce parameters for stub
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-21 11:28:02 +02:00
Rafael Ravedutti
5131b7bcaa
Add comments for second kernel variant on Intel AOS assembly
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-16 16:31:27 +02:00
Rafael Ravedutti
e656490a38
Add annotated assembly files
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-16 12:44:18 +02:00
Rafael Ravedutti
78e6e5c773
Merge master branch into stub
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-15 20:12:36 +02:00
Jan Eitzinger
06ba3b2726
Restructure timing and instrumentation. Add performance metric.
2021-04-15 14:55:02 +02:00
Rafael Ravedutti
a0699dde4c
Merge branch 'master' into stub
2021-04-12 23:10:36 +02:00
Jan Eitzinger
35273c491b
Introduce generic variant targets
...
Change Makefiles to allow multiple binary targets specified by VARIANT
environment variabel. For specified variants a matching main-<VARIANT>.c
source file must be present.
2021-04-12 08:24:27 +02:00
Rafael Ravedutti
706f1c38f2
Update stubbed force calculation
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-04-07 00:46:51 +02:00
Rafael Ravedutti
77a0774208
Improve stubbed force calculation version
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-03-30 22:17:30 +02:00
Rafael Ravedutti
7efd30791a
Add version with stubbed force calculation
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-03-30 01:54:56 +02:00
Jan Eitzinger
2bc3120112
Put force kernel in separate compilation unit
2021-03-25 06:49:28 +01:00
moebiusband73
dcfa1c4135
Merge pull request #2 from RRZE-HPC/aos
...
Aos
2021-03-24 08:50:29 +01:00
Jan Eitzinger
32b23a2ebd
Set defaults for GCC
2021-03-24 08:48:27 +01:00
Jan Eitzinger
d026028439
Streamline build system
2021-03-24 08:43:44 +01:00
Rafael Ravedutti
4b481bb407
Fix macros for AoS
...
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
2021-03-23 16:21:08 +01:00