Maximilian Gaul
|
696e6da01d
|
Implement Neighbour list AoS memory layout + performance measurement
|
2022-01-31 20:27:59 +01:00 |
|
Maximilian Gaul
|
b2a6574426
|
Remove unnecessary atom force backcopy in computeForce
|
2022-01-24 18:09:27 +01:00 |
|
Maximilian Gaul
|
c4080e866e
|
Make integrate kernels aware of neighbour list update
|
2022-01-24 18:04:50 +01:00 |
|
Maximilian Gaul
|
dc4d5f1a9c
|
Porting atom velocity memory layout to AoS, porting velocity integration to CUDA, adding measurements + logbook update
|
2022-01-01 18:18:12 +01:00 |
|
Maximilian Gaul
|
50007216ed
|
Implemented atom force AoS memory layout, added performance measurements + logbook Update
|
2022-01-01 16:09:21 +01:00 |
|
Maximilian Gaul
|
72e4599acc
|
Copy neighbour lists only when reneighbouring happens, added measurements + logbook update
|
2022-01-01 12:56:42 +01:00 |
|
Maximilian Gaul
|
8fa03733e9
|
Copy parameters & cutforces threshold only once at the start + measurements
|
2021-12-28 16:48:26 +01:00 |
|
Maximilian Gaul
|
bf1ae3d013
|
Removed debug prints, only zero atom forces and not copy them, added measurements
|
2021-12-28 16:32:54 +01:00 |
|
Maximilian Gaul
|
8009b54113
|
Trying to debug segfault if cudaMemcpy is limited to neighbour list update
|
2021-12-25 15:36:08 +01:00 |
|
Maximilian Gaul
|
0ea0587442
|
Only malloc once at the beginning plus measurement csv
|
2021-12-25 13:52:33 +01:00 |
|
Maximilian Gaul
|
c2bfa3ca3f
|
Add scripts for perf measurement, made atom-memory allocation pinnend using 'cudaMallocHost', added measurements for atom pinnend memory
|
2021-12-18 13:02:04 +01:00 |
|
Maximilian Gaul
|
2a099da5b7
|
Started cuda profiling, added first result to logbook
|
2021-12-03 08:13:43 +01:00 |
|
Maximilian Gaul
|
7691b23d67
|
Measure memory transfer of CPU to GPU, add explanation how to distribute calculation among multiple GPUs
|
2021-12-01 17:16:32 +01:00 |
|
Maximilian Gaul
|
da90466f98
|
Added first performance measurements with threads per block from 1 to 32
|
2021-11-25 08:09:20 +01:00 |
|
Maximilian Gaul
|
8f723c1299
|
Added command line description of MD-Bench, added memory transfer rate from CPU to GPU to force.cu
|
2021-11-23 15:55:23 +01:00 |
|
Maximilian Gaul
|
0586ef150a
|
Fix num of threads instead of num of blocks, add logbook template
|
2021-11-15 19:39:09 +01:00 |
|
Maximilian Gaul
|
2e5d973f7d
|
Rough rewrite to execute outer loop of force calculation in parallel, not inner loop
|
2021-11-14 10:02:23 +01:00 |
|
Maximilian Gaul
|
e2fd1a0476
|
Fixed bug, results are now equal to master branch (but still slow)
|
2021-11-11 21:00:30 +01:00 |
|
Maximilian Gaul
|
4105c844c6
|
Runs fine (but slow), results seem to be slightly off from original
|
2021-11-11 20:47:06 +01:00 |
|
Maximilian Gaul
|
1f5c9c4b23
|
Fixed segfault error, added more cudaErrorChecks, added cudaFree to avoid memory leak
|
2021-11-11 20:29:14 +01:00 |
|
Maximilian Gaul
|
29e115464b
|
Fixed cudaMemcpy for AOS data layout, added debug outputs, added cudaErrorChecks
|
2021-11-11 20:14:30 +01:00 |
|
Maximilian Gaul
|
1a54314c8b
|
First run but segfault at the moment after a few seconds
|
2021-11-11 15:23:46 +01:00 |
|
Maximilian Gaul
|
280f595b7f
|
Fixed linker error by putting includes and cuda function in extern 'C'
|
2021-11-11 14:49:29 +01:00 |
|
Maximilian Gaul
|
3428974730
|
getTimeStamp() couldn't get linked
|
2021-11-11 08:03:56 +01:00 |
|
Maximilian Gaul
|
b54842f764
|
Added Makefile instructions for .cu files
|
2021-11-11 07:27:12 +01:00 |
|
Maximilian Gaul
|
9730164e6f
|
Rename force.c to force.cu because of cuda build errors
|
2021-11-10 16:20:04 +01:00 |
|