The profile shows that most of the processing time is spent in the SQ (square) and MAX (larger of two values) calculations. Each point's calculation is independent of the others, so the work can be parallelized with CUDA. The program could be sped up further by using the Barnes-Hut algorithm, which computes N-body interactions using quadtrees and spatial partitioning.
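To illustrate why the per-point work is independent, here is a minimal CPU sketch of a direct-sum acceleration pass in C++. It is not taken from the gpu-nbody repository. The names <code>Body</code>, <code>Accel</code>, <code>accelerationOn</code> and <code>computeAccelerations</code>, the 2D layout, and the <code>softening</code> term are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical 2D body and acceleration types (not from the original project).
struct Body  { double x, y, mass; };
struct Accel { double ax, ay; };

// Acceleration on body i from every other body (direct O(N) sum per body).
// The function only reads `bodies`, so calls for different i do not interfere.
Accel accelerationOn(const std::vector<Body>& bodies, std::size_t i,
                     double G, double softening) {
    assert(softening > 0.0);  // assumed softening term avoids division by zero
    Accel a{0.0, 0.0};
    for (std::size_t j = 0; j < bodies.size(); ++j) {
        if (j == i) continue;
        const double dx = bodies[j].x - bodies[i].x;
        const double dy = bodies[j].y - bodies[i].y;
        const double distSq = dx * dx + dy * dy + softening * softening;
        const double invDist = 1.0 / std::sqrt(distSq);
        // G * m_j / r^3, applied to the displacement vector (dx, dy).
        const double s = G * bodies[j].mass * invDist * invDist * invDist;
        a.ax += s * dx;
        a.ay += s * dy;
    }
    return a;
}

// Serial driver. Each iteration writes only out[i], which is why the loop
// body could plausibly become one CUDA thread per body.
std::vector<Accel> computeAccelerations(const std::vector<Body>& bodies,
                                        double G, double softening) {
    std::vector<Accel> out(bodies.size());
    for (std::size_t i = 0; i < bodies.size(); ++i) {
        out[i] = accelerationOn(bodies, i, G, softening);
    }
    return out;
}
```

In a CUDA port along these lines, the outer loop index would typically be replaced by a per-thread index, with each thread running the inner loop for its own body. The total work is still O(N²), which is the cost Barnes-Hut aims to reduce.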
=== Assignment 2 - N-Body Simulation ===
==== Baseline ====
The profiles below were collected with the following compiler and machine settings:
* Grids - [http://www.resultsovercoffee.com/2011/02/cuda-blocks-and-grids.html resultsovercoffee.com/2011/02/cuda-blocks-and-grids.html]
* Repo - [https://code.google.com/p/gpu-nbody/ code.google.com/p/gpu-nbody]
=== Assignment 3 ===