

GPU610/DPS915 CUDA PI

3 bytes added, 01:38, 4 November 2013
== '''Progress''' ==
 
 
=== '''Assignment 1''' ===
==== '''Introduction''' ====
==== '''Serial Results''' ====
[[File:Pi_serial_results.jpg|border]]
 
=== '''Assignment 2''' ===
==== '''Serial VS CUDA''' ====
[[File:Pi_serial_vs_cuda_results.jpg|border]]
 
==== '''Conclusion''' ====
Parallelizing the original serial code with CUDA yields an enormous performance gain (lower execution time) when calculating pi, as high as 1372%. The next (final) phase will investigate whether shared memory, optimal memory allocation, reduced memory access time, and other optimizations provide a further performance increase (lower execution time) for pi_cuda.
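
For illustration, a minimal sketch of what a parallel pi calculation with a shared-memory reduction might look like. The pi_cuda source is not reproduced on this page, so everything here is an assumption rather than the project's actual code: the midpoint-rule integration of 4 / (1 + x^2) over [0, 1], the names <code>pi_partial</code> and <code>estimate_pi_cuda</code>, and the launch configuration of 64 blocks of 256 threads.

```cuda
#include <vector>
#include <cuda_runtime.h>

// Threads per block (assumed value; the reduction below needs a power of two).
constexpr int kThreads = 256;

// Hypothetical kernel: each thread sums a strided share of the midpoint-rule
// terms, then the block combines its partial sums in shared memory.
__global__ void pi_partial(long long n, double* block_sums) {
    __shared__ double cache[kThreads];
    const double step = 1.0 / n;
    long long i = blockIdx.x * (long long)blockDim.x + threadIdx.x;
    const long long stride = (long long)gridDim.x * blockDim.x;

    double sum = 0.0;
    for (; i < n; i += stride) {
        const double x = (i + 0.5) * step;  // midpoint of interval i
        sum += 4.0 / (1.0 + x * x);
    }
    cache[threadIdx.x] = sum;
    __syncthreads();

    // Tree reduction in shared memory: halve the active threads each pass.
    for (int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (threadIdx.x < s) cache[threadIdx.x] += cache[threadIdx.x + s];
        __syncthreads();
    }
    if (threadIdx.x == 0) block_sums[blockIdx.x] = cache[0] * step;
}

// Hypothetical host wrapper: launches the kernel and adds up the per-block
// results on the CPU. Returns -1.0 if a CUDA call fails.
double estimate_pi_cuda(long long n, int blocks = 64) {
    double* d_sums = nullptr;
    if (cudaMalloc(&d_sums, blocks * sizeof(double)) != cudaSuccess) return -1.0;

    pi_partial<<<blocks, kThreads>>>(n, d_sums);

    std::vector<double> h_sums(blocks);
    const cudaError_t err = cudaMemcpy(h_sums.data(), d_sums,
                                       blocks * sizeof(double),
                                       cudaMemcpyDeviceToHost);
    cudaFree(d_sums);
    if (err != cudaSuccess) return -1.0;

    double pi = 0.0;
    for (double s : h_sums) pi += s;
    return pi;
}
```

Calling <code>estimate_pi_cuda(1 << 24)</code> from a host <code>main</code> returns the estimate. Keeping each block's partial sums in shared memory means only one global write per block, which is the kind of memory-access saving the final phase sets out to measure.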
 
=== '''Assignment 3''' ===
