Changes

← Older edit

GPU610/DPS915 CUDA PI

186 bytes added, 20:22, 4 November 2013

→‎Conclusion

== '''Progress''' ==

// calculate pi

pi = 4.0 * (float)score/(float)points;

==== '''Software and Hardware''' ====

[[File:Pi_software_and_hardware_list.jpg|border]]

==== '''Program Execution Plan''' ====

==== '''Conclusion''' ====

As the sample count increases, the execution time of the program also increases. The Big-O Classification for ''pi_serial'' is O(1).

=== '''Assignment 2''' ===

}

==== '''Software and Hardware''' ====

[[File:Pi_software_and_hardware_list.jpg|border]]

==== '''Program Execution Plan''' ====

==== '''Conclusion''' ====

Using CUDA technology and parallelizing the serial code in the original code, there is an enormous increase in performance (lower execution time) to calculate , as high as '''1372%'''. In the next (final) phase, an attempt to investigate if shared memory, optimal memory allocation, minimizing said memory access time, and other optimization factors would provide a further increase (lower execution time) in performance for ''pi_cuda''.

=== '''Assignment 3''' ===

Peter Huang

1

edit

CDOT Wiki β

Changes

GPU610/DPS915 CUDA PI

CDOT Wiki ^β