=== Assignment 2 ===
The project that has been chosen for For Assignment 2 is Max Fainshtein's Assignment 1 project, Boxblur on an image using opencv C++ Library and we have decided to implement parallelize the Box Blur algorithm because we realized that it would have been the only algorithm worth parallelizing using CUDA into .The main choice that made us decide to continue with Box Blur, as supposed of the Vehicle Detection program, is because it was the project only problem where we could createa kernel for and gain a lot of gain in order execution speed. The only way to eliminate optimize the Vehicle Detection program was to enable CUDA for the stress on DLIB library, which it is not really in the CPU and RAMscope of this assignment.<br><br>----
The kernel had been designed to run with 512 threads in order to ensure that this type of program would be able to run on lower compute capability CUDA supported hardware.
<br><br>