Lightning Adventures
Contents
Lightning Adventures is a assembly of 3 DPS915 Students, working towards parallelizing an application using NVIDIA's parallel computing architecture, CUDA.
Mission Statement
Our focus is to each select a program that we think will benefit from being optimized for parallel heterogeneous computing on the CPU and GPU. We will accomplish our task by first determining and profiling an application that we have found.
Team Members
- James Boelen
- Raymond Hung
- Stanley Tsang
Assignment 1
Fluorescent Monte Carlo - Simulation of fluoresence in scattering medium
Attempted by: Stanley Tsang
Uses the Monte Carlo subroutine mcsub.c to simulate the penetration and escape of excitation light from a semi-infinite medium, the distribution and escape of fluorescence from a uniform fluorophore in the medium, and the distribution and escape of fluorescence from a localized heterogeneity in the medium. Monte Carlo process-based simulations generally respond very well to parallelization as the random input process consists of entirely independent iterations and thus can be run in parallel (in theory).
The project can be found here: http://omlc.ogi.edu/software/mc/mcfluor/index.html
My goal for this project will be to parallelize the mcsub() function in mcsubLIB.c. The target function is quite long so I present a truncated version of the hotspot below: Note that Nphotons is extremely large, at least 1e6.
for (iphoton=1; iphoton<=Nphotons; iphoton++) { /* Print out progress for user if mcflag < 3 */ temp = (double)iphoton; if ((PRINTOUT == 1) & (mcflag < 3) & (temp >= 100)) { if (temp<1000) { if (fmod(temp,100)==0) printf("%1.0f photons\n",temp); } if (temp<10000) { if (fmod(temp,1000)==0) printf("%1.0f photons\n",temp); } else if (temp<100000) { if (fmod(temp,10000)==0) printf("%1.0f photons\n",temp); } else if (temp<1000000) { if (fmod(temp,100000)==0) printf("%1.0f photons\n",temp); } else if (temp<10000000) { if (fmod(temp,1000000)==0) printf("%1.0f photons\n",temp); } else if (temp<100000000) { if (fmod(temp,10000000)==0) printf("%1.0f photons\n",temp); } } . . . . . for (ir=1; ir<=NR; ir++) { r = (ir - 0.5)*dr; temp += J[ir]; /* accumulate total escaped photon weight */ J[ir] /= 2.0*PI*r*dr*Nphotons; /* flux density */ for (iz=1; iz<=NZ; iz++) F[iz][ir] /= 2.0*PI*r*dr*dz*Nphotons*mua; /* fluence rate */ } *Sptr = S = Rsptot/Nphotons; *Aptr = A = Atot/Nphotons; *Eptr = E = temp/Nphotons; }
TOMO - Proton Synchrotron Tomography
Attempted by: James Boelen
This is a program used by C.E.R.N. Currently, it is written in Fortran and uses OpenMP to accelerate the calculations. We will be re-creating this code for CUDA to try and improve the computation time. Due to the nature of this product, we are unable to post a code snippet.