CDOT Wiki
A-Team
8 bytes removed, 05:30, 1 April 2019
→ Initial implementation
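int j = blockIdx.y * blockDim.y + threadIdx.y;
// matrix multiplication: each in-range thread accumulates the dot product
// of row i of d_a (ni x nk) and column j of d_b (nk x nj), both row-major
if (i < ni && j < nj) {
    float sum = 0.0f;
    for (int k = 0; k < nk; k++)
        sum += d_a[i * nk + k] * d_b[k * nj + j];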
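The excerpt above shows only part of the kernel, so here is a minimal sketch of how the surrounding pieces might fit together. Several things here are assumptions and do not come from the page: the kernel name `matMul`, the output array `d_c` and its store, the derivation of `i` from the x dimension (mirroring the `j` line), the host wrapper `multiplyOnDevice`, and the 16 x 16 block size. Error checking of the CUDA runtime calls is left out for brevity.

```cuda
#include <cuda_runtime.h>
#include <cassert>
#include <vector>

// One thread per output element of C = A * B.
// A is ni x nk, B is nk x nj, C is ni x nj, all stored row-major.
__global__ void matMul(const float* d_a, const float* d_b, float* d_c,
                       int ni, int nj, int nk) {
    // Assumption: i is taken from the x dimension, by analogy with j below.
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    int j = blockIdx.y * blockDim.y + threadIdx.y;
    // Threads past the matrix edge do nothing (the grid is rounded up).
    if (i < ni && j < nj) {
        float sum = 0.0f;
        for (int k = 0; k < nk; k++)
            sum += d_a[i * nk + k] * d_b[k * nj + j];
        d_c[i * nj + j] = sum;  // assumed store; d_c is a hypothetical name
    }
}

// Hypothetical host wrapper: copies inputs to the device, launches the
// kernel, and copies the result back.
std::vector<float> multiplyOnDevice(const std::vector<float>& a,
                                    const std::vector<float>& b,
                                    int ni, int nj, int nk) {
    assert(a.size() == static_cast<size_t>(ni) * nk);
    assert(b.size() == static_cast<size_t>(nk) * nj);
    std::vector<float> c(static_cast<size_t>(ni) * nj);

    float *d_a = nullptr, *d_b = nullptr, *d_c = nullptr;
    cudaMalloc(&d_a, a.size() * sizeof(float));
    cudaMalloc(&d_b, b.size() * sizeof(float));
    cudaMalloc(&d_c, c.size() * sizeof(float));
    cudaMemcpy(d_a, a.data(), a.size() * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, b.data(), b.size() * sizeof(float), cudaMemcpyHostToDevice);

    const int ntpb = 16;  // threads per block edge (assumed value)
    dim3 block(ntpb, ntpb);
    // Round up so every row index i and column index j is covered.
    dim3 grid((ni + ntpb - 1) / ntpb, (nj + ntpb - 1) / ntpb);
    matMul<<<grid, block>>>(d_a, d_b, d_c, ni, nj, nk);

    // cudaMemcpy on the default stream waits for the kernel to finish.
    cudaMemcpy(c.data(), d_c, c.size() * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_c);
    return c;
}
```

Calling `multiplyOnDevice(a, b, ni, nj, nk)` with row-major host vectors returns the product as a row-major vector of `ni * nj` floats. The bounds check `i < ni && j < nj` is what allows the grid to be rounded up when the matrix dimensions are not multiples of the block size.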
Spdjurovic