Open main menu

CDOT Wiki β

Changes

A-Team

19 bytes added, 05:31, 1 April 2019
Initial implementation
int j = blockIdx.y * blockDim.y + threadIdx.y;
//matrix multiplication
if (i < ni && j < nj) { float sum = 0.0f; for (int k = 0; k < nk; k++)
sum += d_a[i * nk + k] * d_b[k * nj + j];
d_p[i * nj + j] = sum;
}
}
=== Assignment 3 ===
113
edits