Changes

Jump to: navigation, search

DPS921/PyTorch: Convolutional Neural Networks

15 bytes added, 13:13, 30 November 2020
Parallelization Methods
The code is very similar to a single GPU implementation, except for the ''.to('cuda:x')'' calls, where ''cuda:0'' and ''cuda:1'' are each their own GPU.
<code>
model = ToyModel()
loss_fn = nn.MSELoss()
loss_fn(outputs, labels).backward()
optimizer.step()
</code>
The backward() and torch.optim will automatically take care of gradients as if the model is on one GPU. You only need to make sure that the labels are on the same device as the outputs when calling the loss function.
56
edits

Navigation menu