Training with low GPU memory

kargintima · March 2, 2020, 10:42pm

I have a GPU with 4 GB of memory.
Which options should I use to train my model?
I got this error after a few hours of training:

RuntimeError: CUDA out of memory. Tried to allocate 734.00 MiB (GPU 0; 3.95 GiB total capacity; 2.21GiB already allocated; 317.06 MiB free; 3.00 GiB reserved in total by PyTorch)

Training kept going despite these errors for a while, but unfortunately it eventually stopped.

I know that I have to play with the batch size and the accumulation count, but how?

Nart · March 4, 2020, 8:46am

You can train for free on Google Colab, which gives you enough GPU memory.
Here is an English-Russian notebook as an example:
https://colab.research.google.com/drive/1yLScS662f9Xv-uJw-MltLBDSiN4LeBnc

tel34 · March 5, 2020, 11:00am

That’s a great tip, @Nart. Thanks! My own on-premise 11 GB GPU is overworked as it is.

Nart · March 6, 2020, 7:12am

@kargintima check out the -batch_size and -accum_count options on the "Training models" page.
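
To make the trade-off between the two options concrete, here is a rough sketch in plain Python (not OpenNMT code; `split_batch`, `mean_grad`, `accumulated_grad` and the toy data are hypothetical names made up for illustration). The idea is that a smaller -batch_size lowers the memory needed per step, while a larger -accum_count accumulates gradients over several small batches before each update, so the effective batch is roughly batch_size * accum_count.

```python
import math


def split_batch(effective_batch, max_batch_in_memory):
    """Hypothetical helper: pick (batch_size, accum_count) such that
    batch_size <= max_batch_in_memory and
    batch_size * accum_count >= effective_batch."""
    accum_count = math.ceil(effective_batch / max_batch_in_memory)
    batch_size = math.ceil(effective_batch / accum_count)
    return batch_size, accum_count


def mean_grad(w, batch):
    """Gradient of the mean squared error of y = w * x over one batch."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


def accumulated_grad(w, data, batch_size):
    """Average the gradients of equal-sized small batches, one at a time."""
    chunks = [data[i:i + batch_size] for i in range(0, len(data), batch_size)]
    return sum(mean_grad(w, chunk) for chunk in chunks) / len(chunks)


# Toy (x, y) pairs, assumed purely for the demonstration.
data = [(1.0, 2.0), (2.0, 4.5), (3.0, 5.5), (4.0, 8.5)]

# A target batch of 4096 that only fits 1024 at a time needs 4 accumulation steps.
print(split_batch(4096, 1024))  # (1024, 4)

# Two accumulated batches of 2 give the same gradient as one batch of 4.
print(math.isclose(mean_grad(0.5, data), accumulated_grad(0.5, data, 2)))  # True
```

On a 4 GB card this suggests lowering -batch_size until the out-of-memory error goes away and raising -accum_count by about the same factor; the exact values that fit depend on the model and data, so treat the numbers above as placeholders.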