
Training with mscoco data set #61

Open
SINDHUN97 opened this issue Jun 26, 2020 · 5 comments
@SINDHUN97

  • SteganoGAN version or git commit: 0.1.3
  • Python version (output of python --version): 3.7.3
  • Pip version (output of pip --version): 20.1.1
  • PyTorch version (output of python -c "import torch; print(torch.__version__)"): 1.0.0
  • Operating System: Ubuntu Deep Learning instance (Instance Type: g4dn.2xlarge)

Description

I started training SteganoGAN with the MSCOCO dataset using the instance specification above. Completing one epoch takes almost 3 hours 30 minutes. What is the expected training time for this dataset, and which instance type would you suggest to improve performance?

What I Did

Tried 4 different GPU instance types:

  1. g4dn.2xlarge (1 epoch: 3 hrs 30 min)
  2. g4dn.8xlarge (1 epoch: 3 hrs 30 min)
  3. p2.xlarge (1 epoch: more than 4 hrs)
  4. g3.8xlarge (1 epoch: 3 hrs 30 min)
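For context, the reported epoch times can be converted into rough throughput numbers. A back-of-the-envelope sketch (the image count is an assumption: the MSCOCO 2017 train split has ~118,287 images, but the exact subset used here isn't stated):

```python
# Back-of-the-envelope throughput from the reported epoch times.
# NUM_IMAGES is an assumption: the MSCOCO 2017 train split has
# ~118,287 images; the exact subset used in this issue isn't stated.
NUM_IMAGES = 118_287

def images_per_second(epoch_hours):
    """Approximate images processed per second over one epoch."""
    return NUM_IMAGES / (epoch_hours * 3600)

for instance, hours in [
    ("g4dn.2xlarge", 3.5),
    ("g4dn.8xlarge", 3.5),
    ("p2.xlarge", 4.0),
    ("g3.8xlarge", 3.5),
]:
    print(f"{instance}: ~{images_per_second(hours):.1f} images/s")
```

The near-identical times across very different GPUs suggest the bottleneck may not be GPU compute at all (e.g., data loading or a CPU-bound stage), which is worth checking before scaling up the instance.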


@k15z
Contributor

k15z commented Jul 8, 2020

That running time doesn't seem unreasonable to me. It looks like you're running it through the Python API; just to confirm, are you setting cuda=True when you create the SteganoGAN instance?

@SINDHUN97
Author

Hi @k15z,
Yes, I am setting cuda=True in the API:

```python
SteganoGAN(1, BasicEncoder, BasicDecoder, BasicCritic, hidden_size=32, cuda=True, verbose=True)
```

@k15z
Contributor

k15z commented Jul 8, 2020

OK, the running time seems reasonable to me. We ran our original experiments on p2.8xlarge instances, and I believe it took around 2 hours per epoch.

@kveerama
Contributor

kveerama commented Jul 8, 2020

@k15z how many epochs are needed to train the model? Also, @udvattam reported 1.5 hours on the same dataset. Is there a recommendation on the batch_size?

@udvattam

udvattam commented Jul 8, 2020

@kveerama I trained on a modified version of the dataset, using only half of the data, which is why I think it was faster. However, I experimented with the batch size for training and validation, and epoch time decreased as I increased the batch size, even up to a size of 100. The decrease plateaued around a batch size of 32 for me.
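The plateau described above is what you would expect if each batch carries a fixed overhead (data loading, kernel launch, host-to-device transfer) on top of per-image compute: larger batches amortize the overhead until per-image work dominates. A toy cost model illustrating the shape of that curve (all constants are illustrative assumptions, not measurements from this thread):

```python
import math

# Toy cost model: each batch pays a fixed overhead plus a per-image
# compute cost. The constants below are illustrative assumptions,
# not measurements from this thread.
PER_BATCH_OVERHEAD = 0.05  # seconds per batch (loading, launch, transfer)
PER_IMAGE_COST = 0.02      # seconds of compute per image
NUM_IMAGES = 10_000

def epoch_seconds(batch_size):
    """Estimated epoch time under the fixed-overhead-per-batch model."""
    batches = math.ceil(NUM_IMAGES / batch_size)
    return batches * (PER_BATCH_OVERHEAD + batch_size * PER_IMAGE_COST)

for bs in (4, 8, 16, 32, 64, 100):
    print(f"batch_size={bs:>3}: ~{epoch_seconds(bs):.0f}s per epoch")
```

Under this model, going from batch size 4 to 32 saves far more time than going from 32 to 100, matching the plateau @udvattam observed; the gains stop once GPU memory or per-image compute becomes the limit.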
