🐛 Bug
When accumulate_grad_batches >= 1, the logged loss is divided by accumulate_grad_batches.
For example, with accumulate_grad_batches=1 the logged loss is 4; with accumulate_grad_batches=2 it is 2; and with accumulate_grad_batches=4 it is 1.
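A minimal reproduction sketch (not from the original report), assuming a Lightning version that supports self.log; the module, data, and metric name below are made up for illustration:

```python
import torch
import torch.nn.functional as F
from torch.utils.data import DataLoader, TensorDataset
import pytorch_lightning as pl


class ToyModule(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(4, 1)

    def training_step(self, batch, batch_idx):
        x, y = batch
        # The raw loss value itself should not depend on gradient accumulation.
        loss = F.mse_loss(self.layer(x), y)
        self.log("train_loss", loss, prog_bar=True)
        return loss

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=0.01)


def run(accumulate_grad_batches):
    dataset = TensorDataset(torch.randn(64, 4), torch.randn(64, 1))
    model = ToyModule()
    trainer = pl.Trainer(max_epochs=1, accumulate_grad_batches=accumulate_grad_batches)
    trainer.fit(model, DataLoader(dataset, batch_size=8))


if __name__ == "__main__":
    run(1)  # logged train_loss matches the raw loss
    run(2)  # logged train_loss comes out roughly halved (the reported behavior)
```

Comparing the train_loss shown for the two runs illustrates the scaling described above.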
Expected behavior
The logged loss should be roughly the same regardless of the value of accumulate_grad_batches.
Environment
PyTorch Version (e.g., 1.0): nightly
OS (e.g., Linux): Linux
How you installed PyTorch (conda, pip, source): pip
Build command you used (if compiling from source):
Python version: 3.6
CUDA/cuDNN version: 10.2
GPU models and configuration:
Any other relevant information:
Additional context