You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@AAnoosheh Honestly I do not understand how the PR you linked relates to the bug your report. Did you mean to link another issue?
The final epoch does not save a checkpoint during training.
I don't experience this. The epoch number is 0-indexed, and by default it only saves best checkpoints. Could one of these reasons be why you may think this is a bug?
@AAnoosheh so when you run pl.callbacks.ModelCheckpoint(save_top_k=-1, verbose=True) all the checkpoints are saved, however we do not save the last one as 'last.ckpt'. Also, the checkpoints are numbered from 0, so if you run for 4 epochs, the last checkpoint saved will be 'epoch=3.ckpt' and when you resume, it resumes from the expected 5th epoch.
🐛 Bug
Two issues occur:
Expected behavior
Environment
The text was updated successfully, but these errors were encountered: