
How to properly fix random seed with pytorch lightning? #1565

Closed · belskikh opened this issue Apr 22, 2020 · 10 comments · Fixed by #1572
Labels: question (Further information is requested)

Comments
@belskikh

What is your question?

Hello,
I am wondering how to fix the random seed so that my experiments are reproducible.

Right now I'm calling this function before the start of training:

import os
import random

import numpy as np
import torch

def seed_everything(seed=42):
    random.seed(seed)
    os.environ['PYTHONHASHSEED'] = str(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Make cuDNN deterministic (can slow training down)
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False

But it doesn't work.
I run training in DDP mode, in case that is relevant.

Thanks in advance!

What's your environment?

  • OS: Ubuntu 18.04
  • Packaging: pip
  • Version: 0.7.1
@belskikh added the question label on Apr 22, 2020
@kumuji
Contributor

kumuji commented Apr 22, 2020

I also have the same problem without DDP mode.

What's your environment?

  • OS: Ubuntu 18.04
  • Packaging: pip
  • Version: 0.7.3

@awaelchli
Contributor

Could you set num_workers to 0 to see if it is related to the data loading? I had this problem before with plain PyTorch, and I think I solved it by also setting the seed in the data loading, because each worker subprocess gets its own seed.
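
A minimal sketch of that worker-seeding idea (the seed_worker function and the placeholder dataset are illustrative, not from this thread):

import random

import numpy as np
import torch
from torch.utils.data import DataLoader, TensorDataset

def seed_worker(worker_id):
    # PyTorch seeds each worker with base_seed + worker_id; reuse that value
    # to seed NumPy and Python's `random` inside the worker process too.
    worker_seed = torch.initial_seed() % 2**32
    np.random.seed(worker_seed)
    random.seed(worker_seed)

dataset = TensorDataset(torch.randn(100, 3))  # placeholder dataset
loader = DataLoader(dataset, batch_size=8, num_workers=2,
                    worker_init_fn=seed_worker,
                    generator=torch.Generator().manual_seed(42))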

@belskikh
Author

@awaelchli I tried that, but it didn't help.

@awaelchli
Contributor

Is there a chance you could share a Colab with a minimal example? If not, I will try to reproduce it with the pl_examples this weekend when I get to it.

@awaelchli self-assigned this on May 10, 2020
@haichao592

In my case, it was caused by dropout.
Seeding everything again in the spawned process before training basically fixed the problem.
You can do this in the on_train_start hook.

@bnaman50

bnaman50 commented Jan 4, 2021

@haichao592, could you please share your solution?

Thanks,
Naman

@haichao592


Just call pl.seed_everything(args.seed) in self.on_fit_start()
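
A minimal sketch of that hook, assuming a typical LightningModule (the class name and the stored seed attribute are illustrative):

import pytorch_lightning as pl

class LitModel(pl.LightningModule):
    def __init__(self, seed=42):
        super().__init__()
        self.seed = seed

    def on_fit_start(self):
        # Re-seed inside the (possibly DDP-spawned) process so dropout masks
        # and other RNG state are reproducible across runs.
        pl.seed_everything(self.seed)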

@bnaman50

bnaman50 commented Jan 5, 2021

Hey @haichao592, thanks for your response.

I tried to reproduce this issue on the MNIST dataset with a model that has dropout, but I did not observe the problem. Maybe it has been fixed in the newer versions of PL. The only thing I did differently was set deterministic=True in my Trainer.

Here is the code in case you want to see it (I just took it from the web for a quick check).

Could you please confirm? I just want to make sure I won't face such issues in my fine-tuning project, where debugging might not be as trivial as it is here.

Thanks,
Naman

@sld

sld commented Mar 16, 2021

For me, setting the generator argument in the random_split function helped:

train_set, val_set = random_split(data, (train_size, val_size), generator=torch.Generator().manual_seed(42))
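
A slightly fuller sketch of the same idea (the placeholder dataset and split sizes are illustrative); the same Generator also makes DataLoader shuffling reproducible:

import torch
from torch.utils.data import DataLoader, TensorDataset, random_split

data = TensorDataset(torch.randn(100, 3))  # placeholder dataset
train_size, val_size = 80, 20

# A fixed Generator makes both the split and the shuffling reproducible.
g = torch.Generator().manual_seed(42)
train_set, val_set = random_split(data, (train_size, val_size), generator=g)
train_loader = DataLoader(train_set, batch_size=16, shuffle=True, generator=g)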

@magehrig

For anyone in the future:

I am on version 1.7.6 and this is not an issue anymore.
These days, the Trainer documentation shows how to achieve deterministic behaviour.
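
A minimal sketch of that setup on a recent Lightning release (exact flags may differ between versions):

import pytorch_lightning as pl

pl.seed_everything(42, workers=True)      # also seeds DataLoader workers
trainer = pl.Trainer(deterministic=True)  # request deterministic algorithms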
