
added MultiEpochsDataLoader #140

Merged: 1 commit into huggingface:master on May 5, 2020

Conversation

@yoniaflalo commented May 5, 2020

Hi.

I have added a feature called MultiEpochsDataLoader. With the standard PyTorch data loader, there is a long wait at the beginning of every epoch and training is very slow for the first iterations, because the data loader is reinitialized from scratch (its worker processes are shut down and respawned).

With this feature we do not waste that time: only the first initialization of the dataloader, at the first epoch, is slow; for every subsequent epoch, the first iteration is as fast as the iterations in the middle of an epoch.
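For reference, here is a minimal sketch of how such a loader can be built (a reconstruction of the technique, not necessarily the exact code in this commit): the batch sampler is wrapped in a sampler that repeats forever, and the worker iterator is created once and reused, so later epochs keep the already-running workers. The `_RepeatSampler` name and the toggling of the name-mangled `_DataLoader__initialized` flag are choices of this sketch.

```python
import torch


class _RepeatSampler:
    """Wraps a batch sampler so that it yields batches forever.

    Each pass calls iter() on the wrapped sampler again, so a shuffling
    sampler still produces a fresh permutation every epoch.
    """

    def __init__(self, sampler):
        self.sampler = sampler

    def __iter__(self):
        while True:
            yield from iter(self.sampler)


class MultiEpochsDataLoader(torch.utils.data.DataLoader):
    """DataLoader whose worker processes survive across epochs."""

    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # Lift DataLoader's "no mutation after init" guard (a name-mangled
        # private flag) to swap in the infinite batch sampler.
        self._DataLoader__initialized = False
        self.batch_sampler = _RepeatSampler(self.batch_sampler)
        self._DataLoader__initialized = True
        # Create the worker iterator once; every epoch reuses it.
        self.iterator = super().__iter__()

    def __len__(self):
        # Length of one epoch: the number of batches in the wrapped sampler.
        return len(self.batch_sampler.sampler)

    def __iter__(self):
        # Slice one epoch's worth of batches out of the endless stream.
        for _ in range(len(self)):
            yield next(self.iterator)
```

It is a drop-in replacement for `torch.utils.data.DataLoader`; because the wrapped sampler is re-iterated on every pass, shuffling still gives a fresh order each epoch.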

Example when using the MultiEpochsDataLoader:

First epoch:

[Screenshot: per-iteration timing during the first epoch]

Next epochs:

[Screenshot: per-iteration timing during subsequent epochs]

This can save more than 10 seconds per epoch, which over a 300-epoch training run adds up to almost an hour. (Training on 8 V100s, for example, is quite expensive, so saving an hour on every run is quite nice.)

I have tested the feature on a training run of ecaresnetlight.

@chris-ha458 (Contributor)

This code looks like it would be valuable in the PyTorch code base itself!
Have you considered sending a PR or opening an issue there too?

@yoniaflalo (Author)

No, I have not sent a PR to the PyTorch code base. I do not know that code base well enough, and PRs to PyTorch take several months to get merged. Maybe I could consider doing it, but for now I think it is good to have this in this repository, since it is the best repository that exists for image classification training.

@rwightman (Collaborator)

thanks

@rwightman merged commit 3b72ebf into huggingface:master on May 5, 2020
@mrT23 (Contributor) commented May 8, 2020

@vrandme this has already been proposed and discussed in a PyTorch pull request: pytorch/pytorch#15849 (comment)

@bryant1410 (Contributor)

(quoting @mrT23) @vrandme this has already been proposed and discussed in a PyTorch pull request: pytorch/pytorch#15849 (comment)

IIUC, this is different. Re-using the workers can be done by just keeping them alive. This implementation goes a bit beyond that: the workers also pre-fetch data for the next epoch instead of restarting the data loading pipeline, because the sampler is infinite.
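For illustration, a hypothetical timing loop (the toy dataset, batch size, and worker count are made up) showing the effect: a plain DataLoader spawns fresh worker processes on every pass over the loader, so the first batch of each epoch is slow, while the MultiEpochsDataLoader sketched above pays that cost only once.

```python
import time

import torch
from torch.utils.data import DataLoader, TensorDataset

if __name__ == "__main__":  # guard needed when using multiprocessing workers
    # Toy stand-in for a real image dataset.
    dataset = TensorDataset(torch.randn(10_000, 3, 32, 32))

    for loader_cls in (DataLoader, MultiEpochsDataLoader):
        loader = loader_cls(dataset, batch_size=256, shuffle=True, num_workers=4)
        print(loader_cls.__name__)
        for epoch in range(3):
            t0 = time.perf_counter()
            for step, (images,) in enumerate(loader):
                if step == 0:
                    # A plain DataLoader pays worker startup here every epoch;
                    # MultiEpochsDataLoader only on epoch 0.
                    print(f"  epoch {epoch}: first batch after "
                          f"{time.perf_counter() - t0:.2f}s")
```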

@AhmedAhmedEG commented Dec 26, 2023

Does this implementation mess up the shuffling in the DataLoader? I always thought the DataLoader reinitializes itself for shuffling and a couple of other operations.

Are there any drawbacks to using this method? It's very strange that the PyTorch devs never considered doing such a crucial thing.
