Only invoke setup() once, not in both trainer.fit() and trainer.test() #2620
Comments
we could also just split the method fit_setup
Or simply just don't call
that doesn't happen, no? If you call `.test()`, only `setup('test')` gets called
Since it calls
oh! yeah that's a bug :) mind submitting a PR? nice catch!!
I think this bug was reintroduced somehow. At least it is showing the same behavior again for me, with setup being called twice when testing after fitting. I'm new to Lightning so I am not a big help yet, but I started digging a bit and at least found that the code from this fix no longer seems to be in the current trainer code.
hey @AlexHarn, could you open up a new issue for this, along with example code to reproduce the behavior you're seeing? The code has changed quite a bit since this issue was filed.
Yep, I'm actually doing that right now!
🚀 Feature
Only invoke `setup(self, step: str)` when calling `trainer.test(net)` if it has not been called before (e.g. by `trainer.fit(net)`).
Motivation
The `setup` function is described in the docs as: "use setup to do splits, and build your model internals". Therefore, I wrote code that does the train-val-test split and some DataFrame label transformations (e.g. label to one-hot) in this function.
A pretty common pattern is the following:
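A minimal plain-Python sketch of that pattern and the reported behavior (no Lightning imports; `ToyTrainer` and `ToyModule` are hypothetical stand-ins for `Trainer` and `LightningModule`, reduced to just the setup hook):

```python
class ToyModule:
    """Stand-in for a LightningModule that records setup() invocations."""

    def __init__(self):
        self.setup_calls = []

    def setup(self, step: str):
        # Stands in for the expensive work: splits, label transforms, etc.
        self.setup_calls.append(step)


class ToyTrainer:
    """Stand-in for the Trainer, mimicking the reported double invocation."""

    def fit(self, module):
        module.setup("fit")
        # ... training loop would run here ...

    def test(self, module):
        module.setup("test")  # invoked again, even right after fit()
        # ... test loop would run here ...


net = ToyModule()
trainer = ToyTrainer()
trainer.fit(net)
trainer.test(net)
print(net.setup_calls)  # prints ['fit', 'test'] -- setup ran twice
```

Running fit followed by test triggers the hook twice, which is exactly the waste described below.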
Contrary to what I expected, I saw from my debug output that `setup` was invoked twice. This is a waste of computational resources, and since I did the train-val split randomly, I no longer have access to the indices that were used in either step (with possibly other issues too, such as the label transformation re-ordering which column represents which label).
Currently, the train and val steps use the same `setup`, while the test step triggers another invocation of `setup`. I assume it is more common for the train-val and test steps of the trainer to share the same setup code than for setup to do something special only for test (and not val).
As Lightning works by giving sensible defaults while still allowing you to hack at anything you want, the logic should be that `setup` is only invoked once, with a way to specify a special test `setup` function.
Pitch
Have the trainer keep track of whether `setup()` has been invoked, so `setup()` can be skipped in `trainer.test(net)` if it was already invoked in `trainer.fit(net)`. This way the common use case needs less computation and is more in line with what is expected of the magic of Lightning. I'm not sure what would be the best approach for users to request a separate setup call for testing.
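The trainer-side tracking could be sketched like this in plain Python (hedged: `ToyTrainer`, `_setup_done`, and the `always_invoke_setup` flag are made-up names illustrating the proposal, not Lightning's actual API):

```python
class ToyTrainer:
    """Hypothetical trainer that remembers whether setup() already ran."""

    def __init__(self):
        self._setup_done = False

    def _call_setup(self, module, step):
        if self._setup_done:
            return  # skip the redundant second invocation
        module.setup(step)
        self._setup_done = True

    def fit(self, module):
        self._call_setup(module, "fit")
        # ... training loop would run here ...

    def test(self, module, always_invoke_setup=False):
        if always_invoke_setup:
            module.setup("test")  # opt-in to the old behavior
        else:
            self._call_setup(module, "test")
        # ... test loop would run here ...


class Recorder:
    """Stand-in module that records setup() invocations."""

    def __init__(self):
        self.calls = []

    def setup(self, step: str):
        self.calls.append(step)


net = Recorder()
trainer = ToyTrainer()
trainer.fit(net)
trainer.test(net)
print(net.calls)  # prints ['fit'] -- the test-time call was skipped
```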
Maybe something like `trainer.test(net, always_invoke_setup=True)`?
Alternatives
Include some custom logic that checks if data has been initialized:
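One such guard, sketched in plain Python inside the module's own `setup` (the `_data_ready` flag name is made up for illustration):

```python
class MyModule:
    """Module whose setup() guards against running its expensive work twice."""

    def __init__(self):
        self._data_ready = False
        self.n_expensive_runs = 0

    def setup(self, step: str):
        # Custom logic: bail out if the data splits were already built
        # by an earlier fit()/test() call.
        if self._data_ready:
            return
        self.n_expensive_runs += 1  # stands in for splits / label transforms
        self._data_ready = True


net = MyModule()
net.setup("fit")   # does the expensive work
net.setup("test")  # guard short-circuits
print(net.n_expensive_runs)  # prints 1
```

The downside is that every user has to hand-roll this boilerplate in their own module, which is what the pitch above aims to avoid.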
Additional context
Ideas formed by discussing this issue on the pytorch-lightning Slack in the questions channel. Thanks go to the people who replied.