
ReduceLROnPlateau does not work with multiple schedulers #1037

Closed
ghost opened this issue Mar 3, 2020 · 12 comments
Labels
bug (Something isn't working), help wanted (Open to be worked on)

Comments

@ghost

ghost commented Mar 3, 2020

🐛 Bug

PL seems to pull only one ReduceLROnPlateau scheduler and store it as reduce_lr_on_plateau_scheduler.

To Reproduce

One example would be adding two ReduceLROnPlateau schedulers to the GAN example.

Expected behavior

All ReduceLROnPlateau schedulers should be recognized as such.
A natural solution might be to check whether each scheduler in the loop that runs lr_scheduler.step() is an instance of ReduceLROnPlateau, instead of keeping a separate self.reduce_lr_on_plateau_scheduler; see the sketch below.
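
A minimal sketch of that suggestion, assuming a hypothetical lr_schedulers list of scheduler dicts and a monitor_val metric (these names are illustrative, not Lightning's actual internals):

from torch.optim.lr_scheduler import ReduceLROnPlateau

# Hypothetical stepping loop: each scheduler is checked individually, so any
# number of ReduceLROnPlateau instances receive the monitored metric.
for scheduler_cfg in lr_schedulers:
    scheduler = scheduler_cfg["scheduler"]
    if isinstance(scheduler, ReduceLROnPlateau):
        scheduler.step(monitor_val)  # e.g. the monitored val_loss
    else:
        scheduler.step()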

ghost added the bug (Something isn't working) and help wanted (Open to be worked on) labels on Mar 3, 2020
@williamFalcon
Contributor

williamFalcon commented Mar 3, 2020

Good catch!
Mind submitting a PR?

@SkafteNicki
Member

If PR #941 is merged, it will also solve this problem.

@Borda
Member

Borda commented Mar 4, 2020

@darwinkim @SkafteNicki please coordinate your efforts on #941 and #1039 so that none of your work gets wasted... ;] Maybe make #1039 a follow-up of #941?

@ghost
Author

ghost commented Mar 4, 2020

#941 seems to solve it, and it adds many related features.

@SkafteNicki
Member

With #941 now merged, this can be closed.

Borda closed this as completed on Mar 5, 2020
@Laksh1997

I'm not sure this is solved. I just tried two schedulers with one optimizer and it didn't work.

When I remove my linear scheduler, ReduceLROnPlateau starts to work again.

@SkafteNicki
Member

@Laksh1997 could you provide more info: which version of Lightning are you using, and what does your configure_optimizers look like?

@Laksh1997

@SkafteNicki

Latest stable (0.8.1)

Using ReduceLROnPlateau at the epoch level and a linear warmup at the step level:

import torch


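# Multiplies the base learning rate by min(step / num_warmup_steps, 1.0):
# a linear ramp from 0 up to the base LR over the first num_warmup_steps steps.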
class LinearWarmupScheduler(torch.optim.lr_scheduler.LambdaLR):
    def __init__(
        self, optimizer: torch.optim.Optimizer, num_warmup_steps: int = 1000,
    ):
        assert num_warmup_steps > 0
        super().__init__(optimizer, lambda step: min(step / num_warmup_steps, 1.0))

@SkafteNicki
Member

Okay, but what does the configure_optimizers method of your model look like?

@Laksh1997

@SkafteNicki

It returns the following:

[torch.optim.Adam(self.parameters(), lr=0.01)], [{"scheduler": ReduceLROnPlateau(...), "interval": "epoch"}, {"scheduler": LinearWarmupScheduler(...), "interval": "step"}]
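
For context, a self-contained configure_optimizers along these lines could look like the sketch below; lr, patience, and num_warmup_steps are placeholder values, and the "monitor" key follows the dict format used in the code later in this thread:

import torch
from torch.optim.lr_scheduler import ReduceLROnPlateau


def configure_optimizers(self):
    optimizer = torch.optim.Adam(self.parameters(), lr=0.01)
    schedulers = [
        # epoch-level plateau scheduler, watching the validation loss
        {"scheduler": ReduceLROnPlateau(optimizer, patience=5),
         "interval": "epoch", "monitor": "val_loss"},
        # step-level linear warmup (the LinearWarmupScheduler defined above)
        {"scheduler": LinearWarmupScheduler(optimizer, num_warmup_steps=1000),
         "interval": "step"},
    ]
    return [optimizer], schedulers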

@Laksh1997

Laksh1997 commented Jun 27, 2020

Here is my code more specifically:

import copy

import torch
import torch.nn as nn
from torch.optim import Optimizer

# Note: lr_schedulers, get_available_schedulers, get_optim_class,
# build_optim_param_groups, check_valid_param_groups and PL_EXPECTED_OUTPUT
# are project-specific helpers assumed to be importable from the surrounding codebase.


def build_scheduler_params(scheduler_name, param_set, optimizer: Optimizer):
    """Parses scheduler params"""
    pl_scheduler_params = {
        "monitor": param_set.pop("monitor", "val_loss"),
        "interval": param_set.pop("interval", "epoch"),
        "frequency": param_set.pop("frequency", 1),
    }
    if hasattr(torch.optim.lr_scheduler, scheduler_name):
        scheduler_class = getattr(torch.optim.lr_scheduler, scheduler_name)
        scheduler = scheduler_class(optimizer, **param_set)
    elif hasattr(lr_schedulers, scheduler_name):
        scheduler_class = getattr(lr_schedulers, scheduler_name)
        scheduler = scheduler_class(optimizer, **param_set)
    else:
        raise ValueError(
            f"Scheduler: {scheduler_name} not available. "
            f"Schedulers available are: {get_available_schedulers()}"
        )
    pl_scheduler_params["scheduler"] = scheduler
    return pl_scheduler_params


def build_optimizer(model: nn.Module, config) -> Optimizer:
    """Makes optimizer from model and config"""
    optim_kwargs = copy.deepcopy(config.optimizer_kwargs)
    optim_class = get_optim_class(config.optimizer)
    optim_param_groups = build_optim_param_groups(model, optim_kwargs)
    check_valid_param_groups(optim_param_groups, model)
    if isinstance(optim_kwargs, list):
        optim_kwargs = optim_kwargs[0]
    optimizer = optim_class(optim_param_groups, **optim_kwargs)
    return optimizer


def build_schedulers(optimizer: Optimizer, config):
    """Makes schedulers from optimizer and config"""
    schedulers = []
    schedulers_names = config.schedulers
    schedulers_kwargs = config.schedulers_kwargs

    if schedulers_names is not None:
        assert len(schedulers_names) == len(schedulers_kwargs), (
            f"Need to have as many schedulers as scheduler param sets! "
            f"Got {len(schedulers_names)} schedulers and "
            f"{len(schedulers_kwargs)} scheduler param sets!"
        )
        for scheduler_name, param_set in zip(schedulers_names, schedulers_kwargs):
            pl_scheduler_params = build_scheduler_params(
                scheduler_name, param_set, optimizer
            )
            schedulers.append(pl_scheduler_params)
    return schedulers


def configure_optimizers(model: nn.Module, config) -> PL_EXPECTED_OUTPUT:
    """Configures PL optimizers and schedulers"""
    optimizer = build_optimizer(model, config)
    schedulers = build_schedulers(optimizer, config)
    return [optimizer], schedulers
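
As a hypothetical usage sketch, a config exercising two schedulers through these helpers might look like this (the attribute names mirror what the functions read; the scheduler names and kwargs are placeholders, and LinearWarmupScheduler is assumed to live in the project's lr_schedulers module):

from types import SimpleNamespace

config = SimpleNamespace(
    optimizer="Adam",
    optimizer_kwargs={"lr": 1e-3},
    schedulers=["ReduceLROnPlateau", "LinearWarmupScheduler"],
    schedulers_kwargs=[
        {"monitor": "val_loss", "interval": "epoch", "patience": 5},
        {"interval": "step", "num_warmup_steps": 1000},
    ],
)
# optimizers, scheduler_dicts = configure_optimizers(model, config)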

@SkafteNicki
Member

@Laksh1997 I do not really see a problem in your code. After experimenting with the ReduceLROnPlateau scheduler in combination with other schedulers, I don't think there is a problem in Lightning either, as I can get multiple schedulers to work at the same time.

My best guess is that this is specific to your model/data. When you have two schedulers and one is changing the learning rate every step, it is very possible that you never reach a plateau, so the ReduceLROnPlateau scheduler never kicks in.
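
If it is unclear whether the plateau scheduler ever triggers, one way to check is to log the optimizer's learning rate together with the scheduler's internal counters after each validation epoch (best and num_bad_epochs are attributes of torch.optim.lr_scheduler.ReduceLROnPlateau; optimizer, epoch, and plateau_scheduler are whatever objects your setup holds):

# Run after each validation epoch, e.g. from a callback or hook.
for i, group in enumerate(optimizer.param_groups):
    print(f"epoch {epoch}: param_group {i} lr={group['lr']:.2e}")
print(f"plateau: best={plateau_scheduler.best}, num_bad_epochs={plateau_scheduler.num_bad_epochs}")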
