
SummaryWriter add_hparams should support adding new hyperparameters #39250

Open
awwong1 opened this issue May 29, 2020 · 7 comments
Labels
enhancement - Not as big of a feature, but technically not a bug. Should be easy to fix
oncall: visualization - Related to visualization in PyTorch, e.g., tensorboard

Comments

awwong1 commented May 29, 2020

🐛 Bug

When calling SummaryWriter().add_hparams with new hyperparameters, keys that were not present in the first call do not appear in the HParams dashboard output.

To Reproduce

#!/usr/bin/env python3

from torch.utils.tensorboard import SummaryWriter

with SummaryWriter() as w:
    w.add_hparams({"key_A": 10}, {})
with SummaryWriter() as w:
    w.add_hparams({"key_B": 10}, {})

When viewing the TensorBoard output at http://localhost:6006/#hparams:

Trial_ID                                    key_A
May29_09-27-46_mbp13/1590766066.254924      10.000
May29_09-27-46_mbp13/1590766066.2567558

Expected behavior

I would expect key_B to also appear in the output, with a blank value for the first row.
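
For illustration, a mock-up of the table I would expect (not actual output; the column layout and blank cells are mine):

Trial_ID                                    key_A     key_B
May29_09-27-46_mbp13/1590766066.254924      10.000
May29_09-27-46_mbp13/1590766066.2567558               10.000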

Environment

PyTorch version: 1.4.0
Is debug build: No
CUDA used to build PyTorch: 10.1

OS: Debian GNU/Linux 10 (buster)
GCC version: (Debian 8.3.0-6) 8.3.0
CMake version: Could not collect

Python version: 3.7
Is CUDA available: No
CUDA runtime version: No CUDA
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA

Versions of relevant libraries:
[pip3] numpy==1.17.4
[pip3] torch==1.4.0
[pip3] torchvision==0.5.0
[conda] Could not collect


gchanan added the oncall: visualization and enhancement labels on Jun 1, 2020

AvivWn commented Sep 25, 2020

I am experiencing this exact bug. Any news on it?
It is quite annoying to have to start a new logger whenever a new parameter is added.

@Sushobhan04

Any updates on this one? I am facing the same issue, and it seems like it has not been fixed in a year.

@AvivWn @awwong1 any workaround that you found? I don't want to move to another logging tool, but this TensorBoard bug has been very annoying.
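
One workaround that follows from the behavior above (a sketch, not an official fix: the key registry, the padding helper, and the empty-string filler are all assumptions) is to pass the union of all keys on every call:

from torch.utils.tensorboard import SummaryWriter

# Hypothetical registry of every hyperparameter key any run may use.
ALL_HPARAM_KEYS = ["key_A", "key_B"]

def add_hparams_padded(writer, hparams, metrics, fill=""):
    # Pad missing keys so every run reports the same key set; the
    # empty-string filler is an assumption and may render oddly in TB.
    padded = {k: hparams.get(k, fill) for k in ALL_HPARAM_KEYS}
    writer.add_hparams(padded, metrics)

with SummaryWriter() as w:
    add_hparams_padded(w, {"key_A": 10}, {"loss": 1.0})
with SummaryWriter() as w:
    add_hparams_padded(w, {"key_B": 10}, {"loss": 2.0})

The idea is simply that every run then reports an identical key set, so no call introduces keys that are missing from the schema of the first-loaded run.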

@sriveravi

The bug still persists today.

tensorboard 2.5.0
pytorch 1.8.0


LarsHill commented Apr 9, 2022

The issue still persists and is quite annoying when comparing multiple runs with different hyperparameter setups. Is there any plan to fix this, or is there a known workaround?

@sriveravi

I believe the issue has been fixed. In the left pane of the HParams dashboard, you need to scroll down and check the appropriate boxes. It defaults to a smaller subset of columns when the parameter sets change.


phisad commented Apr 20, 2022

This still persists for me as well.

pytorch-lightning            1.5.9
torch                        1.10.1+cu113


ghost commented Nov 6, 2022

I am annoyed by this bug as well and wanted to understand it better.
I ran a couple of tests on tensorboard 2.10.1, pytorch 1.12.1, python 3.10.
Note: I wrote and edited this comment as I tested. Sorry about that; please jump to the end for conclusions.

Each time, I:

  • rm -rf runs
  • run the Python script
  • start TensorBoard with tensorboard --logdir runs (I stop TensorBoard before the next test).

Here are some results:

Case 1

from torch.utils.tensorboard import SummaryWriter

with SummaryWriter(log_dir="runs/ABC", filename_suffix="A") as w:
    w.add_hparams({"key_A": 10}, {"loss":1})
with SummaryWriter(log_dir="runs/ABC", filename_suffix="B") as w:
    w.add_hparams({"key_B": 20}, {"loss":2})
with SummaryWriter(log_dir="runs/ABC", filename_suffix="C") as w:
    w.add_hparams({"key_C": 30}, {"loss":3})

[screenshot: the table shows only key_B]
First surprise: I would have expected key_A, not key_B.
I don't understand why B comes up. That's weird; let's run it again.
[screenshot: a different key shows up this time]
Damn it.
Once more:
[screenshot: again a different key]
So, I'm already lost.
I confirm that the other keys are stored anyway by deleting folders and refreshing TensorBoard. From this last run:
Delete the A and B folders:
[screenshot: key_C now shows up]
Delete the A and C folders:
[screenshot: key_B now shows up]

The data is there but doesn't show up. That points to a front-end issue, perhaps?
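
For anyone who wants to check the raw event files directly, something like this should work (a sketch; I'm assuming the EventAccumulator and the hparams plugin protos shipped in the tensorboard package, and the subdirectory path is a placeholder for one of the timestamped run folders that add_hparams creates under the log dir):

from tensorboard.backend.event_processing import event_accumulator
from tensorboard.plugins.hparams import plugin_data_pb2

# Point this at one of the per-call subfolders under runs/ABC.
ea = event_accumulator.EventAccumulator("runs/ABC/<run_subdir>")
ea.Reload()

# Summaries written by add_hparams carry the "hparams" plugin name.
for tag, content in ea.PluginTagToContent("hparams").items():
    data = plugin_data_pb2.HParamsPluginData.FromString(content)
    if data.HasField("session_start_info"):
        print(tag, dict(data.session_start_info.hparams))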
Let's do some other tests.

Case 2

from torch.utils.tensorboard import SummaryWriter

with SummaryWriter(log_dir="runs/ABC", filename_suffix="A") as w:
    w.add_hparams({"key_A": 10, "key_B": 10}, {"loss":1})
with SummaryWriter(log_dir="runs/ABC", filename_suffix="B") as w:
    w.add_hparams({"key_B": 20}, {"loss":2})
with SummaryWriter(log_dir="runs/ABC", filename_suffix="C") as w:
    w.add_hparams({"key_C": 30}, {"loss":3})

[screenshot: hparams table]
Another run:
[screenshot: hparams table]

Case 3

from torch.utils.tensorboard import SummaryWriter

with SummaryWriter(log_dir="runs/ABC", filename_suffix="A") as w:
    w.add_hparams({"key_A": 10, "key_B": 10, "key_C":10}, {"loss":1})
with SummaryWriter(log_dir="runs/ABC", filename_suffix="B") as w:
    w.add_hparams({"key_B": 20}, {"loss":2})
with SummaryWriter(log_dir="runs/ABC", filename_suffix="C") as w:
    w.add_hparams({"key_C": 30}, {"loss":3})

[screenshot: hparams table]
Another run...
[screenshot: hparams table]
And another one...
[screenshot: hparams table]

Conclusion

I don't get it at all!
But I hope these few tests may give more ideas to someone maintaining TensorBoard...

EDIT: Follow-up

Looking at TB with high verbosity, I notice that TB cyclically reloads folders, but not always in the same order.
I suspect this non-deterministic order might explain why the same script can show different results. My assumption is that the first folder loaded during the very first load defines the format of the table, and thus which keys will be shown.
[screenshot: verbose TB logs showing folders reloaded in varying order]

EDIT 2: I think I get it

This last one was easy to test; I should have started there.
First, start TensorBoard.
Second, run one experiment:

from torch.utils.tensorboard import SummaryWriter

with SummaryWriter(log_dir="runs/ABC", filename_suffix="A") as w:
    w.add_hparams({"key_A": 10, "key_B": 10}, {"loss":1})

This is necessarily the first file loaded by TB, and thus defines the table schema.
If we then run the following, without turning TB off:

with SummaryWriter(log_dir="runs/ABC", filename_suffix="B") as w:
    w.add_hparams({"key_B": 20}, {"loss":2})
with SummaryWriter(log_dir="runs/ABC", filename_suffix="C") as w:
    w.add_hparams({"key_C": 30}, {"loss":3})

We see:
[screenshot: the table still shows only key_A and key_B]

So it would appear that:

  1. When you start TB before generating experiments, the first experiment defines the schema of the table, probably for as long as TB is running.
  2. When you start TB and open a folder of existing experiments, the schema is defined by the first experiment loaded, which seems to be an unpredictable result of the Rust data loader reading files in parallel.
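
If conclusion 1 holds, one mitigation (my assumption, and per conclusion 2 it would not survive a TB restart) is to write a dummy "schema" run containing every key before the real experiments, while TB is already running:

from torch.utils.tensorboard import SummaryWriter

# Dummy run written first so that, if conclusion 1 is right, it defines
# the table's columns. Key names and the zero fillers are placeholders.
with SummaryWriter(log_dir="runs/ABC", filename_suffix="schema") as w:
    w.add_hparams({"key_A": 0, "key_B": 0, "key_C": 0}, {"loss": 0.0})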
