Commit f209a5a

README update
1 parent 5cb1825 commit f209a5a

1 file changed, with 6 additions and 0 deletions.

README.md

@@ -365,6 +365,12 @@ Second, a **transformer** is trained to sample from the codebook
 The first and the second stages can be trained on the same or separate datasets as long as the process of spectrogram extraction is the same.
 
 ## Training a Spectrogram Codebook
+
+> **Erratum**: during training with the default config, the code will silently fail to load the checkpoint of
+> the perceptual loss. This leads to the results which are as good as without the perceptual loss.
+> For this reason, one may try turning it off completely: `perceptual_weight=0.0` and benefit from faster
+> iterations. For details please refer to [Issue#13](https://github.com/v-iashin/SpecVQGAN/issues/13)
+
 To train a spectrogram codebook, we tried two datasets: VAS and VGGSound.
 We run our experiments on a relatively expensive hardware setup with four _40GB NVidia A100_ but the models
 can also be trained on one _12GB NVidia 2080Ti_ with smaller batch size.
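
The erratum describes a checkpoint that fails to load without any error. One generic way to surface that kind of failure is to compare the parameter names a model expects with those stored in the checkpoint and raise on any mismatch. The sketch below is plain Python and is not taken from SpecVQGAN: the function names and the example keys are hypothetical, and the erratum does not say whether a key mismatch is the actual cause (see Issue#13 for that).

```python
def find_key_mismatches(model_keys, checkpoint_keys):
    """Return (missing, unexpected) parameter names, each sorted.

    missing:    names the model expects but the checkpoint lacks.
    unexpected: names in the checkpoint that the model does not use.
    """
    model_keys = set(model_keys)
    checkpoint_keys = set(checkpoint_keys)
    missing = sorted(model_keys - checkpoint_keys)
    unexpected = sorted(checkpoint_keys - model_keys)
    return missing, unexpected


def assert_checkpoint_matches(model_keys, checkpoint_keys):
    """Raise RuntimeError instead of continuing with unloaded weights."""
    missing, unexpected = find_key_mismatches(model_keys, checkpoint_keys)
    if missing or unexpected:
        raise RuntimeError(
            f"checkpoint mismatch: missing={missing}, unexpected={unexpected}"
        )


# Hypothetical parameter names, for illustration only.
expected = ["net.conv1.weight", "net.conv1.bias"]
stored = ["conv1.weight", "conv1.bias"]  # e.g. saved without the "net." prefix
missing, unexpected = find_key_mismatches(expected, stored)
print("missing:", missing)
print("unexpected:", unexpected)
```

In PyTorch, `load_state_dict(..., strict=False)` returns the same two lists as `missing_keys` and `unexpected_keys`, so a check like this can be applied to its return value.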
