Skip to content

Does Training with SSL objectives support? #1152

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
hui-li-xf opened this issue Apr 28, 2025 · 0 comments
Open

Does Training with SSL objectives support? #1152

hui-li-xf opened this issue Apr 28, 2025 · 0 comments
Labels
bug Something isn't working

Comments

@hui-li-xf
Copy link

Training with SSL objectives
Monolingual data can be incorporated into training using one SSL objectives by specifying one of the following values to the ssl_task training configuration option:

mono_dae: mBART-style denoising objective
mono_lm: left-to-right language model objective on the decoder side (dummy encoder input)
mono_mixed_task: monolingual examples probabilistically split between the above (p=0.5)
In order to use SSL objectives for training, binarized monolingual data needs to be provided by specifying the mono_num_shards and mono_data_prefix options in the dataset config. Note that we found the first of these options (mono_dae) helpful for smaller models, and in particular for training back-translation models, but SSL objectives did not provide additional benefits for the full model when applied to the same monolingual data that had been used for back-translation.

@hui-li-xf hui-li-xf added the bug Something isn't working label Apr 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant