Skip to content

Commit 0a82e1b

Browse files
author
Alexis Conneau
committed
readme change
1 parent 06222b0 commit 0a82e1b

File tree

3 files changed

+1
-1
lines changed

3 files changed

+1
-1
lines changed

Diff for: README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33

44
SentAugment is a data augmentation technique for semi-supervised learning in NLP. It uses state-of-the-art sentence embeddings to structure the information of a very large bank of sentences. The large-scale sentence embedding space is then used to retrieve in-domain unannotated sentences for any language understanding task such that semi-supervised learning techniques like self-training and knowledge-distillation can be leveraged. This means you do not need to assume the presence of unannotated sentences to use semi-supervised learning techniques. In our paper [Self-training Improves Pre-training for Natural Language Understanding](https://arxiv.org/abs/2010.02194), we show that SentAugment provides strong gains on multiple language understanding tasks when used in combination with self-training or knowledge distillation.
55

6-
![Model](sentaugment_figure.pdf)
6+
![Model](sentaugment_figure.png)
77

88
## Dependencies
99

Diff for: sentaugment_figure.pdf

-1.44 MB
Binary file not shown.

Diff for: sentaugment_figure.png

203 KB
Loading

0 commit comments

Comments
 (0)