amralaa-MSFT
diff --git a/Diff for: ‎README.md
+1-1 b/Diff for: ‎README.md
+1-1
diff --git a/Diff for: ‎sentaugment_figure.pdf
-1.44 MB b/Diff for: ‎sentaugment_figure.pdf
-1.44 MB
diff --git a/Diff for: ‎sentaugment_figure.png
203 KB b/Diff for: ‎sentaugment_figure.png
203 KB
@@ -3,7 +3,7 @@
 
 SentAugment is a data augmentation technique for semi-supervised learning in NLP. It uses state-of-the-art sentence embeddings to structure the information of a very large bank of sentences. The large-scale sentence embedding space is then used to retrieve in-domain unannotated sentences for any language understanding task such that semi-supervised learning techniques like self-training and knowledge-distillation can be leveraged. This means you do not need to assume the presence of unannotated sentences to use semi-supervised learning techniques. In our paper [Self-training Improves Pre-training for Natural Language Understanding](https://arxiv.org/abs/2010.02194), we show that SentAugment provides strong gains on multiple language understanding tasks when used in combination with self-training or knowledge distillation.
 
-![Model](sentaugment_figure.pdf)
+![Model](sentaugment_figure.png)
 
 ## Dependencies