|
| 1 | +Warming up PyWSD (takes ~10 secs)... Loading the LM will be faster if you build a binary file. |
| 2 | +Reading /remote-home/***/Code/NLP/EPDA/lms/offense.arpa |
| 3 | +----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100 |
| 4 | +**************************************************************************************************** |
| 5 | +Simple Test |
| 6 | +['thusly cute the little baby is crying!', 'So cute the little baby is crying!'] |
| 7 | +LR= 5e-05 |
| 8 | +Start to read: new_data/sentiment/train_1.txt |
| 9 | +Load Over, Find: 153 datas. |
| 10 | +Start to read: new_data/sentiment/test.txt |
| 11 | +Load Over, Find: 3027 datas. |
| 12 | +took 5.79877781867981 secs. |
| 13 | +Some weights of the model checkpoint at bert-base-uncased were not used when initializing BertForSequenceClassification: ['cls.predictions.transform.LayerNorm.bias', 'cls.predictions.transform.dense.bias', 'cls.predictions.bias', 'cls.predictions.decoder.weight', 'cls.predictions.transform.dense.weight', 'cls.seq_relationship.bias', 'cls.seq_relationship.weight', 'cls.predictions.transform.LayerNorm.weight'] |
| 14 | +- This IS expected if you are initializing BertForSequenceClassification from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model). |
| 15 | +- This IS NOT expected if you are initializing BertForSequenceClassification from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model). |
| 16 | +Some weights of BertForSequenceClassification were not initialized from the model checkpoint at bert-base-uncased and are newly initialized: ['classifier.weight', 'classifier.bias'] |
| 17 | +You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference. |
| 18 | +Bert Tokenizer |
| 19 | +Update EPOCHES to 16 |
| 20 | +Start to update Dataset |
| 21 | +?? 2295 2295 2295 |
| 22 | +Before 153 |
| 23 | +Start Update Dataset, Find 153 datas. |
| 24 | +Update Dataset Finish, Find 2295 datas. |
| 25 | +< Update Done. |
| 26 | +After 2295 |
| 27 | +start to extract something |
| 28 | +torch.Size([2295, 3072]) 2295 |
| 29 | +Error Rate EPida0.0153 EDA0.0305 CEM0.0065 REM0.0675 |
| 30 | +Distance EPida0.0078 EDA0.0054 CEM0.0025 REM0.0121 |
| 31 | +> Done. Model Training |