“roberta-language-model”的相关标签问题

0 投票

1 回答

25 浏览

python - 我在哪个程序/界面中运行以下代码？

对于机器学习/NLP 项目，我在 roBERTa 上查看来自 github 的一些代码。我想看看是否能得到相同的结果，然后修改程序以适应我自己的数据。

但是，我不知道如何/在哪里/使用什么程序来运行以下代码：

我尝试了多个程序/终端/jupyter，但似乎无法找出如何正确运行此代码。有人知道如何运行它吗？（我知道我必须更改第一行等中的目录，但现在只会出错。）

2020-12-10T15:24:12.897

0 投票

0 回答

283 浏览

deep-learning - 使用带有 RoBERTa 的 WordPiece 标记化

据我了解，Huggingface库实现的 RoBERTa 模型使用了BPE分词器。这是文档的链接：

RoBERTa 具有与 BERT 相同的架构，但使用字节级 BPE 作为标记器（与 GPT-2 相同）并使用不同的预训练方案。

但是，我有一个基于WordPiece标记化的自定义标记器，并且我使用了BertTokenizer。

因为我的自定义标记器与我的任务更相关，所以我不喜欢使用 BPE。

当我使用我的自定义标记器从头开始预训练 RoBERTa (RobertaForMaskedLM) 时，MLM 任务的损失比 BPE 的损失要好得多。然而，在微调方面，模型（RobertaForSequenceClassification）表现不佳。我几乎可以肯定问题不在于标记器。我想知道 RobertaForSequenceClassification 的拥抱脸库是否与我的标记器不兼容。

有关微调的详细信息：

任务：具有不平衡标签的多标签分类。

时代：20

损失：BCEWithLogitsLoss()

优化器：Adam，weight_decay_rate：0.01，lr：2e-5，correct_bias：True

F1 和 AUC 非常低，因为标签的输出概率与实际标签不符（即使阈值非常低），这意味着模型无法学习任何东西。

*

注意：使用 BPE 分词器进行预训练和微调的 RoBERTa 比使用自定义分词器进行预训练和微调的 RoBERTa 表现更好，尽管使用自定义分词器的 MLM 的损失要好于 BPE。

deep-learning bert-language-model huggingface-transformers transfer-learning roberta-language-model

2020-12-11T09:12:06.630

0 投票

1 回答

479 浏览

python - Why is my tensorflow Roberta Model unable to train/finetune?

We are trying to finetune / train our RoBERTa model on our own train data. The project is exactly the same as the SemEval-2020 task B on choosing the right reason out of 3 on why a sentence is against common sense. For the past two days we have been struggling with errors, mainly when trying to train/finetune our model. The code we have used comes from https://huggingface.co/transformers/model_doc/roberta.html#robertamodel . Although we have tried to alter this code in multiple ways we can't seem to really start training our model. Our main problem is the data we try to train the model on. We have tried to immediately insert a numpy array or pandas dataframe, but to no avail. Finally we tried to use a tfds. We used the following code, which results in the error code which can be found below.

I install the following packages:

#xA;

I import my train and test sets as csv files, where after the data is cleaned and concatenated.

#xA;

I make a tfds of the train and test data:

#xA;

These data sets are then used to train the model through the following code:

#xA;

This results in the following error:

ValueError: The training dataset must have an asserted cardinality

If anybody has any advice or can point us in the right direction on how to train/finetune our model we would be very grateful!

python tensorflow machine-learning roberta-language-model