python - 如何评估训练有素的 spaCy 版本 3 模型？

Question

我想使用以下代码使用内置 Scorer 函数评估我训练有素的 spaCy 模型：

def evaluate(ner_model, examples):
    scorer = Scorer()
    for input_, annot in examples:
        text = nlp.make_doc(input_)
        gold = Example.from_dict(text, annot)
        pred_value = ner_model(input_)
        scorer.score(gold)
    return scorer.scores

examples = [('Brief discussion about instument replcement and Product ...confirmation', {'entities': [(48, 55, 'PRODUCT')]})('Met with special chem lead. Did not yet move assays from immulite to produc. Follow up with PhD tomorrow.', {'entities': [(57, 68, 'PRODUCT'), (45, 51, 'DATE'), (97, 105, 'DATE')]}), ('Discuss new products for ...', {'entities': [(36, 51, 'PRODUCT')]})]

ner_model = spacy.load(r'D:\temp\model') # for spaCy's pretrained use 'en_core_web_sm'
results = evaluate(ner_model, examples)

当我运行该函数时，我收到以下错误消息：

TypeError: [E978] Tokenizer.score 方法采用示例对象列表，但得到：<class 'spacy.training.example.Example'>

我已经尝试输入像 {"entities": annot} 和其他一些版本的注释。我查了谷歌，但每篇文章似乎都与 spaCy 的 2.xx 版有关。

我究竟做错了什么？如何使用 spacy Score() 计算召回率、准确率和 F1 分数？

score 2 · Accepted Answer

spaCy 3.0 ( https://spacy.io/api/scorer )仍然支持 score 方法，我终于用下面的代码让它工作了：

nlp = spacy.load(path_to_model)
examples = []
scorer = Scorer()
for text, annotations in TEST_REVISION_DATA:
    doc = nlp.make_doc(text)
    example = Example.from_dict(doc, annotations)
    example.predicted = nlp(str(example.predicted))
    examples.append(example)
scorer.score(examples)

我没有发现命令行工具易于应用（我正在为测试数据加载而苦苦挣扎），并且根据我的需要，代码版本也更方便。这样我就可以轻松地将结果转换为视觉效果。

score 1 · Accepted Answer

您应该使用评估命令行模式。

spacy evaluate my-model/ test.spacy

test.spacy你的测试数据在哪里。

另外，关于这个错误：

TypeError: [E978] The Tokenizer.score method takes a list of Example objects, but got: <class 'spacy.training.example.Example'>

正如此错误所示，您应该传递一个示例列表，但您只传递了一个示例。你应该使用scorer.score([gold])，基本上。

python - 如何评估训练有素的 spaCy 版本 3 模型？

2 回答 2

Related

Reference