I'm running into trouble fine-tuning the decomposable-attention-elmo model. I was able to download the model with: wget https://s3-us-west-2.amazonaws.com/allennlp/models/decomposable-attention-elmo-2018.02.19.tar.gz
I'm trying to load the model and then fine-tune it on my data using the AllenNLP train command-line command.
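The command I'm running looks roughly like this (the config path, serialization directory, and package name are placeholders for my actual ones):

allennlp train my_config.jsonnet -s ./finetune_output --include-package my_library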
I also created a custom dataset reader, similar to SNLIDatasetReader, and it seems to work well.
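For context, my reader yields the same premise/hypothesis/label fields that the SNLI reader does, so the archived model can consume the instances. A minimal sketch of it against the AllenNLP 1.x API (the class name, the registered name "custom_reader", and the tab-separated file format are simplified stand-ins for my actual code):

from typing import Dict, Iterable, Optional

from allennlp.data.dataset_readers import DatasetReader
from allennlp.data.fields import TextField, LabelField
from allennlp.data.instance import Instance
from allennlp.data.token_indexers import TokenIndexer, SingleIdTokenIndexer
from allennlp.data.tokenizers import Tokenizer, SpacyTokenizer


@DatasetReader.register("custom_reader")
class CustomReader(DatasetReader):
    def __init__(
        self,
        tokenizer: Optional[Tokenizer] = None,
        token_indexers: Optional[Dict[str, TokenIndexer]] = None,
        **kwargs,
    ) -> None:
        super().__init__(**kwargs)
        self._tokenizer = tokenizer or SpacyTokenizer()
        self._token_indexers = token_indexers or {"tokens": SingleIdTokenIndexer()}

    def _read(self, file_path: str) -> Iterable[Instance]:
        # Assumed file format: one example per line, tab-separated premise, hypothesis, label.
        with open(file_path, "r") as data_file:
            for line in data_file:
                premise, hypothesis, label = line.rstrip("\n").split("\t")
                yield self.text_to_instance(premise, hypothesis, label)

    def text_to_instance(self, premise: str, hypothesis: str, label: Optional[str] = None) -> Instance:
        # Field names mirror the SNLI reader so the decomposable-attention model's
        # forward() receives "premise", "hypothesis", and "label".
        fields = {
            "premise": TextField(self._tokenizer.tokenize(premise), self._token_indexers),
            "hypothesis": TextField(self._tokenizer.tokenize(hypothesis), self._token_indexers),
        }
        if label is not None:
            fields["label"] = LabelField(label)
        return Instance(fields)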
I created a .jsonnet file, similar to the one here, but I can't get it to work.
When I use this version:
// Configuration for a textual entailment model based on:
// Parikh, Ankur P. et al. “A Decomposable Attention Model for Natural Language Inference.” EMNLP (2016).
{
    "dataset_reader": {
        "type": "custom_reader",
        "token_indexers": {
            "elmo": {
                "type": "elmo_characters"
            }
        },
        "tokenizer": {
            "end_tokens": ["@@NULL@@"]
        }
    },
    "train_data_path": "examples_train_",
    "validation_data_path": "examples_val_",
    "model": {
        "type": "from_archive",
        "archive_file": "decomposable-attention-elmo-2018.02.19.tar.gz",
        "text_field_embedder": {
            "token_embedders": {
                "elmo": {
                    "type": "elmo_token_embedder",
                    "do_layer_norm": false,
                    "dropout": 0.2
                }
            }
        },
    },
    "data_loader": {
        "batch_sampler": {
            "type": "bucket",
            "batch_size": 64
        }
    },
    "trainer": {
        "num_epochs": 140,
        "patience": 20,
        "grad_clipping": 5.0,
        "validation_metric": "+accuracy",
        "optimizer": {
            "type": "adagrad"
        }
    }
}
I get this error:
File "lib/python3.6/site-packages/allennlp/common/params.py", line 423, in assert_empty
"Extra parameters passed to {}: {}".format(class_name, self.params)
allennlp.common.checks.ConfigurationError: Extra parameters passed to Model: {'text_field_embedder': {'token_embedders': {'elmo': {'do_layer_norm': False, 'dropout': 0.2, 'type': 'elmo_token_embedder'}}}}
Then, when I take out that text_field_embedder section and use this version:
// Configuration for a textual entailment model based on:
// Parikh, Ankur P. et al. “A Decomposable Attention Model for Natural Language Inference.” EMNLP (2016).
{
    "dataset_reader": {
        "type": "fake_news",
        "token_indexers": {
            "elmo": {
                "type": "elmo_characters"
            }
        },
        "tokenizer": {
            "end_tokens": ["@@NULL@@"]
        }
    },
    "train_data_path": "examples_train_",
    "validation_data_path": "examples_val_",
    "model": {
        "type": "from_archive",
        "archive_file": "decomposable-attention-elmo-2018.02.19.tar.gz",
    },
    "data_loader": {
        "batch_sampler": {
            "type": "bucket",
            "batch_size": 64
        }
    },
    "trainer": {
        "num_epochs": 140,
        "patience": 20,
        "grad_clipping": 5.0,
        "validation_metric": "+accuracy",
        "optimizer": {
            "type": "adagrad"
        }
    }
}
I get this error:
raise ConfigurationError(msg)
allennlp.common.checks.ConfigurationError: key "token_embedders" is required at location "model.text_field_embedder."
These two errors seem contradictory, and I'm not sure how to go about this fine-tuning.