I am trying to get started by loading the pretrained .bin file from the Google word2vec site (freebase-vectors-skipgram1000.bin.gz) into the gensim implementation of word2vec. The model loads fine,
using:
from gensim.models import word2vec
model = word2vec.Word2Vec.load_word2vec_format('...../free....-en.bin', binary=True)
and it creates a model object:
>>> print model
<gensim.models.word2vec.Word2Vec object at 0x105d87f50>
But when I run the most_similar function, it cannot find the words in the vocabulary. The error output is below.
Any ideas where I am going wrong?
>>> model.most_similar(['girl', 'father'], ['boy'], topn=3)
2013-10-11 10:22:00,562 : WARNING : word 'girl' not in vocabulary; ignoring it
2013-10-11 10:22:00,562 : WARNING : word 'father' not in vocabulary; ignoring it
2013-10-11 10:22:00,563 : WARNING : word 'boy' not in vocabulary; ignoring it
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/....../anaconda/python.app/Contents/lib/python2.7/site-packages/gensim-0.8.7/py2.7.egg/gensim/models/word2vec.py", line 312, in most_similar
    raise ValueError("cannot compute similarity with no input")
ValueError: cannot compute similarity with no input
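
For reference, one way to narrow this down is to check the query words against the loaded vocabulary before calling most_similar. The following is only a minimal sketch: `split_by_vocab` is a hypothetical helper, the toy dictionary stands in for the real vocabulary, and it assumes the model exposes its vocabulary as a dict-like attribute (`model.vocab` in gensim releases of this era; other versions may differ).

```python
def split_by_vocab(words, vocab):
    """Split words into (known, missing) lists, preserving input order.

    `vocab` can be any container that supports the `in` operator,
    for example a dict keyed by word.
    """
    known = [w for w in words if w in vocab]
    missing = [w for w in words if w not in vocab]
    return known, missing


def sample_keys(vocab, n=5):
    """Return the first n vocabulary keys in sorted order, to show how entries are spelled."""
    return sorted(vocab)[:n]


# Toy stand-in vocabulary for illustration only (hypothetical data).
# With a real model, pass its vocabulary mapping instead
# (assumed here to be `model.vocab`).
toy_vocab = {"king": 0, "queen": 1, "man": 2}

known, missing = split_by_vocab(["girl", "king"], toy_vocab)
first_keys = sample_keys(toy_vocab, n=2)
```

If every query word ends up in `missing`, looking at a few entries from `sample_keys` may show whether the file stores its keys in a different form than the plain words being queried.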