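I'm trying to understand the output of one-hot encoding (the Keras `one_hot` function).
I'm looking at the output of the following example: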
from keras.preprocessing.text import one_hot
reviews =[
'Never coming back!',
'horrible service',
'rude waitress',
'cold food',
'horrible food!',
'awesome',
'awesome services!',
'rocks',
'poor work',
'couldn\'t have done better'
]
Vocab_size = 50
encoded_reviews = [one_hot(d,Vocab_size) for d in reviews]
print(f'encoded reviews: {encoded_reviews}')
I get the following output:
[[5, 7, 36], [17, 12], [39, 33], [27, 2], [17, 2], [33], [33, 10], [35], [3, 21], [14, 14, 44, 24]]
- Why is the output vector for the last sentence ('couldn\'t have done better') [14, 14, 44, 24]?
- Why do the words couldn\'t and have get the same value, 14?
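
For context, here is a minimal sketch of what I believe `one_hot` does internally. As far as I understand, it does not build one-hot vectors at all: it lowercases the text, strips punctuation, splits on whitespace, and maps each word to `hash(word) % (n - 1) + 1`. The helper names below (`tokenize`, `hashed_indices`) are hypothetical, and I use an md5-based hash instead of Python's built-in `hash` so the result is repeatable across runs (the built-in string hash is salted per process). Because the filter list does not include the apostrophe, `couldn't` should stay a single token, which would explain why the last review has four values.

```python
import hashlib

# Punctuation that is replaced by spaces before splitting. This is assumed to
# match the Keras default filter string; note that it has no apostrophe.
FILTERS = '!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n'


def tokenize(text):
    """Lowercase, replace filtered characters with spaces, split on whitespace."""
    table = str.maketrans({c: ' ' for c in FILTERS})
    return text.lower().translate(table).split()


def hashed_indices(text, n):
    """Map each word to an index in [1, n - 1] using the hashing trick."""
    def stable_hash(word):
        # md5 gives the same integer on every run, unlike the built-in hash().
        return int(hashlib.md5(word.encode()).hexdigest(), 16)

    return [stable_hash(w) % (n - 1) + 1 for w in tokenize(text)]


if __name__ == "__main__":
    print(tokenize("couldn't have done better"))
    print(hashed_indices("couldn't have done better", 50))
```

With `n = 50` there are only 49 possible indices, so two different words can land on the same value. If this sketch matches the real behaviour, `couldn't` and `have` both getting 14 would simply be a hash collision, and a larger vocabulary size would make such collisions less likely but not impossible.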