2

我正在尝试使用 Keras 对多类分类进行网格搜索。下面是一段代码:

数据的一些属性如下:

y_
array(['fast', 'immobile', 'immobile', ..., 'slow',
       'immobile', 'slow'],
      dtype='<U17')

y_onehot = pd.get_dummies(y_).values

y_onehot
array([[1, 0, 0],
       [0, 0, 1],
       [0, 0, 1],
    ...
       [0, 1, 0],
       [0, 0, 1],
       [0, 1, 0]], dtype=uint8)

#Do train-test split

y_train.shape
(1904,)

y_train_onehot.shape
(1904, 3)

还有模特...

# Function to create model, required for KerasClassifier
def create_model(optimizer='rmsprop', init='glorot_uniform'):
    # create model
    model = Sequential()
    model.add(Dense(2048, input_dim=X_train.shape[1], kernel_initializer=init, activation='relu'))
    model.add(Dense(512, kernel_initializer=init, activation='relu'))
    model.add(Dense(y_train_onehot.shape[1], kernel_initializer=init, activation='softmax'))
    # Compile model
    model.compile(loss='categorical_crossentropy', optimizer=optimizer, metrics=['accuracy'])
    return model

# create model
model = KerasClassifier(build_fn=create_model, verbose=0)

# grid search epochs, batch size and optimizer
optimizers = ['rmsprop', 'adam']
init = ['glorot_uniform', 'normal', 'uniform']
epochs = [50, 100, 150]
batches = [5, 10, 20]

param_grid = dict(optimizer=optimizers, epochs=epochs, batch_size=batches, init=init)
grid = GridSearchCV(estimator=model, param_grid=param_grid, scoring='accuracy')
grid_result = grid.fit(X_train, y_train_onehot)

这是错误:

--> grid_result = grid.fit(X_train, y_train_onehot)
ValueError: Classification metrics can't handle a mix of multilabel-indicator and multiclass targets

该代码用于二进制模型,但我希望针对多类数据集对其进行修改。请协助。谢谢!

4

1 回答 1

2

错误在 softmax 层。

我想你的意思y_train_onehot.shape[1]y_train_onehot[1]

更新 1:这很奇怪,但您的第二个问题似乎是 y_train_onehot,您介意尝试两件事吗:

  1. 尝试在y_train 上使用 onehot 编码的相同模型。
  2. 如果仅此一项不起作用,请将损失更改为sparse_categorical_crossentropy

还要确保更改y_train_onehot.shape[1]为 softmax 层中的类数

于 2017-10-17T22:37:32.520 回答