1

当提供 tf.data.Dataset 来训练 EfficientnetB0 模型时,我收到以下错误:

ValueError: in converted code:

    C:\Users\fconrad\AppData\Local\Continuum\anaconda3\envs\venv_spielereien\lib\site-packages\tensorflow_core\python\keras\engine\training_v2.py:677 map_fn
        batch_size=None)
    C:\Users\fconrad\AppData\Local\Continuum\anaconda3\envs\venv_spielereien\lib\site-packages\tensorflow_core\python\keras\engine\training.py:2410 _standardize_tensors
        exception_prefix='input')
    C:\Users\fconrad\AppData\Local\Continuum\anaconda3\envs\venv_spielereien\lib\site-packages\tensorflow_core\python\keras\engine\training_utils.py:573 standardize_input_data
        'with shape ' + str(data_shape))

    ValueError: Error when checking input: expected efficientnet-b0_input to have 4 dimensions, but got array with shape (224, 224, 3)

我真的很想知道为什么会发生这种情况,因为当我从我的数据集创建一个批次时:

train_generator = (tf.data.Dataset
                  .from_tensor_slices((train_imgs, train_labels))
                  .map(read_img)
                  .map(flip_img)
                  .map(brightness)
                  .map(blur)
                  .map(noise)
                   .map(rotate_90)
                  .repeat()
                  .shuffle(512)
                  .batch(BATCH_SIZE)
                  .prefetch(True))

validation_generator = (tf.data.Dataset
                       .from_tensor_slices((validation_imgs, validation_labels))
                       .map(read_img)
                       )

print(train_generator.__iter__().__next__()[0].shape)

我得到了预期的结果(64、224、224、3)。

但是在创建模型后,当我开始训练时会出现上述错误:

effn = tfkeras.EfficientNetB0(include_top=False, input_shape=img_shape, classes=4)
effn_model = tf.keras.Sequential()
effn_model.add(effn)
effn_model.add(tf.keras.layers.GlobalAveragePooling2D())
effn_model.add(tf.keras.layers.Dense(4, 'softmax'))

effn_model.compile(optimizer= 'adam', loss='categorical_crossentropy', metrics= ['categorical_accuracy'])

effn_model.fit(train_generator,
               epochs=20,
               steps_per_epoch=train_imgs.shape[0] // BATCH_SIZE,
               validation_data= validation_generator)

有谁知道为什么数据集中的切片具有形状 (64,224,224,3) 但模型无法识别批量维度?当我尝试训练 keras.application 模型时,一切正常。我使用 tensorflow 2.1 和效率网络的 pip 安装。谢谢

4

1 回答 1

0

如此处所述keras.io/api/applications/efficientnet/

input_shape:可选的形状元组,仅在 include_top 为 False 时指定。它应该正好有 3 个输入通道。

所以试试这个->

from tensorflow.keras.applications.efficientnet import EfficientNetB0, EfficientNetB5
mm = EfficientNetB0(include_top=True, weights=None, input_tensor=None, input_shape=(128, 128, 3), pooling=None, classes=2, classifier_activation="sigmoid")
mm.summary()

注意input_shape=(128, 128, 3) 它有 3 个通道。

于 2021-01-07T09:59:00.957 回答