
As the title clearly describes, my CNN model is still overfitting despite using Dropout, MaxPooling, EarlyStopping, and Regularizers. In addition, I have tried various values for learning_rate, dropout_rate, and the L1/L2 regularization weight decay. How can I further prevent overfitting?

Here is the model (Keras with the TensorFlow backend):

from keras.models import Sequential
from keras.layers import (Embedding, Dropout, Conv1D, MaxPooling1D,
                          GlobalMaxPooling1D, Dense)
from keras import regularizers
from keras.optimizers import Adam
from keras.callbacks import EarlyStopping

# Hyperparameters
batch_size = 128
num_epochs = 200
weight_decay = 1e-3      # L2 regularization factor
num_filters = 32 * 2
n_kernel_size = 5
num_classes = 3
activation_fn = 'relu'
nb_units = 128
last_dense_units = 128
n_lr = 0.001             # learning rate
n_momentum = 0.99
n_dr = 0.00001           # learning-rate decay passed to Adam
dropout_rate = 0.8

# nb_words, EMBEDDING_DIM and max_seq_len come from the preprocessing step (not shown)
model = Sequential()
model.add(Embedding(nb_words, EMBEDDING_DIM, input_length=max_seq_len))
model.add(Dropout(dropout_rate))
model.add(Conv1D(num_filters, n_kernel_size, padding='same', activation=activation_fn,
                 kernel_regularizer=regularizers.l2(weight_decay)))
model.add(MaxPooling1D())
model.add(GlobalMaxPooling1D())
model.add(Dense(128, activation=activation_fn, kernel_regularizer=regularizers.l2(weight_decay)))
model.add(Dropout(dropout_rate))
model.add(Dense(num_classes, activation='softmax'))

adam = Adam(lr=n_lr, beta_1=0.9, beta_2=0.999, epsilon=1e-08, decay=n_dr)
model.compile(loss='categorical_crossentropy', optimizer=adam, metrics=['acc'])

early_stopping = EarlyStopping(
    monitor='val_loss',
    patience=3,
    mode='min',
    verbose=1,
    restore_best_weights=True
)

model.fit(...)
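
The fit call is elided above. For completeness, here is a minimal sketch of how the pieces are typically wired together; the names X_train and y_train and the validation_split value are assumptions, not taken from the original post:

history = model.fit(
    X_train, y_train,               # assumed: padded sequences and one-hot labels
    batch_size=batch_size,
    epochs=num_epochs,
    validation_split=0.2,           # assumed hold-out fraction for validation
    callbacks=[early_stopping],     # stop when val_loss stops improving
    verbose=1
)

With restore_best_weights=True, the model weights are rolled back to the epoch with the lowest validation loss once training stops.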

Here is the plot of training and validation accuracy: [plot]


1 Answer


There are still approaches you can try against the overfitting.

Your model does indeed seem to overfit by about 10%. But how much overfitting is too much? I would look at this post and the related discussion so that you can best evaluate your specific situation.
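
A rough way to quantify that gap yourself is to compare the final training and validation accuracy from the History object returned by fit. A minimal sketch, assuming the history variable from the fit sketch above; the 'acc'/'val_acc' keys correspond to metrics=['acc'] in the question's compile call:

# Gap between training and validation accuracy at the last epoch
train_acc = history.history['acc'][-1]
val_acc = history.history['val_acc'][-1]
print('train acc: %.3f, val acc: %.3f, gap: %.3f'
      % (train_acc, val_acc, train_acc - val_acc))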

Answered 2021-01-26T20:33:41.130