tensorflow - 几个时期后损失开始跳跃

Question

我正在尝试训练神经网络来检测隐写图像。我将 Nvidia Digits 与 Tensorflow 一起使用。我的问题是损失开始逐渐减少，然后开始跳跃。

我的神经网络是 -

from model import Tower
from utils import model_property
import tensorflow as tf
import tensorflow.contrib.slim as slim
import utils as digits

class UserModel(Tower):

    @model_property
    def inference(self):
        x = tf.reshape(self.x, shape=[-1, self.input_shape[0], self.input_shape[1], self.input_shape[2]])
        with slim.arg_scope([slim.conv2d, slim.fully_connected],
                            weights_initializer=tf.contrib.layers.xavier_initializer(),
                            weights_regularizer=slim.l2_regularizer(0.00001)):
            conv1 = tf.layers.conv2d(inputs=x, filters=64, kernel_size=7, padding='Valid', strides=2, activation=tf.nn.relu)
            rnorm1 = tf.nn.local_response_normalization(input=conv1)
            conv2 = tf.layers.conv2d(inputs=rnorm1, filters=16, kernel_size=5, padding='Valid', strides=1, activation=tf.nn.relu)
            rnorm2 = tf.nn.local_response_normalization(input=conv2) 
            flatten = tf.contrib.layers.flatten(rnorm2)
            fc1 = tf.contrib.layers.fully_connected(inputs=flatten, num_outputs=1000, activation_fn=tf.nn.relu)
            fc2 = tf.contrib.layers.fully_connected(inputs=fc1, num_outputs=1000, activation_fn=tf.nn.relu)
            fc3 = tf.contrib.layers.fully_connected(inputs=fc2, num_outputs=2, activation_fn=None)
            return fc3

    @model_property
    def loss(self):
        model = self.inference
        loss = digits.classification_loss(model, self.y)
        accuracy = digits.classification_accuracy(model, self.y)
        self.summaries.append(tf.summary.scalar(accuracy.op.name, accuracy))
        return loss

我正在使用具有 0.0005 基本学习率的 SGD。我将步长更改为 5%，gamma 为 0.95。（我在研究时使用了这些设置，并且当学习率没有足够快地降低时，一段时间后损失开始跳跃 - 早些时候我使用 0.0005 和基本速率和 nvidia 数字默认步长）。

你知道如何让损失逐渐减少吗？任何建立网络的建议或指导将不胜感激。

谢谢！

score 1 · Accepted Answer

因此，如果有人遇到同样的问题，我所做的就是将初始损失调整为 0.0001，将步长调整为 5%，将 gamma 调整为 0.9。它让我逐渐减少了损失。

但是我认为学习率太低了，因为损失并没有像我希望的那样下降。

tensorflow - 几个时期后损失开始跳跃

1 回答 1

Related

Reference