tensorflow - TensorFlow Estimators 和大量预训练模型：跨调用重用？

Question

我正在尝试使用 TensorFlow 中预训练模型的迁移学习 - 使用 Estimator API。模型的细节（层数、神经元等）会改变并且不相关。

get_feature_extractor() 实例化一个 TensorFlow Hub 模块。最终发生的情况是，每次调用 .train_and_evaluate()、.predict() 等都会破坏会话和图形并从头开始，重新加载特征提取器。这需要几秒钟。是否有一种干净的方法可以在这些调用中保留 get_feature_extractor() 的结果并使其保持会话 - 至少对于 .predict() 而言？还是我必须使用较低级别的 API 来实现这一点？

def model_fn(features, labels, mode):
    feature_extractor = get_feature_extractor()
    layer = feature_extractor(features)
    layer = tf.layers.batch_normalization(layer)
    layer = tf.layers.dense(inputs=layer, units=1280, activation=tf.nn.relu)
    layer = tf.layers.dense(inputs=layer, units=2048, activation=tf.nn.relu)
    layer = tf.layers.dense(inputs=layer, units=512, activation=tf.nn.relu)
    layer = tf.layers.dense(inputs=layer, units=2)
    if mode == tf.estimator.ModeKeys.PREDICT:
        estimator = tf.estimator.EstimatorSpec(mode, predictions=layer)     
    else:
        accuracy = tf.metrics.accuracy(labels=labels,
                               predictions=layer,
                               name='acc_op')
        metrics = {'accuracy': accuracy}
        loss = tf.losses.mean_squared_error(labels=labels, predictions=layer)
        optimizer = tf.train.AdamOptimizer()
        train_op = optimizer.minimize(loss=loss, global_step=tf.train.get_global_step())
        estimator = tf.estimator.EstimatorSpec(
            mode=mode, loss=loss, train_op=train_op,
            eval_metric_ops=metrics)    
    return estimator

tensorflow - TensorFlow Estimators 和大量预训练模型：跨调用重用？

0 回答 0

Related

Reference