matlab - Matconvnet 中的卷积层到全连接层

Question

我试图了解 Matconvnet 中的 MNIST 示例是如何设计的。看起来他们正在使用 LeNet 变体，但由于我之前没有使用过 Matconvnet，所以我很难建立最后一个卷积层和第一个全连接层之间的连接：

net.layers = {} ;
net.layers{end+1} = struct('type', 'conv', ...
                       'weights', {{f*randn(5,5,1,20, 'single'), zeros(1, 20, 'single')}}, ...
                       'stride', 1, ...
                       'pad', 0) ;
net.layers{end+1} = struct('type', 'pool', ...
                       'method', 'max', ...
                       'pool', [2 2], ...
                       'stride', 2, ...
                       'pad', 0) ;
net.layers{end+1} = struct('type', 'conv', ...
                       'weights', {{f*randn(5,5,20,50, 'single'),zeros(1,50,'single')}}, ...
                       'stride', 1, ...
                       'pad', 0) ;
net.layers{end+1} = struct('type', 'pool', ...
                       'method', 'max', ...
                       'pool', [2 2], ...
                       'stride', 2, ...
                       'pad', 0) ;
net.layers{end+1} = struct('type', 'conv', ...
                       'weights', {{f*randn(4,4,50,500, 'single'),  zeros(1,500,'single')}}, ...
                       'stride', 1, ...
                       'pad', 0) ;
net.layers{end+1} = struct('type', 'relu') ;
net.layers{end+1} = struct('type', 'conv', ...
                       'weights', {{f*randn(1,1,500,10, 'single'), zeros(1,10,'single')}}, ...
                       'stride', 1, ...
                       'pad', 0) ;
net.layers{end+1} = struct('type', 'softmaxloss') ;

通常，在 Tensorflow 和 MxNet 等库中，最后一个卷积层被展平，然后连接到全连接层。在这里，据我了解，他们将第一个全连接层解释{{f*randn(4,4,50,500, 'single'), zeros(1,500,'single')}}为全连接层，但该层仍然给出一个三维激活图作为其结果。我看不出这里的“扁平化”是如何发生的。我需要有关如何在此处建立卷积层全连接层连接的帮助。

score 1 · Accepted Answer

据我所知，您应该只用卷积层替换全连接层，该卷积层的过滤器的宽度和高度等于输入的宽度和高度。事实上，您不需要在 Matconvnet 中的全连接层之前对数据进行展平（平面数据已经1x1xDxN成型）。在您的情况下，使用与输入空间大小相同的内核，即4x4，将作为 FC 层运行，其输出将为 1 x 1 x 500 x B。（B 代表输入中的第四维）

更新： 网络架构及其输出在此处可视化，以了解操作流程。

matlab - Matconvnet 中的卷积层到全连接层

1 回答 1

Related

Reference