So basically this are the dimensions of the weights from trained caffenet:
conv1: (96,3,11,11) conv2: (256,48,5,5) conv3:(384,256,3,3) conv4: (384,192,3,3) conv5:(256, 192, 3 , 3)
I am confused that although conv1 gives 96 channels as output why does conv2 only considers 48 while convolution? Am I missing something?