I am trying to understand the basics of caffe, in particular to use with python.
My understanding is that the model definition (say a given neural net architecture) must be included in the '.prototxt'
file.
And that when you train the model on data using the '.prototxt'
, you save the weights/model parameters to a '.caffemodel'
file
Also, there is a difference between the '.prototxt'
file used for training (which includes learning rate and regularization parameters) and the one used for testing/deployment, which does not include them.
Questions:
- is it correct that the
'.prototxt'
is the basis for training and that the'.caffemodel'
is the result of training (weights), using the'.prototxt'
on the training data? - is it correct that there is a
'.prototxt'
for training and one for testing, and that there are only slight differences (learning rate and regularization factors on training), but that the nn architecture (assuming you use neural nets) is the same?
Apologies for such basic questions and possibly some very incorrect assumptions, I am doing some online research and the lines above summarize my understanding to date.