
About fine tuning with different architecture #140

Closed

jackiechensuper opened this issue Feb 23, 2014 · 2 comments

jackiechensuper commented Feb 23, 2014

Hi guys,
If I want to classify 100 objects while reusing the convolution and max-pooling layers of the pre-trained ImageNet model, I only need to change imagenet.prototxt to give the fully-connected network a different number of outputs. How do I initialize the earlier layers of the network from the pre-trained ImageNet model while leaving the later layers randomly initialized?
Thanks

kloudkl (Contributor) commented Feb 23, 2014

Please take a look at #31, and consider searching the existing issues before opening a new one.

The secret is in Net::CopyTrainedLayersFrom. In my opinion, all you need to do is define your desired network and give the layers whose weights you want to copy from a pre-trained model the same names as the corresponding source layers. The other layers will be initialized by the random fillers that you specify.
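For concreteness, here is a minimal sketch of the renamed classifier layer in such a finetuning definition. The name "fc8-100" is a placeholder, and the syntax shown is the later `layer { type: "InnerProduct" }` prototxt form, so it may need adjusting for your Caffe version; the only essential point is that the renamed layer matches nothing in the pretrained model, so CopyTrainedLayersFrom skips it and the fillers initialize it:

```
# Every other layer keeps its original name, so its weights are
# copied from the pretrained model by name.
layer {
  name: "fc8-100"          # renamed: no match in the pretrained net,
  type: "InnerProduct"     # so this layer is NOT copied
  bottom: "fc7"
  top: "fc8-100"
  inner_product_param {
    num_output: 100        # 100 target classes instead of 1000
    weight_filler { type: "gaussian" std: 0.01 }  # random init
    bias_filler { type: "constant" value: 0 }
  }
}
```

With a definition like this, pointing training at the pretrained snapshot (in later releases, for example, `caffe train -solver solver.prototxt -weights pretrained.caffemodel`) copies every layer whose name still matches and leaves only the renamed classifier randomly initialized.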

shelhamer (Member) commented

@kloudkl is right. See the finetuning slide in the Caffe presentation for a miniature example.
