Caffe to TensorFlow

Usage

Run convert.py to convert an existing Caffe model to TensorFlow.

Make sure you're using the latest Caffe format (see the notes section for more info).

The output consists of two files:

A data file (in NumPy's native format) containing the model's learned parameters.
A Python class that constructs the model's graph.

LeNet Example

This example shows how to finetune the model from the Caffe MNIST tutorial using TensorFlow. First, convert a prototxt model to TensorFlow code:

$ ./convert.py examples/mnist/lenet.prototxt --code-output-path=mynet.py

This produces TensorFlow code for the LeNet network in mynet.py. The code can be imported as described below in the Inference section. Caffe-tensorflow also lets you convert .caffemodel weight files to .npy files that can be loaded directly from TensorFlow:

$ ./convert.py examples/mnist/lenet.prototxt --caffemodel examples/mnist/lenet_iter_10000.caffemodel --data-output-path=mynet.npy

The above command will generate a weight file named mynet.npy.
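To check what ended up in a weights file before loading it into a model, you can list the stored parameter shapes. The sketch below is illustrative only: it assumes the file holds a pickled dictionary of the form layer name -> parameter name -> array, which may not match the exact layout convert.py writes. The layer names, parameter names, shapes, and the summarize helper are hypothetical, and the demo file is created in a temporary directory rather than read from mynet.npy.

```python
import os
import tempfile

import numpy as np

# Hypothetical stand-in for a converted weights file.
# Assumed layout: layer name -> parameter name -> array.
demo_params = {
    "conv1": {
        "weights": np.zeros((5, 5, 1, 20), dtype=np.float32),
        "biases": np.zeros(20, dtype=np.float32),
    },
    "ip2": {
        "weights": np.zeros((500, 10), dtype=np.float32),
        "biases": np.zeros(10, dtype=np.float32),
    },
}

demo_path = os.path.join(tempfile.mkdtemp(), "mynet_demo.npy")
# np.save wraps the dictionary in a 0-d object array and pickles it.
np.save(demo_path, demo_params)


def summarize(npy_path):
    """Return {layer: {parameter: shape}} for a dictionary-style .npy file."""
    # allow_pickle=True is required to read object arrays in recent NumPy.
    data = np.load(npy_path, allow_pickle=True).item()
    return {
        layer: {name: np.asarray(value).shape for name, value in entries.items()}
        for layer, entries in data.items()
    }


shapes = summarize(demo_path)
for layer in sorted(shapes):
    for name in sorted(shapes[layer]):
        print(layer, name, shapes[layer][name])
```

Only load .npy files from sources you trust with allow_pickle=True, since unpickling can execute arbitrary code.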

Inference:

Once you have generated both the code and weight files for LeNet, you can finetune LeNet using TensorFlow with:

$ ./examples/mnist/finetune_mnist.py

At a high level, finetune_mnist.py works as follows:

import tensorflow as tf

# Import the converted model's class
from mynet import MyNet

# Create an instance, passing in the input data
net = MyNet({'data': my_input_data})

with tf.Session() as sesh:
    # Load the data
    net.load('mynet.npy', sesh)
    # Forward pass
    output = sesh.run(net.get_output(), ...)

ImageNet example

See test.py for a functioning example. It verifies the sample models (under examples/) against the ImageNet validation set.

Verification

The following converted models have been verified on the ILSVRC2012 validation set.

Model	Top 5 Accuracy
VGG 16	89.88%
GoogLeNet	89.06%
CaffeNet	79.93%
AlexNet	79.84%

Notes

Only the new Caffe model format is supported. If you have an old model, use the upgrade_net_proto_text and upgrade_net_proto_binary tools that ship with Caffe to upgrade it first. Also make sure you're using a fairly recent version of Caffe.
It appears that Caffe and TensorFlow cannot be invoked concurrently (CUDA conflicts, even with set_mode_cpu). This makes conversion a two-stage process: first extract the parameters with convert.py, then import them into TensorFlow.
Caffe is not strictly required. If PyCaffe is found in your PYTHONPATH, and the USE_PYCAFFE environment variable is set, it will be used. Otherwise, a fallback will be used. However, the fallback uses the pure Python-based implementation of protobuf, which is astoundingly slow (~1.5 minutes to parse the VGG16 parameters). The experimental CPP protobuf backend doesn't particularly help here, since it runs into the file size limit (Caffe gets around this by overriding this limit in C++). A cleaner solution here would be to implement the loader as a C++ module.
Only a subset of Caffe layers and accompanying parameters are currently supported.
Not all Caffe models can be converted to TensorFlow. For instance, Caffe supports arbitrary padding whereas TensorFlow's support is currently restricted to SAME and VALID.
Caffe and TensorFlow handle border values differently. However, this doesn't appear to affect results much.
Image rescaling can slightly affect the ILSVRC2012 top 5 accuracy listed above. VGG16 expects isotropic rescaling (anisotropic reduces accuracy to 88.45%) whereas BVLC's implementation of GoogLeNet expects anisotropic rescaling (isotropic reduces accuracy to 87.7%).
The support class kaffe.tensorflow.Network has no internal dependencies. It can be safely extracted and deployed without the rest of this library.
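The padding note above can be made concrete by comparing output sizes. The sketch below is not the converter's actual logic. It uses the standard output-size formulas for a Caffe convolution and for TensorFlow's SAME and VALID modes to check, along one spatial dimension, whether a Caffe padding value has a TensorFlow counterpart. The function names are hypothetical, and a matching output size is only a necessary condition: SAME may still place its padding differently from Caffe.

```python
import math


def caffe_output_size(in_size, kernel, stride, pad):
    """Output length of a Caffe convolution along one dimension."""
    return (in_size + 2 * pad - kernel) // stride + 1


def tf_output_size(in_size, kernel, stride, padding):
    """Output length of a TensorFlow convolution along one dimension."""
    if padding == "SAME":
        return math.ceil(in_size / stride)
    if padding == "VALID":
        return math.ceil((in_size - kernel + 1) / stride)
    raise ValueError("padding must be 'SAME' or 'VALID'")


def matching_tf_padding(in_size, kernel, stride, pad):
    """Return 'VALID', 'SAME', or None if neither mode gives Caffe's output size."""
    target = caffe_output_size(in_size, kernel, stride, pad)
    # VALID adds no padding, so only consider it when Caffe adds none either.
    if pad == 0 and tf_output_size(in_size, kernel, stride, "VALID") == target:
        return "VALID"
    if tf_output_size(in_size, kernel, stride, "SAME") == target:
        return "SAME"
    return None


# Illustrative layer settings: (input size, kernel, stride, pad).
for config in [(28, 5, 1, 0), (28, 5, 1, 2), (28, 3, 1, 3)]:
    print(config, "->", matching_tf_padding(*config))
```

A result of None marks a layer whose padding cannot be expressed with SAME or VALID alone, which is the kind of model the note describes as not convertible.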


wkentaro/caffe-tensorflow
