ImageTorchEncoder

ImageTorchEncoder wraps the models from torchvision.

ImageTorchEncoder receives Documents with blob attributes of type ndarray and shape Height x Width x Channel. The blob attribute represents the image to be encoded by ImageTorchEncoder. This Executor will encode each blob into an ndarray of shape embedding_dim and store them in the embedding attribute of the Document.

Usage

Use the prebuilt images from Jina Hub in your Python codes, add it to your Flow and encode an image:

from jina import Flow, Document

f = Flow().add(uses='jinahub+docker://ImageTorchEncoder', uses_with={'model_name': 'resnet50'})

doc = Document(uri='my_image.png')
doc.convert_image_uri_to_blob()

with f:
    f.post(on='/index', inputs=doc, on_done=lambda resp: print(resp.docs[0].embedding))

Encoding example

After creating a Flow, prepare your Documents to encode. They should have the blob attribute set with shape Height x Width x Channel. Then, we can start the Flow and encode the Documents. By default, any endpoint will encode the Documents:

import numpy as np

from jina import Flow, Document

f = Flow().add(uses='jinahub+docker://ImageTorchEncoder')

doc = Document(blob=np.ones((224, 224, 3), dtype=np.uint8))

with f:
    f.post(on='/index', inputs=doc, on_done=lambda resp: print(resp.docs[0].embedding))

Set `volumes`

With the volumes attribute, you can map the torch cache directory to your local cache directory, in order to avoid downloading the model each time you start the Flow.

from jina import Flow

flow = Flow().add(
    uses='jinahub+docker://ImageTorchEncoder',
    volumes='/your_home_folder/.cache/torch:/root/.cache/torch'
)

Alternatively, you can reference the docker image in the yml config and specify the volumes configuration.

flow.yml:

jtype: Flow
pods:
  - name: encoder
    uses: 'jinahub+docker://ImageTorchEncoder'
    volumes: '/your_home_folder/.cache/torch:/root/.cache/torch'

And then use it like so:

from jina import Flow

flow = Flow.load_config('flow.yml')

Returns

Document with embedding fields filled with an ndarray of shape embedding_dim (size depends on the model) with dtype=float32.

Supported models:

You can specify the model to use with the parameter model_name:

import numpy as np

from jina import Flow, Document

f = Flow().add(
    uses='jinahub+docker://ImageTorchEncoder',
    uses_with={'model_name': 'alexnet'}
)

doc = Document(blob=np.ones((224, 224, 3), dtype=np.uint8))

with f:
    f.post(on='/index', inputs=doc, on_done=lambda resp: print(resp.docs[0].embedding))

ImageTorchEncoder supports the following models:

alexnet
squeezenet1_0
vgg16
densenet161
inception_v3
googlenet
shufflenet_v2_x1_0
mobilenet_v2
mnasnet1_0
resnet18

By default, resnet18 is the used model.

You can check the models here

GPU usage:

To enable GPU, you can set the device parameter to a cuda device. Make sure your machine is cuda-compatible. If you're using a docker container, make sure to add the gpu tag and enable GPU access to Docker with gpus='all'. Furthermore, make sure you satisfy the prerequisites mentioned in Executor on GPU tutorial.

import numpy as np

from jina import Flow, Document

f = Flow().add(
    uses='jinahub+docker://ImageTorchEncoder/gpu',
    uses_with={'device': 'cuda'}, gpus='all'
)

doc = Document(blob=np.ones((224, 224, 3), dtype=np.uint8))

with f:
    f.post(on='/index', inputs=doc, on_done=lambda resp: print(resp.docs[0].embedding))

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.github/workflows		.github/workflows
executor		executor
tests		tests
.gitignore		.gitignore
Dockerfile.gpu		Dockerfile.gpu
LICENSE		LICENSE
README.md		README.md
config.yml		config.yml
gpu_requirements.txt		gpu_requirements.txt
manifest.yml		manifest.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ImageTorchEncoder

Usage

Encoding example

Set `volumes`

Returns

Supported models:

GPU usage:

Reference

About

Releases 3

Packages

Contributors 14

Languages

License

jina-ai/executor-image-torch-encoder

Folders and files

Latest commit

History

Repository files navigation

ImageTorchEncoder

Usage

Encoding example

Set volumes

Returns

Supported models:

GPU usage:

Reference

About

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 14

Languages

Set `volumes`

Packages