
Expose encoder output #65

Closed
1 task done
maykcaldas opened this issue Jul 19, 2023 · 5 comments · Fixed by #67
Assignees: Kohulan, OBrink
Labels: bug (Something isn't working), enhancement (New feature or request), question (Further information is requested)

Comments

@maykcaldas

Issue Type

Questions

Source

PyPI

DECIMER Image Transformer Version

2.3.0

OS Platform and Distribution

MacBook Pro M1, 2020

Python version

3.10

Current Behaviour?

Hey!

Is there a way to access the encoder output using DECIMER's loaded model?
I'm interested in the embedded representation that is fed to the decoder, not the SMILES itself. I was wondering if it's possible to access it once the Transformer class calls the encoder and the decoder separately.

I could reproduce the predict_SMILES function by loading the model from the checkpoint available on Zenodo, but since it's a TF SavedModel, I can only __call__ it.

Is there any way to load these weights into the Transformer class so that I can call the t_encoder and access the enc_output?
Having an argument in the call that exposes the hidden_states would also work fine.

Any suggestion is welcome!
Thanks!
Mayk

Which images caused the issue? (This is mandatory for image-related issues)

No response

Standalone code to reproduce the issue

N/A, but I'm using this code to test:

import pickle
import DECIMER
import tensorflow as tf

tokenizer = pickle.load(
    open(
        "DECIMER_model/assets/tokenizer_SMILES.pkl", # Downloaded from zenodo
        "rb",
    )
)


def detokenize_output(predicted_array: int) -> str:
    """
    This function takes the predicted tokens from the DECIMER model
    and returns the decoded SMILES string.

    Args:
        predicted_array (int): Predicted tokens from DECIMER

    Returns:
        (str): SMILES representation of the molecule
    """
    outputs = [tokenizer.index_word[i] for i in predicted_array[0].numpy()]
    prediction = (
        "".join([str(elem) for elem in outputs])
        .replace("<start>", "")
        .replace("<end>", "")
    )

    return prediction

DECIMER_V2 = tf.saved_model.load("DECIMER_model/") # Downloaded from zenodo
chem_struct = DECIMER.config.decode_image('sample.png')
pred_tkns = DECIMER_V2(chem_struct)
predicted_SMILES = DECIMER.utils.decoder(detokenize_output(pred_tkns))
print(predicted_SMILES)

Relevant log output

No response

Code of Conduct

  • I agree to follow this project's Code of Conduct
@Kohulan Kohulan self-assigned this Jul 20, 2023
@Kohulan Kohulan added the question Further information is requested label Jul 20, 2023
@OBrink
Collaborator

OBrink commented Jul 20, 2023

Hey @maykcaldas,

I am not really sure what you are trying to do with the tokenizer if you want to access the encoder output.

If you want to access the encoder output, have a look at DECIMER/Predictor_EfficientNet2.py

In that file, add a function called get_encoder_output (I have copied this from evaluate() and left out the bits that process the image embedding with the Transformer decoder, as we just want the image embedding here):

def get_encoder_output(image_path: str):
    """
    This function takes an image path (str) and returns the encoder output.

    Args:
        image_path (str): Path of chemical structure depiction image

    Returns:
        (tf.Tensor): image embedding
    """
    sample = config.decode_image(image_path)
    _image_batch = tf.expand_dims(sample, 0)
    image_embedding = encoder(_image_batch, training=False)
    return image_embedding

If you then call that function, you get the EfficientNet V2 encoder output.
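
For example, once the function is in place you can call it from your own code like this (the image path is just a placeholder, and I'm assuming the module imports cleanly in your installed version):

from DECIMER.Predictor_EfficientNet2 import get_encoder_output

image_embedding = get_encoder_output("path/to/structure.png")  # placeholder path
print(image_embedding.shape)  # a tf.Tensor, e.g. shape (1, 256, 512)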

I hope this helps! Have a nice day! :)

@maykcaldas
Author

Hey @OBrink, thanks for your suggestion!
@smichtavy and I were working on this today.

I tried it, but I had some problems:

  1. Even before adding get_encoder_output, I tried to run the Predictor_EfficientNet2.py script directly (was that what you meant?). That led me to some errors:
    1.1. Efficient_Net_encoder.py uses import DECIMER.efficientnetv2 as efficientnetv2 to access the EffNetV2Model class. This might be an environment problem on my side, but the efficientnetv2 directory is not a package, so I added an __init__.py in efficientnetv2 that imports EffNetV2Model (see the sketch after this list).
    1.2. config.py has the import import DECIMER.Transformer_decoder, but then references it as Transformer_decoder. I changed the references to DECIMER.Transformer_decoder.
    1.3. efficientnetv2/utils.py needs the tensorflow_addons package, which is missing from requirements.txt.
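
For reference, the __init__.py I added looks roughly like this. I'm assuming EffNetV2Model is defined in effnetv2_model.py, as in the upstream efficientnetv2 code; adjust the module name if it lives somewhere else in your copy:

# DECIMER/efficientnetv2/__init__.py
# Re-export the model class so that
# `import DECIMER.efficientnetv2 as efficientnetv2; efficientnetv2.EffNetV2Model`
# works. The module name below is my assumption.
from .effnetv2_model import EffNetV2Model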

After solving these three issues, I could run the Predictor_EfficientNet2.py script. It was missing the tokenizer_Isomeric_SELFIES and max_length_Isomeric_SELFIES pickle files, so I used the ones provided on Zenodo: https://zenodo.org/record/8093783/files/models.zip (the files needed to be renamed).
But running it raised a shape error. Do you have any idea what might be causing it?

TFA has entered a minimal maintenance and release mode until a planned end of life in May 2024.
Please modify downstream libraries to take dependencies from other repositories in our TensorFlow community (e.g. Keras, Keras-CV, and Keras-NLP). 

For more information see: https://github.com/tensorflow/addons/issues/2807 

  warnings.warn(
Traceback (most recent call last):
  File "Predictor_EfficientNet2.py", line 179, in <module>
    main()
  File "Predictor_EfficientNet2.py", line 100, in main
    SMILES = predict_SMILES(sys.argv[1])
  File "Predictor_EfficientNet2.py", line 170, in predict_SMILES
    predicted_SELFIES = evaluate(image_path)
  File "Predictor_EfficientNet2.py", line 119, in evaluate
    _image_embedding = encoder(_image_batch, training=False)
  File "/home/maykcaldas/.local/lib/python3.8/site-packages/keras/utils/traceback_utils.py", line 70, in error_handler
    raise e.with_traceback(filtered_tb) from None
  File "/home/maykcaldas/.local/lib/python3.8/site-packages/DECIMER/Efficient_Net_encoder.py", line 80, in call
    x = self.reshape(x, training=training)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Exception encountered when calling layer 'image_embedding' (type Reshape).

{{function_node __wrapped__Reshape_device_/job:localhost/replica:0/task:0/device:CPU:0}} Input to reshape is a tensor with 59392 values, but the requested shape has 23200 [Op:Reshape]

Call arguments received by layer 'image_embedding' (type Reshape):
  • inputs=tf.Tensor(shape=(1, 16, 16, 232), dtype=float32)

@OBrink
Collaborator

OBrink commented Jul 28, 2023

Hey @maykcaldas & @smichtavy,

I am sorry for the confusion; I will look into cleaning up a couple of things in the repository.

I found an easier solution that saves us a lot of trouble, and I confirmed that it works:

  1. git clone https://github.com/Kohulan/DECIMER-Image_Transformer
  2. cd DECIMER-Image_Transformer
  3. in DECIMER/__init__.py, replace the imports with:
from .decimer import predict_SMILES, DECIMER_V2
from . import config
  4. pip install .
  5. in your Python code, call the encoder the following way:
from DECIMER import DECIMER_V2, config
import tensorflow as tf

encoder = DECIMER_V2.DECIMER.encoder

def get_encoder_output(image_path: str):
    """
    This function takes an image path (str) and returns the encoder output.

    Args:
        image_path (str): Path of chemical structure depiction image

    Returns:
        (tf.Tensor): image embedding
    """
    sample = config.decode_image(image_path)
    _image_batch = tf.expand_dims(sample, 0)
    image_embedding = encoder(_image_batch, training=False)
    return image_embedding
get_encoder_output(image_path)

If I run this on /Tests/caffeine.png, I get:

<tf.Tensor: shape=(1, 256, 512), dtype=float32, numpy=
array([[[-0.05972348, -0.22405794,  0.26993287, ...,  0.25145775,
         -0.10838411, -0.24237064],
        [ 0.20004356, -0.53331375, -0.04955457, ...,  0.16480035,
         -0.03783005, -0.26316923],
        [ 0.57722473, -0.38815224,  0.33489236, ..., -0.04090453,
         -0.03352015, -0.32710063],
        ...,
        [ 0.83991385,  0.2703736 ,  0.2658328 , ...,  0.21896963,
         -0.20168161,  0.29251066],
        [ 0.82687134, -1.2922107 , -0.25024006, ..., -0.64587194,
         -1.0208106 ,  0.07405327],
        [ 0.44606146, -0.1196775 ,  1.3218036 , ..., -0.4189508 ,
         -1.0974574 , -0.6215218 ]]], dtype=float32)>
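
So each image gives you a (1, 256, 512) tensor: 256 spatial positions (a flattened 16 × 16 feature map), each with a 512-dimensional embedding. If you need a single vector per image, one simple option (just a suggestion from my side, not something DECIMER does internally) is to average over the position axis:

# Reuses the imports and get_encoder_output defined above.
pooled = tf.reduce_mean(get_encoder_output("Tests/caffeine.png"), axis=1)
print(pooled.shape)  # (1, 512)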

Let us know if you have any further trouble! I'll wait until I hear from you to close this issue.

Have a nice weekend! :)
Otto

@OBrink OBrink self-assigned this Jul 28, 2023
@maykcaldas
Author

Thank you very much!
This one is much cleaner! Appreciate that!!

Feel free to close this issue :)

Have a great weekend, you too!

@OBrink OBrink closed this as completed Jul 28, 2023
@Kohulan Kohulan reopened this Jul 30, 2023
@Kohulan
Owner

Kohulan commented Jul 30, 2023

I've reopened the issue since we have to update the Predictor code to use checkpoints properly. I'll close it once the issue is solved.
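
Roughly, the idea is to have the Predictor restore the encoder and Transformer weights from the checkpoint files in the Zenodo models.zip. Something along these lines, purely as an illustration (encoder, transformer, and the checkpoint path are placeholders; the actual change will come with the linked PR):

# Sketch only: restore the weights from the downloaded checkpoint directory.
ckpt = tf.train.Checkpoint(encoder=encoder, transformer=transformer)
ckpt.restore(tf.train.latest_checkpoint("DECIMER_model/checkpoints/")).expect_partial()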

@Kohulan Kohulan added bug Something isn't working enhancement New feature or request labels Jul 30, 2023
@Kohulan Kohulan linked a pull request Jul 31, 2023 that will close this issue
Kohulan added a commit that referenced this issue Aug 1, 2023
fix: use checkpoints properly for predictions #65 and #66