Quantized model artifacts are not accessible as class attributes #984

denver1117 · 2020-01-11T23:15:05Z

It does not appear that the model artifacts needed to explore quantized models are made public, specifically with the python module.

For instance, with a non-quantized supervised model class, you can access the input and output matrix explicitly as numpy objects:

model.get_input_matrix()
model.get_output_matrix()

In this case you are able to access all of the model artifacts needed to manually get word and sentence vectors, perform the dot product for prediction, etc.

For quantized models, it is very opaque what has gone on under the hood and where the artifacts are. The input matrix usually accessible with model.get_input_matrix() goes away. This is understandable. But presumably some artifacts exist that map words to their nearest centroid, and that map centroids back into the full dimensional input matrix space. These seemingly must exist in order to produce word/sentence vectors for quantized models as part of the predict step.

Where are these attributes? Can they be accessed with the python module? I see no new attributes created in either the python model object or the pybind object model.f after quantization.

These would be much appreciated in efforts to further explore fasttext quantization. While I understand the methods (predict, get_word_vector, get_sentence_vector, etc.) function appropriately after quantization, it is very much a black box to the end user.

The text was updated successfully, but these errors were encountered:

Celebio · 2020-01-14T09:46:00Z

Hi @denver1117 ,
Thank you for your suggestion, it makes sense. We will add it to our feature request list.

Best regards,
Onur

denver1117 · 2020-01-14T16:29:56Z

Great thanks 💯

Celebio added the Feature request label Jan 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quantized model artifacts are not accessible as class attributes #984

Quantized model artifacts are not accessible as class attributes #984

denver1117 commented Jan 11, 2020

Celebio commented Jan 14, 2020

denver1117 commented Jan 14, 2020

Quantized model artifacts are not accessible as class attributes #984

Quantized model artifacts are not accessible as class attributes #984

Comments

denver1117 commented Jan 11, 2020

Celebio commented Jan 14, 2020

denver1117 commented Jan 14, 2020