
Minor improvements to the Embedder #35

Merged
merged 6 commits into Riccorl:main on Feb 18, 2022

Conversation

LeonardoEmili
Contributor

@LeonardoEmili LeonardoEmili commented Feb 17, 2022

The following changes have been applied:

  • Add support for averaging the last four hidden layers of the transformer model
  • Add a shape property to the TransformersEmbedderOutput class (referenced in the README)
  • Add option to specify which hidden states to use for pooling
  • Update the docs accordingly
  • Run black in compliance with the project specifications
  • Fix documentation issues

@Riccorl Riccorl self-assigned this Feb 17, 2022
@Riccorl
Owner

Riccorl commented Feb 17, 2022

I suggest changing

def __init__(
        self,
        model: Union[str, tr.PreTrainedModel],
        return_words: bool = True,
        pooling_strategy: str = "last",
        output_layers: List[int] = [-4, -3, -2, -1],
        fine_tune: bool = True,
        return_all: bool = False,
    ) -> None:
        super().__init__()

to

def __init__(
        self,
        model: Union[str, tr.PreTrainedModel],
        return_words: bool = True,
        pooling_strategy: str = "last",
        output_layers: List[int] = None,
        fine_tune: bool = True,
        return_all: bool = False,
    ) -> None:
        super().__init__()
        if output_layers is None:
            output_layers = [-4, -3, -2, -1]

to pass the checks.
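The check in question flags mutable default arguments. A minimal sketch (not from this PR; the function name is hypothetical) of why a list literal as a default is dangerous: the list is created once at function definition time and shared across every call that omits the argument.

```python
# Hypothetical example illustrating the mutable-default-argument pitfall.
def append_layer(layer, layers=[]):
    # "layers" is the SAME list object on every default-argument call.
    layers.append(layer)
    return layers

first = append_layer(-1)
second = append_layer(-2)
# "first" and "second" are the same object, now containing [-1, -2],
# which is why linters insist on the "None sentinel" pattern above.
```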

@LeonardoEmili
Contributor Author

LeonardoEmili commented Feb 17, 2022


Wouldn't that mean losing default-value suggestions in the IDE? What about passing the layers as a tuple instead of a list?

@Riccorl
Owner

Riccorl commented Feb 18, 2022

Let's try with the tuple first.
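A minimal sketch of the tuple approach agreed on here (the class body is hypothetical; only the `output_layers` parameter and its default come from the quoted signature). Because tuples are immutable, a literal default is safe to share across calls, and the concrete value still shows up in IDE completion hints.

```python
from typing import Tuple

class Embedder:
    def __init__(
        self,
        # Immutable tuple default: safe to evaluate once at definition
        # time, unlike the list literal flagged by the checks.
        output_layers: Tuple[int, ...] = (-4, -3, -2, -1),
    ) -> None:
        # Convert internally if list semantics are needed downstream.
        self.output_layers = list(output_layers)

embedder = Embedder()
# embedder.output_layers == [-4, -3, -2, -1]
```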

@Riccorl Riccorl merged commit 88c6d52 into Riccorl:main Feb 18, 2022
@Riccorl Riccorl self-requested a review February 18, 2022 10:45