You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the example listed here, is it possible to use other models from the huggingface models hub to generate lexical substitutes? I am happy to contribute more models to the repo once I understand the pipeline of adding new models.
Also, which approach from the paper does this example correspond to? Is it the XLNet+embs approach listed in bold in this table from the paper?
The text was updated successfully, but these errors were encountered:
If you want to use huggingface models not tested in our paper, this may be not so trivial as it seems, because likely the approach to generating substitutes shall be modified to some degree for each model individually (at least, different pre-processing and post-processing may be required, and likely optimal hyperparameters will be significantly different, like we saw in our experiments when architecturally very similar BERT and RoBERTa were compared ). Technically, you can start from copy-pasting one of the existing configs (this simple one for instance https://github.com/Samsung/LexSubGen/blob/main/configs/subst_generators/lexsub/bert.jsonnet) and then replace pipeline steps with you new steps. For each step you shall use either an existing, or create a new config. If you want a new model, you shall also write a new probability estimator (start from copy-pasting this one https://github.com/Samsung/LexSubGen/blob/main/lexsubgen/prob_estimators/bert_estimator.py) and the corresponding config (https://github.com/Samsung/LexSubGen/blob/main/configs/prob_estimators/lexsub/bert.jsonnet).
I think in the notebook our best model XLNet+embs is used, since config "xlnet_embs.jsonnet" is used.
In the example listed here, is it possible to use other models from the huggingface models hub to generate lexical substitutes? I am happy to contribute more models to the repo once I understand the pipeline of adding new models.
Also, which approach from the paper does this example correspond to? Is it the XLNet+embs approach listed in bold in this table from the paper?
The text was updated successfully, but these errors were encountered: