This repository has been archived by the owner on Mar 1, 2024. It is now read-only.

how to generate embeddings for all entities after we have the model? #21

Open

XRodriguez10 opened this issue Jun 24, 2020 · 7 comments

@XRodriguez10

I'm trying to train a biencoder model to support Chinese. After training the biencoder, how can I generate embeddings for all entities, like the provided file models/all_entities_large.t7?

@XRodriguez10 (Author)

Also, are you going to release instructions on how to train the models? Thanks!

@rajatinteros commented Jul 9, 2020

Hello all, I would also like some tips on training this architecture from scratch, and on how to use it as a pre-trained network on a custom dataset.

@anjalibhavan

Yes, same issue here! I would like to know how to use this for a custom dataset, and how to generate embeddings from the linked documents.

@izuna385 commented Jul 31, 2020

Hello, I just re-implemented hard-negative mining and scripts for encoding entities with the zeshel dataset from [Logeswaran et al., '19].
See here for your information. This repository might also be useful for re-implementing the encoding of all entities; a sketch of the core mining idea follows below.
Thanks.
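
For anyone who wants the general idea without digging through those repositories: hard-negative mining scores every entity with the current biencoder and keeps the top-scoring non-gold entities as extra negatives for the next training round. A minimal PyTorch sketch of that idea (the tensor names and the dot-product scorer are my assumptions, not the actual code from either repository):

```python
import torch

def mine_hard_negatives(mention_vecs, entity_vecs, gold_ids, k=10):
    """For each mention, return the top-k highest-scoring entities
    that are NOT the gold entity (the hard negatives).

    mention_vecs: (num_mentions, dim) tensor from the context encoder
    entity_vecs:  (num_entities, dim) tensor from the entity encoder
    gold_ids:     (num_mentions,) tensor of gold entity indices
    """
    # Dot-product scores between every mention and every entity.
    scores = mention_vecs @ entity_vecs.t()  # (num_mentions, num_entities)
    # Mask out the gold entity so it cannot be selected as a negative.
    scores[torch.arange(len(gold_ids)), gold_ids] = float("-inf")
    # The highest-scoring wrong entities are the hard negatives.
    return scores.topk(k, dim=1).indices     # (num_mentions, k)
```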

@abhinavkulkarni

Hi all,

You can refer to my comment #106 (comment) on generating embeddings for new candidates with an existing model; a rough sketch of the idea is below.
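
In outline, producing a file like all_entities_large.t7 amounts to running every entity (title plus description) through the trained candidate encoder and saving the stacked vectors with torch.save. Here is a minimal sketch using plain transformers/PyTorch, assuming a BERT-style candidate tower that uses the [CLS] vector as the embedding; the checkpoint path and helper name are illustrative, and BLINK's actual input formatting (e.g. its special entity marker token) may differ:

```python
import torch
from transformers import BertModel, BertTokenizer

# Assumption: the candidate (entity) tower of the trained biencoder is a
# BERT-style model; replace the path with your own checkpoint.
tokenizer = BertTokenizer.from_pretrained("bert-large-uncased")
encoder = BertModel.from_pretrained("path/to/candidate_encoder").eval()

@torch.no_grad()
def encode_all_entities(entities, batch_size=32, max_len=128):
    """entities: list of (title, description) pairs; returns an (N, dim) tensor."""
    vecs = []
    for i in range(0, len(entities), batch_size):
        batch = entities[i:i + batch_size]
        # Encode title and description as a standard sentence pair.
        inputs = tokenizer([t for t, _ in batch], [d for _, d in batch],
                           padding=True, truncation=True,
                           max_length=max_len, return_tensors="pt")
        out = encoder(**inputs)
        # Take the [CLS] vector as the entity embedding.
        vecs.append(out.last_hidden_state[:, 0, :])
    return torch.cat(vecs, dim=0)

# all_vecs = encode_all_entities(my_entities)
# torch.save(all_vecs, "all_entities.t7")  # loadable later with torch.load
```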

@JLUGQQ commented May 2, 2022

> I'm trying to train a biencoder model to support Chinese. After training the biencoder, how can I generate embeddings for all entities, like the provided file models/all_entities_large.t7?

I wonder whether you have trained this model on a Chinese dataset. If so, could you share your Chinese training dataset with me? I also want to use this model for Chinese, but I lack a Chinese dataset. Thank you very much!

@abhinavkulkarni

With regards to training a new model with custom data: yes, it is indeed possible. I would recommend first training a zero-shot entity linking (zeshel) model just to get the hang of the training process. The scripts to download and pre-process the zeshel data are in the repository. You can then replicate the same steps: bring your data into the same format as zeshel (sketched below), modify any hyperparameters (such as context length or the choice of BERT base model), and train your own model.
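
For reference, a processed zeshel-style training file is one JSON record per line. The sketch below shows what one record looks like; the values are invented and the exact field set may differ between versions of the pre-processing scripts, so verify against your own processed train.jsonl:

```python
import json

# One mention record in the zeshel-style jsonl format (invented values;
# check the field names against your own pre-processed train.jsonl).
example = {
    "context_left": "She starred in the animated film",
    "mention": "The Lion King",
    "context_right": ", which became a box-office success.",
    "label": "The Lion King is a 1994 American animated musical film ...",
    "label_title": "The Lion King",
    "label_id": 42,
}

with open("train.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(example, ensure_ascii=False) + "\n")
```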
