Pretrained BERT model? #197

priamai · 2023-09-04T09:06:38Z

Hi there,
I have just deployed the last version via docker and noticed that there are only 2 models pre trained.

It would useful to know how to:
a) train the SCIBERT on some annotated dataset (the link is broken I guess is a private repo)?
b) download a pre-trained SCIBERT

Cheers!
@mehaase

mehaase · 2023-09-05T12:34:35Z

Hi @priamai, that screen is a bit misleading. It is showing stats for the models that were trained inside the container; the SciBERT model is trained outside the container (by us, on high-end GPUs) and downloaded into the docker container. If you want to fine-tune the model on your own data, we have some jupyter notebooks to facilitate that: https://github.com/center-for-threat-informed-defense/tram/wiki/Large-Language-Models#jupyter-notebooks

(I also fixed the broken link that you were looking at: https://github.com/center-for-threat-informed-defense/tram/wiki/Data-Annotation)

priamai · 2023-09-05T13:16:46Z

Hi @mehaase but when I upload a report it doesn't let me choose the mode, so does it default to the SCIBERT?
Thanks for fixing the link!
I love the colabo books so we can fine tune for free on Colab!

mehaase · 2023-09-05T13:42:31Z

Yes it defaults to scibert. The choice of model is specified entrypoint.sh.

mehaase closed this as completed Sep 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pretrained BERT model? #197

Pretrained BERT model? #197

priamai commented Sep 4, 2023

mehaase commented Sep 5, 2023

priamai commented Sep 5, 2023

mehaase commented Sep 5, 2023

Pretrained BERT model? #197

Pretrained BERT model? #197

Comments

priamai commented Sep 4, 2023

mehaase commented Sep 5, 2023

priamai commented Sep 5, 2023

mehaase commented Sep 5, 2023