Skip to content

Meeting 2020 05 07

ferraric edited this page May 12, 2020 · 1 revision

Status Update

Claudio

4 jobs done: trained with only glove embeddings, with glove & flair embeddings, on full and small datasets. --> learning: more data and more embeddings help

goal: ablation study - try as many embeddings as possible --> next steps: fix flair that it's possible to train on whole dataset, then start many jobs. checkout possible baseline implementatinos.

Sinan

next steps: checkout google implementation and if it is possible for us to implement. is a mixed of supervised und unsupervised learning. seems rather complicated and unintuitive, but maybe possible to achieve something and generalize better.

Vanessa

done: pushed code to github, maybe useful as start for pipeline, general code of bert - i.e. easy to expand models. started with flair tutorials. --> next steps: train flair embeddings on our dataset - support claudio.

Jérémy:

currently on data augmentation. finding: until now only trained on augmented data and not whole dataset, thus expect better results --> next steps: augmentation experiments done

Ideas from research:

General:

  • we need 2 baselines, here we can start figuring out which one to use (glove, bert, ...)
  • start with related works section in paper
  • too early to start with pipeline, first need to fix direction we want to go