
Change to the new deberta model #735

Merged
merged 4 commits into from
Jul 28, 2023

Conversation

kwalcock
Member

No description provided.

@kwalcock
Member Author

This shows how to change from the roberta model to the deberta model. However, using it will run into #21.

@MihaiSurdeanu
Contributor

Hey @kwalcock, any updates on this? Thank you!

@kwalcock
Member Author

I'm still digging deep into the Python code. It is going to be difficult. The most recent changes are in the kwalcock/types branch FWIW.

@MihaiSurdeanu
Contributor

That message is a warning, and it is possible that the tokens look similar. Did you compare the tokens produced by the Python and Rust tokenizers?
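One way to do that comparison is a small diff helper that reports the first position at which two token sequences disagree. This is a generic sketch, not code from the PR; `python_tokens` and `rust_tokens` are placeholders for whatever each tokenizer actually produced:

```python
def first_mismatch(a, b):
    """Return the index of the first differing token, or None if the
    sequences are identical (including equal length)."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    if len(a) != len(b):
        # One sequence is a prefix of the other; the shorter length is
        # the first position where they differ.
        return min(len(a), len(b))
    return None

# Illustrative only: suppose the Rust tokenizer drops a leading meta-symbol.
python_tokens = ["▁Hello", "▁world", "!"]
rust_tokens = ["Hello", "▁world", "!"]
assert first_mismatch(python_tokens, rust_tokens) == 0
assert first_mismatch(python_tokens, python_tokens) is None
```

Printing a few tokens around the reported index usually makes it obvious whether the warning is cosmetic or a real divergence.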

@MihaiSurdeanu
Contributor

In any case, if not all tokenizers are supported, it's not the end of the world. We just need to know, so we don't use the deberta model.

@kwalcock
Member Author

All of the Python output matches regardless of whether use_fast is True or False. I'm assuming that use_fast=False makes the pure-Python implementation run instead of the Rust one, but I have not yet seen exactly how that works. So far everything has also matched the Rust code called from Scala, as long as the tokenizer is one of those available directly from Rust.
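The check described above amounts to a loop over tokenizer names, comparing the use_fast=True and use_fast=False outputs. In the real setting, a `load_tokenizer` function would wrap HuggingFace's `AutoTokenizer.from_pretrained(name, use_fast=...)`; here it is a hypothetical stub (a whitespace splitter) so the sketch runs standalone:

```python
def load_tokenizer(name, use_fast):
    # Hypothetical stand-in for AutoTokenizer.from_pretrained(name, use_fast=use_fast).
    # Both the fast (Rust-backed) and slow (pure-Python) paths are modeled by the
    # same whitespace splitter, since for a supported tokenizer the two
    # implementations are expected to produce identical tokens.
    return lambda text: text.split()

def outputs_match(name, text):
    fast = load_tokenizer(name, use_fast=True)
    slow = load_tokenizer(name, use_fast=False)
    return fast(text) == slow(text)

names = ["bert-base-cased", "roberta-base", "xlm-roberta-base"]
assert all(outputs_match(n, "Change to the new deberta model") for n in names)
```

With the real AutoTokenizer plugged in, any name for which `outputs_match` is False would flag a tokenizer whose slow path does extra work that the Rust path cannot reproduce.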

@kwalcock
Member Author

The top tokenizers here (and maybe more) work from Scala via Rust without a detour through Python. The trick will be to get the ones that need some kind of Python assistance to work. Aren't tokenizers paired with models, so that this limits the models that can be used?

class SentencesTest extends Test {
  // See also test_clu_tokenizer.py.
  val tokenizerNames = Seq(
    "bert-base-cased",
    "distilbert-base-cased",
    "roberta-base",
    "xlm-roberta-base" // ,
    // All of these latter ones will not just fail, but cause a
    // fatal runtime error and end the testing completely.
    // "google/bert_uncased_L-4_H-512_A-8",
    // "google/electra-small-discriminator",
    // "microsoft/deberta-v3-base"
  )
}

@MihaiSurdeanu
Contributor

Yes, limiting the tokenizers will limit the models we have access to. In particular, deberta is an important one for processors.
I wonder if we can do the same trick we did with Breeze, that is, replicate the Python parts directly in Scala. Is that complicated?

@MihaiSurdeanu
Contributor

Nice!! Ok to merge?

@kwalcock
Member Author

Yes. Just finished testing.

@kwalcock kwalcock merged commit 3432394 into balaur Jul 28, 2023
@kwalcock kwalcock deleted the kwalcock/balaur branch July 28, 2023 06:32
@MihaiSurdeanu
Contributor

This works great!
