How to use BERT to directly evaluate my conversation? #6

Closed · dxlong2000 opened this issue Aug 2, 2022 · 8 comments

@dxlong2000

Hi @ehsk , @korymath ,

Thanks for your great work. May I ask if there are any inference scripts I can run to evaluate my generated dialog? Looking forward to hearing from you soon.

Thanks!

@ehsk (Collaborator) commented Aug 2, 2022

Hi @dxlong2000,

Unfortunately, we don't have our fine-tuned models anymore. You need to fine-tune BERT yourself first.

Hope this helps!

@dxlong2000 (Author)

Thanks for your reply, I see. Would you mind uploading the inference code? For example, how to load the model and evaluate a new dialog? Thanks!

@ehsk (Collaborator) commented Aug 3, 2022

Our code supports evaluation. You can find it here. We didn't implement inference (saving the predicted labels for input data), but it would be quite similar to the evaluation code.
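
A minimal inference sketch in that spirit, using the Hugging Face transformers API rather than this repo's own loading code (the checkpoint path, the premise/hypothesis pairing, and the label order are all assumptions):

import torch
from transformers import BertForSequenceClassification, BertTokenizer

# Hypothetical path to the BERT model you fine-tuned yourself
model_path = "/path/to/finetuned-bert"
tokenizer = BertTokenizer.from_pretrained(model_path)
model = BertForSequenceClassification.from_pretrained(model_path)
model.eval()

# Treat the conversation history as the premise and the response as the hypothesis
premise = " ".join(["Hi, how are you?", "I'm good, just got back from a trip."])
hypothesis = "Where did you go?"

inputs = tokenizer(premise, hypothesis, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits
predicted_label = logits.argmax(dim=-1).item()  # index into your label set
print(predicted_label)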

@dxlong2000 (Author)

Hi @ehsk ,

Your evaluation code only reports eval_accuracy, eval_loss, global_step, and loss. May I ask how I can get the SS scores? Looking forward to hearing from you soon.

Thanks!

@ehsk (Collaborator) commented Aug 10, 2022

Hi @dxlong2000,

For Semantic Similarity, take a look here. You need to write code like the following:

from dialogentail.semantic_similarity import SemanticSimilarity

ss = SemanticSimilarity()
ss.compute(conversation_history, actual_response, generated_response)

conversation_history is a list of strings and actual_response and generated_response are both strings.
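
For example, with hypothetical utterances (what compute returns is an assumption here; the discussion below mentions a sim_generated_resp value among its outputs):

from dialogentail.semantic_similarity import SemanticSimilarity

# A short two-turn history plus a reference response and a generated response
conversation_history = ["Hi, how are you?", "I'm good, just got back from a trip."]
actual_response = "Where did you travel to?"
generated_response = "Where did you go?"

ss = SemanticSimilarity()
result = ss.compute(conversation_history, actual_response, generated_response)
print(result)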

@dxlong2000 (Author)

Hi @ehsk ,

Thanks for your quick response. My understanding is that the entailment model is trained on the ground-truth responses, and we can then take that pre-trained model to evaluate a new conversation without knowing the actual_response. Am I correct?

I still see that the computation of ss includes actual_response. In the paper, I saw: "It measures the distance between the generated response and the utterances in the conversation history", but there is no mention of the actual_response. Would you mind clarifying this for me?

Thanks a lot!

@dxlong2000 (Author)

I saw you already provided sim_generated_resp. That answers my question above. Is there any way I can load my fine-tuned BERT from above instead of ELMo?

@ehsk (Collaborator) commented Aug 11, 2022

Semantic Similarity measures the cosine similarity between embedding vectors. A more recent alternative would be BERTScore.
actual_response is not really necessary; you can pass the same string as generated_response.
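
As an aside, a minimal BERTScore sketch using the bert-score package (not part of this repo; the example strings are made up):

# pip install bert-score
from bert_score import score

# Compare the generated response against a reference utterance
cands = ["Where did you go?"]
refs = ["Where did you travel to?"]
P, R, F1 = score(cands, refs, lang="en")
print(F1.mean().item())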

If you want to use an entailment model, the coherence metric (here) is what you need:

from dialogentail.coherence import BertCoherence

c = BertCoherence("/path/to/model")
c.compute(conversation_history, actual_response, generated_response)

The constructor argument is the path to a fine-tuned BERT model.
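
For example, with a hypothetical path and the same made-up data as above (repeating generated_response in place of actual_response follows the earlier note about SS and is an assumption here too):

from dialogentail.coherence import BertCoherence

conversation_history = ["Hi, how are you?", "I'm good, just got back from a trip."]
generated_response = "Where did you go?"

# "/path/to/model" is wherever your fine-tuned BERT checkpoint lives
c = BertCoherence("/path/to/model")
result = c.compute(conversation_history, generated_response, generated_response)
print(result)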

ehsk closed this as completed Oct 23, 2022