GitHub - 934865517zjk/SimOAP: SimOAP: Improve Coherence and Consistency in Persona-based Dialogue Generation via Over-sampling and Post-evaluation (ACL2023)

SimOAP: Improve Coherence and Consistency in Persona-based Dialogue Generation via Over-sampling and Post-evaluation

This repository contains the code for SimOAP, published in the ACL 2023 main conference. Paper link: link

First Step: Over-sampling

We use BERT-over-BERT (BoB) and Multi-GPT2 as backbone models, so you first need to generate large-scale candidate responses with their code. In our setting, n=2000 candidate responses are generated for each set of history utterances, using top-k sampling with k=100.
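To make the over-sampling step concrete, here is a minimal sketch of top-k sampling that uses only the Python standard library. It is not the backbone models' decoding code: the names `top_k_sample` and `over_sample` and the toy logits are assumptions made for illustration, and a real run would take its logits from BoB or Multi-GPT2 at each decoding step.

```python
import math
import random


def top_k_sample(logits, k, rng):
    """Sample one token id from the k highest-scoring entries of `logits`."""
    k = min(k, len(logits))
    # Keep the indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax restricted to the kept entries (subtract the max for stability).
    m = max(logits[i] for i in top)
    weights = [math.exp(logits[i] - m) for i in top]
    return rng.choices(top, weights=weights, k=1)[0]


def over_sample(step_logits, n, k, seed=0):
    """Draw n candidate token sequences, taking one top-k sample per step.

    `step_logits` is a hypothetical stand-in for a model: one fixed logit
    vector per decoding step instead of logits conditioned on the prefix.
    """
    rng = random.Random(seed)
    return [[top_k_sample(logits, k, rng) for logits in step_logits]
            for _ in range(n)]


if __name__ == "__main__":
    # Toy vocabulary of 5 tokens and 4 decoding steps (illustrative values).
    toy_step_logits = [[2.0, 1.0, 0.5, -1.0, -3.0]] * 4
    candidates = over_sample(toy_step_logits, n=2000, k=3)
    print(len(candidates), candidates[0])
```

With k=3, only the three highest-scoring tokens can ever be drawn, which is the truncation that top-k sampling applies before renormalizing the probabilities.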

The code links for the two backbone models are as follows:

BERT-over-BERT (BoB): https://github.com/songhaoyu/BoB

Implementation details: BoB has two decoders. The first decoder generates a preliminary response, and the second decoder modifies the preliminary response to produce the final response. We use top-k sampling only in the first decoder; because the second decoder is a response modifier, we use greedy search there.

Multi-GPT2: https://github.com/caoyu-noob/Multi-GPT2

Implementation details: we apply top-k sampling directly during decoding.

Second Step: Post-evaluation

Post-evaluation consists of two parts: coherence evaluation and consistency evaluation. In coherence evaluation, the TF-IDF algorithm is used. In consistency evaluation, natural language inference (NLI) is used.

Coherence Evaluation (TF-IDF)

python coherence_evaluation.py
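As a rough illustration of TF-IDF-based coherence scoring, the sketch below ranks candidate responses by the cosine similarity between their TF-IDF vectors and that of the dialogue history. It is not the contents of coherence_evaluation.py: the ranking criterion, the whitespace tokenization, the smoothed IDF formula, and the names `tfidf_vectors`, `cosine`, `rank_by_coherence`, and `top_m` are all assumptions made for this example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF dict per tokenized document."""
    n = len(docs)
    # Document frequency: in how many documents each token appears.
    df = Counter(tok for doc in docs for tok in set(doc))
    # Smoothed IDF so tokens present in every document keep a non-zero weight.
    idf = {tok: math.log((1 + n) / (1 + c)) + 1.0 for tok, c in df.items()}
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({tok: (cnt / len(doc)) * idf[tok]
                        for tok, cnt in tf.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(tok, 0.0) for tok, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_by_coherence(history, candidates, top_m):
    """Keep the top_m candidates most similar to the history (assumed criterion)."""
    docs = [history.lower().split()] + [c.lower().split() for c in candidates]
    vecs = tfidf_vectors(docs)
    scores = [cosine(vecs[0], v) for v in vecs[1:]]
    order = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
    return [candidates[i] for i in order[:top_m]]


if __name__ == "__main__":
    history = "do you like hiking in the mountains"
    candidates = [
        "my favorite food is pizza",
        "hiking is fun",
        "i love hiking in the mountains",
    ]
    print(rank_by_coherence(history, candidates, top_m=2))
```

A candidate that shares no tokens with the history gets a score of 0 and falls to the bottom of the ranking, so it is the first to be dropped when `top_m` is smaller than the number of candidates.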

Consistency Evaluation (NLI)

Download the NLI model weights from the following link and place them in the consistent_model/ folder: link

python consistent_evaluation.py
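The sketch below shows one way NLI predictions could be used to pick a persona-consistent response. It is not the contents of consistent_evaluation.py and does not load the weights in consistent_model/. The scoring rule (entailment minus contradiction, summed over persona sentences), the names `consistency_score` and `select_most_consistent`, and the `nli_fn` callable that returns label probabilities are all assumptions; the usage example replaces a real NLI model with a lookup table of made-up probabilities.

```python
from typing import Callable, Dict, List

# Hypothetical interface: (premise, hypothesis) -> label probabilities.
NliFn = Callable[[str, str], Dict[str, float]]


def consistency_score(personas: List[str], response: str, nli_fn: NliFn) -> float:
    """Sum of (entailment - contradiction) over all persona sentences."""
    total = 0.0
    for persona in personas:
        probs = nli_fn(persona, response)
        total += probs.get("entailment", 0.0) - probs.get("contradiction", 0.0)
    return total


def select_most_consistent(personas: List[str], candidates: List[str],
                           nli_fn: NliFn) -> str:
    """Return the candidate with the highest consistency score."""
    return max(candidates, key=lambda c: consistency_score(personas, c, nli_fn))


if __name__ == "__main__":
    personas = ["i have a dog"]
    # Made-up probabilities standing in for a real NLI model's output.
    canned = {
        "i love walking my dog": {"entailment": 0.8, "contradiction": 0.0},
        "i do not have any pets": {"entailment": 0.0, "contradiction": 0.9},
    }

    def stub_nli(premise: str, hypothesis: str) -> Dict[str, float]:
        return canned[hypothesis]

    print(select_most_consistent(personas, list(canned), stub_nli))
```

Passing the NLI model in as a callable keeps the selection logic independent of how the model is loaded, so the same function works with any classifier that can be wrapped to return these label probabilities.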

