RULEDST evaluation #227

IreneSucameli · 2022-03-01T09:16:21Z

Hi, could you please provide more information on how the Rule DST module is evaluated?
Thanks

zqwerty · 2022-03-10T06:05:48Z

We did not evaluate the rule DST on its own, since it needs dialog acts as input. If you want to compare rule DST with other DST models, you can use the golden dialog acts as input, or use an NLU model such as BERTNLU to parse both user and system acts.

IreneSucameli · 2022-03-21T09:04:12Z

I would like to use the output of BERTNLU as the input for the DST; however, it is not clear to me how to pass the data from one module to another, and I haven't found any code for that in ConvLab so far.

Could you kindly link the ConvLab page where this is described, or give me more information about this process?

zqwerty · 2022-03-21T12:41:36Z

You can refer to the Colab tutorial or the interface class for nlu and dst. You can see PipelineAgent for how to build an agent with modules. Example usage:
https://github.com/thu-coai/ConvLab-2/blob/master/tests/test_BERTNLU-RuleDST-RulePolicy-TemplateNLG.py

IreneSucameli · 2022-03-21T14:18:24Z

Thank you for the info. However, the Colab tutorial covers an overall evaluation (nlu + dst + nlg).
What if I want to evaluate only nlu + dst, in order to check whether the defined rules are ok or need some improvement? Is that possible? Thanks again

zqwerty · 2022-03-21T14:37:27Z

Sure. Just feed the output of NLU to DST:

ConvLab-2/convlab2/dialog_agent/agent.py

Lines 122 to 132 in ad32b76

    
        self.input_action = self.nlu.predict(observation, context=[x[1] for x in self.history[:-1]])
    else:
        self.input_action = observation
    self.input_action = deepcopy(self.input_action) # get rid of reference problem
    # get state
    if self.dst is not None:
        if self.name is 'sys':
            self.dst.state['user_action'] = self.input_action
        else:
            self.dst.state['system_action'] = self.input_action
        state = self.dst.update(self.input_action)
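
Outside of PipelineAgent, the same hand-off can be sketched for a single user turn as below. This is only an illustration: `ToyNLU` and `ToyDST` are hypothetical stand-ins that mimic the `predict(...)` / `update(...)` calls and the `state['user_action']` field seen in the snippet, and the `[intent, domain, slot, value]` act format and `belief_state` key are assumptions. In practice you would swap in BERTNLU and RuleDST.

```python
from copy import deepcopy


class ToyNLU:
    """Hypothetical stand-in with the same predict(utterance, context) call shape."""

    def predict(self, utterance, context=None):
        acts = []
        if "east" in utterance.lower():
            # Assumed act format: [intent, domain, slot, value]
            acts.append(["Inform", "Hotel", "Area", "east"])
        return acts


class ToyDST:
    """Hypothetical stand-in exposing .state and update(user_act)."""

    def __init__(self):
        self.state = {"user_action": [], "system_action": [], "belief_state": {}}

    def update(self, user_act):
        # A single toy rule: every Inform act overwrites the slot value.
        for intent, domain, slot, value in user_act:
            if intent == "Inform":
                domain_state = self.state["belief_state"].setdefault(domain.lower(), {})
                domain_state[slot.lower()] = value
        return self.state


def nlu_to_dst(nlu, dst, utterance, history):
    """Parse one user utterance and feed the resulting acts to the tracker."""
    user_act = deepcopy(nlu.predict(utterance, context=history))
    dst.state["user_action"] = user_act
    return dst.update(user_act)


state = nlu_to_dst(ToyNLU(), ToyDST(), "I need a hotel in the east", [])
print(state["belief_state"])
```

The `deepcopy` mirrors the snippet: the tracker should not share a reference to the act list with the NLU output.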

IreneSucameli · 2022-03-22T14:59:29Z

From the code you posted it doesn't seem that the module is evaluated with F1 scores or a similar measure... perhaps I don't understand your point...

zqwerty · 2022-03-23T09:26:28Z

Sorry, I thought you needed instructions on how to pass the output of NLU to DST. If you want to evaluate NLU+DST, you can write a script to: 1) read the original data; 2) pass utterances to NLU to get the user dialog acts; 3) pass user dialog acts to RuleDST to get the predicted state; 4) compare predictions with references.
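
A rough sketch of such a script, under several assumptions: the data layout (a list of dialogues, each a list of turns with `utterance` and `reference_state`), `toy_nlu`, and `make_toy_tracker` are all hypothetical placeholders, not ConvLab-2 code. The real script would load the dataset and call BERTNLU and RuleDST instead. The score here is the fraction of turns whose predicted state matches the reference exactly.

```python
def evaluate_nlu_dst(dialogues, nlu_predict, make_tracker):
    """Run NLU -> DST turn by turn and return exact-match accuracy over turns."""
    correct = total = 0
    for dialogue in dialogues:
        tracker = make_tracker()  # fresh tracker state for every dialogue
        history = []
        for turn in dialogue:
            acts = nlu_predict(turn["utterance"], history)    # step 2
            predicted = tracker(acts)                         # step 3
            correct += int(predicted == turn["reference_state"])  # step 4
            total += 1
            history.append(turn["utterance"])
    return correct / total if total else 0.0


def toy_nlu(utterance, history):
    """Hypothetical keyword NLU returning (domain, slot, value) triples."""
    lexicon = {
        "east": ("hotel", "area", "east"),
        "cheap": ("hotel", "price", "cheap"),
    }
    text = utterance.lower()
    return [triple for keyword, triple in lexicon.items() if keyword in text]


def make_toy_tracker():
    """Hypothetical rule tracker: the latest value for a slot wins."""
    state = {}

    def update(acts):
        for domain, slot, value in acts:
            state[(domain, slot)] = value
        return dict(state)  # copy, so earlier predictions are not mutated

    return update


# Step 1 stand-in: a tiny hand-written dataset instead of the original data.
dialogues = [[
    {"utterance": "A hotel in the east",
     "reference_state": {("hotel", "area"): "east"}},
    {"utterance": "Something cheap please",
     "reference_state": {("hotel", "area"): "east", ("hotel", "price"): "cheap"}},
    {"utterance": "Actually, make it moderate",
     "reference_state": {("hotel", "area"): "east", ("hotel", "price"): "moderate"}},
]]

accuracy = evaluate_nlu_dst(dialogues, toy_nlu, make_toy_tracker)
print(f"turn-level exact match: {accuracy:.3f}")
```

The third turn is missed on purpose (the toy lexicon has no entry for "moderate"), which shows how NLU errors propagate into the tracked state.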

zqwerty · 2022-03-23T09:27:29Z

Refer to https://github.com/thu-coai/ConvLab-2/blob/master/convlab2/dst/evaluate.py for the DST metrics.
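
For an F1-style score, one possible slot-level computation is sketched below. It is not copied from `evaluate.py`, and the flat `{"domain-slot": value}` state layout is an assumption made for the example.

```python
def slot_f1(predictions, references):
    """Micro-averaged precision, recall and F1 over (slot, value) pairs."""
    tp = fp = fn = 0
    for pred, ref in zip(predictions, references):
        pred_pairs, ref_pairs = set(pred.items()), set(ref.items())
        tp += len(pred_pairs & ref_pairs)  # correct slot-value pairs
        fp += len(pred_pairs - ref_pairs)  # predicted but not in the reference
        fn += len(ref_pairs - pred_pairs)  # in the reference but not predicted
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical per-turn states: one dict per turn.
predictions = [
    {"hotel-area": "east"},
    {"hotel-area": "east", "hotel-price": "cheap"},
]
references = [
    {"hotel-area": "east"},
    {"hotel-area": "east", "hotel-price": "moderate"},
]

precision, recall, f1 = slot_f1(predictions, references)
print(precision, recall, f1)
```

A wrong value counts both as a false positive (the predicted pair) and as a false negative (the missed reference pair), so a single slot error lowers precision and recall together.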

IreneSucameli · 2022-03-23T14:20:57Z

Ok, thanks, I'll try it this way!

IreneSucameli added the "feature" label (Feature to add in the future) on Mar 1, 2022
