
Evaluator cannot return the expected metrics. #2078

Closed
lcy-seso opened this issue May 10, 2017 · 5 comments · Fixed by #2165
@lcy-seso
Contributor

lcy-seso commented May 10, 2017

In the V2 API, by default, the following code prints a given metric calculated by an evaluator.

def event_handler(event):
    if isinstance(event, paddle.event.EndIteration):
        if event.batch_id % 100 == 0:
            print "\nPass %d, Batch %d, Cost %f, %s" % (
                event.pass_id, event.batch_id, event.cost, event.metrics)

In the C++ implementation, an evaluator stores its evaluation results in Argument, but some evaluators store nothing and just print to stderr.

The above code can only print the evaluated metrics stored in Argument.value. If either of the following two situations happens, the evaluator cannot return the right result.

  1. The evaluator stores multiple evaluated metrics in a special format.
  2. The evaluator does not store the evaluated metrics but just prints them to stderr, for example, the chunk evaluator.
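From the event handler's point of view, the two failure modes look roughly like this (a hypothetical sketch; the metric dict contents and names are invented for illustration, not Paddle's actual output):

```python
# Hypothetical sketch of what event.metrics can hold in the two cases.
# The metric names below are invented for illustration.

def describe_metrics(metrics):
    """Return a printable summary of whatever the evaluator exposed."""
    if not metrics:
        # Case 2: the evaluator printed to stderr and exposed nothing
        # through Argument.value, so there is nothing to format here.
        return "<no metrics exposed>"
    # Case 1: several values packed together; the caller must know the
    # evaluator-specific format to interpret them correctly.
    return ", ".join("%s=%s" % (k, v) for k, v in sorted(metrics.items()))

print(describe_metrics({}))
print(describe_metrics({"precision": 0.8, "recall": 0.5}))
```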

For the first case, there should be some documentation explaining how each evaluator stores its results.

For the second case, the original C++ code needs to be modified.

Has anyone tested whether every evaluator returns the right results in the V2 API?

@lcy-seso lcy-seso self-assigned this May 10, 2017
@lcy-seso lcy-seso added the Bug label May 10, 2017
@lcy-seso lcy-seso added this to Global BUG (全局BUG) in V2 API Enhancement May 10, 2017
@lcy-seso lcy-seso added this to Top priorities in Defects board May 10, 2017
@lcy-seso lcy-seso moved this from Not in schedule to Next Week in Defects board May 10, 2017
@lcy-seso lcy-seso moved this from Next Week to Current Week ToDo in Defects board May 10, 2017
@reyoung
Collaborator

reyoung commented May 10, 2017

Could we fix this bug by making Evaluator able to return multiple types in Python?

Just printing to stderr is really bad design in Paddle. If an Evaluator could return any type of metric, that would seem to fix this bug. For example, with the beam search evaluator we could get the resulting sentences from metrics.
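A rough sketch of what such a multi-type metric container could look like (hypothetical names only, not Paddle's actual API):

```python
# Hypothetical container: an evaluator result that can carry any metric
# type, not just scalar floats. None of these names exist in Paddle.
class EvaluatorResult(object):
    def __init__(self):
        self.metrics = {}

    def set(self, name, value):
        # value may be a float, a list of decoded sentences from a beam
        # search evaluator, or any other Python object.
        self.metrics[name] = value

result = EvaluatorResult()
result.set("error_rate", 0.125)
result.set("beam_search_sentences", ["a b c", "a b d"])
print(result.metrics["beam_search_sentences"][0])
```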

@reyoung
Collaborator

reyoung commented May 10, 2017

I also think Evaluator in Paddle is an overused concept. Why do we need a print Evaluator at all, when we can get any layer's output in Python?

@lcy-seso lcy-seso moved this from Current Week ToDo to Next Week in Defects board May 10, 2017
@lcy-seso lcy-seso moved this from Next Week to Not in schedule in Defects board May 10, 2017
@lcy-seso
Contributor Author

lcy-seso commented May 10, 2017

@reyoung I think we should disable the evaluators that only print, and formalize the evaluators that do not store their results in Arguments, so that they work better with the v2 API.

@reyoung reyoung moved this from Not in schedule to Next Week in Defects board May 10, 2017
@lcy-seso
Contributor Author

@pkuyym will fix this bug.

@pkuyym
Contributor

pkuyym commented May 16, 2017

@lcy-seso @reyoung
I find that the v2 API calls getValue in a loop to retrieve all the metrics calculated by ChunkEvaluator. So I implemented the corresponding virtual functions of ChunkEvaluator to return Precision, Recall and F1-Score. The functions I implemented are getType, getNames and getValue. Please feel free to point out anything I've missed.
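For reference, the three chunk metrics relate as follows (a minimal worked sketch with made-up chunk counts; this is not the ChunkEvaluator source):

```python
def chunk_prf(num_correct, num_inferred, num_labeled):
    """Precision/Recall/F1 from chunk counts, as a chunk evaluator
    would report them."""
    precision = float(num_correct) / num_inferred if num_inferred else 0.0
    recall = float(num_correct) / num_labeled if num_labeled else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2.0 * precision * recall / (precision + recall)
    return precision, recall, f1

# e.g. 8 chunks correct out of 10 inferred and 16 labeled:
p, r, f1 = chunk_prf(8, 10, 16)
print("precision=%.2f recall=%.2f F1=%.4f" % (p, r, f1))
```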

@pkuyym pkuyym mentioned this issue May 16, 2017
@lcy-seso lcy-seso moved this from Next Week to Done in Defects board May 22, 2017
@lcy-seso lcy-seso removed this from Done in Defects board May 22, 2017
@luotao1 luotao1 moved this from Global BUG (全局BUG) to Done (已完成) in V2 API Enhancement May 25, 2017