Confirming eval and test sets

Hi @m3hrdadfi,

Thank your for the great repository.
I just want to confirm, in the colab that you gave, the evaluation and test sets are from the same.
It is intended for demo only, right? Since the test set is included in the training process (as `eval_dataset`) 
it is not a big surprise that the performance was high.