I believe the evaluator currently only supports text input. It would be useful if it could also accept tokenized input data (ie. if the data is already stored in tokenized format or we want to isolate the inference process from the tokenization process).