The current timing info for an eval looks like this:

    llm.eval.test: 12748ms

It would be great to also include:

- the prompt being tested
- the test number
- the LLM being tested
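
To make the request concrete, here is a minimal sketch of what an enriched timing line could look like. It is not the project's actual logging code: `format_timing`, `timed_eval`, the `key=value` field layout, and the prompt truncation length are all assumptions for illustration, and `example-model` is a placeholder name.

```python
import time
from contextlib import contextmanager


def format_timing(label, elapsed_ms, *, prompt, test_number, model, max_prompt_chars=40):
    """Build one timing line carrying the three requested fields (hypothetical layout)."""
    # Collapse whitespace and truncate long prompts so the line stays on one row.
    flat = " ".join(prompt.split())
    if len(flat) > max_prompt_chars:
        flat = flat[: max_prompt_chars - 3] + "..."
    return f'{label}: {elapsed_ms:.0f}ms test={test_number} model={model} prompt="{flat}"'


@contextmanager
def timed_eval(label, *, prompt, test_number, model, emit=print):
    """Time the wrapped block and emit a single enriched line when it finishes."""
    start = time.perf_counter()
    try:
        yield
    finally:
        # Report the timing even if the eval raised, so slow failures stay visible.
        elapsed_ms = (time.perf_counter() - start) * 1000
        emit(format_timing(label, elapsed_ms, prompt=prompt,
                           test_number=test_number, model=model))


if __name__ == "__main__":
    # Hypothetical usage: wrap one eval call.
    with timed_eval("llm.eval.test", prompt="Summarize this article",
                    test_number=3, model="example-model"):
        time.sleep(0.01)  # stand-in for the real LLM call
```

Keeping the existing `label: NNNNms` prefix unchanged means anything that already parses the current output would keep working, with the extra fields appended after it.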