The report should be ordered according to the time the screenshot was taken.
The report should have a passed/failed label on each screenshot.
When a screenshot matches, we don't need to store the screenshot or the diff. We can just use the base image. This will speed up the tests