Labels do not match the prediction array for data labeled by model. Deleted that output for now because the prediction values are what's a bit more useful and important and those are correct- but would be good to fix this. In inference.py script the 'out' dictionary contains this.