CLIP will recognize this image as a hot dog with a very high probability close to 1, but the actual label should be a person. Is there a solution? 