Hi, thank you for your excellent work and for sharing your research with the community!
I’m very interested in reproducing the results from your paper. Would it be possible to release the caption files produced for each dataset as described in the paper?
Regenerating the caption database from scratch can be quite expensive, and because LLM generation is stochastic, the regenerated captions may differ from the originals in ways that affect reproducibility. Having access to the original caption files would be extremely helpful for accurate benchmarking and further research.
Thank you very much for considering this request!