Thanks for the great work: I enjoyed reading the paper, and your proposed iterative improving strategy seems super interesting.
I am interested in the human data you collected (DecomposedCaptions4k with 24960 human annotations), is there a planned release date for this?
Also a couple more related questions:
I could not find the supplementary material of your paper online (it wasn't attached to the arxiv paper), are you planning an update to arxiv; I would be super interested in going over the supp. mat.
For the human data, did you run two separate experiments? One for the preference study with two images and one prompt displayed, and another for single image-single prompt alignment?
Hi @1jsingh. Thanks for the nice work!
I noticed you mentioned that data (Decomposable-Captions-4k) was supposed to be released in September 2023. It seems like a significant amount of time has passed. May I ask when it is expected to be released?