entalent / vision-and-language-papers Goto Github PK
View Code? Open in Web Editor NEWIndexing conference papers in the field of vision and language, including image/video captioning, visual question answering (VQA), vision-language navigation (VLN) and other related topics