soulaimene / p2m_image_captioning
The ViT-GPT2 architecture for image captioning. It includes a code implementation that lets you train your own network on a configurable amount of data for a chosen number of epochs.
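As a rough illustration of what "a configurable amount of data" and "a chosen number of epochs" could look like in practice, here is a minimal sketch in plain Python. It is not the repository's code: `TrainConfig`, `select_subset`, `total_steps`, and the placeholder image-caption pairs are all hypothetical names introduced for this example, and the model itself is left out.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class TrainConfig:
    # All field names are hypothetical, not taken from the repository.
    num_samples: int      # how many image-caption pairs to train on
    epochs: int           # how many passes over that subset
    batch_size: int = 8   # assumed default, chosen only for the example
    seed: int = 0         # fixes the subset so runs are repeatable


def select_subset(pairs, cfg):
    """Return a reproducible random subset of at most cfg.num_samples pairs."""
    rng = random.Random(cfg.seed)
    k = min(cfg.num_samples, len(pairs))
    return rng.sample(pairs, k)


def total_steps(n_items, cfg):
    """Number of optimizer steps: batches per epoch times number of epochs."""
    return math.ceil(n_items / cfg.batch_size) * cfg.epochs


# Placeholder (image path, caption) pairs standing in for a real dataset.
pairs = [(f"img_{i}.jpg", f"caption {i}") for i in range(100)]

cfg = TrainConfig(num_samples=20, epochs=3)
subset = select_subset(pairs, cfg)
steps = total_steps(len(subset), cfg)
```

Keeping the data budget and the epoch count in one small config object makes it easy to scale a run up or down without touching the training loop; the step count is useful for sizing a learning-rate schedule.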