dcmm,naver

dcmm's Introduction

Differentiable Cross Modal Model (DCMM)

This repository contains the code implementing the model introduced in Learning to Rank Images with Cross-Modal Graph Convolutions, ECIR'20, by Thibault Formal, Stéphane Clinchant, Jean-Michel Renders, Sooyeol Lee and Geun Hee Cho.

In this paper, we proposed a reformulation of unsupervised cross-modal pseudo relevance feedback mechanisms for image search, as a differentiable architecture relying on graph convolutions. Indeed, we can see the problem as a supervised representation learning task on graphs, and design graph convolutions operating jointly over text and image features (namely cross-modal graph convolutions). The proposed architecture directly learns how to combine image and text features for the ranking task, while taking into account the context given by all the other elements in the set of images to be (re-)ranked.

Requirements

The code builds on PyTorch Geometric, a library designed to train graph neural networks.

Minimal requirements:

torch==1.5.1
torch-geometric==1.5.0
pandas==1.0.2
tensorboard==2.2.2
pytrec-eval==0.4
tabulate==0.8.7

To get started

First, we recommend having a look at the PyTorch geometric library. Second, we provide a small dataset of preprocessed data from a Mediaval'17 challenge. Please have a look at the README in the exp folder to download the data and start training your own models !

Copyright 2020-present NAVER Corp.
CC BY-NC 4.0

Recommend Projects

naver / dcmm Goto Github PK

dcmm's Introduction

Differentiable Cross Modal Model (DCMM)

Requirements

To get started

dcmm's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent