Code Monkey home page Code Monkey logo

erich-hs / google-gldv2-2020 Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 32.52 MB

Google's Landmark Recognition Challenge 2020. This repository contains my implementation of a metric learning solution with cosine similarity search (using an EfficientNet backbone for image embedding), paired with a DELF module for reranking based on local features of the images. These techniques were adapted from the 2020 Recognition challenge winner and 2019 Recognition challenge 2nd place papers.

Jupyter Notebook 100.00%

google-gldv2-2020's Introduction

Contributors

Google Landmark Recognition Challenge 2020

Architecture

About The Challenge

Google Landmark Recognition 2020 was the third instance of the competition sponsored by Google and hosted on Kaggle. It challenged Kagglers to build models that recognize the correct landmark (if any) on a public and a private test set of real pictures taken by Google operators.

GLDv2

The training data for the competition comes from a cleaned version of the original Google Landmarks Dataset v2 (GLDv2), which consists of over 5M images and over 200k distinct instance labels (landmarks).

Challenges

The biggest challenges in this competition and to the landmark recognition task are captured by the GLDv2 dataset, containing:

  1. Extremely skewed class distribution. While famous landmarks might have tens of thousands of image samples, 57% of classes have at most 10 images and 38% of classes have at most 5 images.
  2. Intra-class variability. Landmarks have views from different vantage points, and of different details, as well as both indoor and outdoor views of buildings.
  3. Out-of-domain query images. The query set consists of only 1.1% images of landmarks and 98.9% out-of-domain images, for which no result should be expected.

More details about the dataset construction, its cleaned version, and other particular challenges related to the Landmark Recognition task are available in the dataset paper.

Medium Article

I dedicated a Medium Article where I explain, illustrate, and implement the ideas behind a baseline architecture for landmark image recognition. There, I cover its core concepts from theory to practical results.

About this Repository

This repository contains my implementation of a metric learning solution with cosine similarity search (using an EfficientNet backbone for image embedding), paired with a DELF module for reranking based on local features of the images. These techniques were adapted from the 2020 Recognition challenge winner and 2019 Recognition challenge 2nd place papers. The notebooks contained here illustrate the implementation on a subset of the competition data.

My Kaggle Notebook

The notebooks in this repository were split from my original Kaggle notebook to enable rendering on Github. Please consider leaving an upvote on my notebook if you find the content helpful for your applications!

(back to top)

google-gldv2-2020's People

Contributors

erich-hs avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.