Loci

Loci is an unsupervised, disentangled LOCation and Identity tracking system. It excels on CATER and related object tracking challenges, and it exhibits emergent object permanence and stable entity disentanglement through fully unsupervised learning.

Paper: "Learning What and Where - Unsupervised Disentangling Location and Identity Tracking" | arXiv

Video: CATER.Snitch.Tracking.Challenge.mp4

Requirements

A suitable conda environment named loci can be created and activated with:

conda env create -f environment.yaml
conda activate loci

Dataset and trained models

A preprocessed CATER dataset together with the 5 trained networks from the paper can be found here

The dataset folder (CATER) needs to be copied to data/data/video/
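Before training or evaluating, it can help to confirm that the dataset landed in the expected place. The following is a minimal sketch, assuming the scripts are run from the repository root. `dataset_ready` is a hypothetical helper and not part of the Loci code base; the only path taken from this README is `data/data/video/CATER`.

```python
from pathlib import Path


def dataset_ready(repo_root="."):
    """Return True if a non-empty CATER folder sits at data/data/video/CATER.

    Hypothetical helper for illustration; not part of the Loci repository.
    """
    cater_dir = Path(repo_root) / "data" / "data" / "video" / "CATER"
    # The folder must exist and contain at least one entry.
    return cater_dir.is_dir() and any(cater_dir.iterdir())


if __name__ == "__main__":
    print("CATER dataset found" if dataset_ready() else "CATER dataset missing")
```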

Interactive GUI

Video: Loci-Latent-GUI.mp4

We provide an interactive GUI to explore the representations learned by the model. The GUI loads the extracted latent state for one slot. In the top-left grid, individual bits of the gestalt code can be flipped; in the top-right image, the position can be changed by clicking or scrolling. The bottom half of the GUI shows the composition of the background with the reconstructed slot content, as well as the entity's RGB representation and mask.
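The two GUI interactions can be pictured as simple edits on a slot's latent state. The sketch below is illustrative only: it assumes the gestalt code can be treated as a tuple of 0/1 values and the position as an (x, y) pair, which matches the description above but may not reflect how Loci stores these internally. `SlotState`, `flip_bit`, and `move_to` are hypothetical names.

```python
from dataclasses import dataclass, replace
from typing import Tuple


@dataclass(frozen=True)
class SlotState:
    """Toy stand-in for one slot's latent state (assumed layout)."""
    gestalt: Tuple[int, ...]       # binary gestalt code, one entry per bit
    position: Tuple[float, float]  # (x, y) location of the entity


def flip_bit(state: SlotState, index: int) -> SlotState:
    """Return a copy of the state with one gestalt bit inverted."""
    bits = list(state.gestalt)
    bits[index] = 1 - bits[index]
    return replace(state, gestalt=tuple(bits))


def move_to(state: SlotState, x: float, y: float) -> SlotState:
    """Return a copy of the state with a new position; the gestalt is untouched."""
    return replace(state, position=(x, y))


state = SlotState(gestalt=(1, 0, 0, 1), position=(0.0, 0.0))
edited = move_to(flip_bit(state, 1), 0.5, -0.25)
print(edited)
```

Keeping identity (gestalt) and location in separate fields mirrors the disentanglement the GUI exposes: either one can be changed without touching the other.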

Run the GUI (extracted latent states can be found here):

python -m model.scripts.playground -cfg model/cater.json \
  -background data/data/video/CATER/background.jpg -load net2.pt \
  -latent latent-states/net2/latent-0000-07.pickle
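The layout of the latent-state pickle files is not described here, so one way to get oriented is to load a file and print what its top-level object looks like. This is a minimal sketch using only the standard library. `describe_latent` is a hypothetical helper, and it is an assumption that the file may need the project's dependencies (for example PyTorch) to be importable if it stores tensors. As with any pickle, only load files from a source you trust.

```python
import pickle
from pathlib import Path


def describe_latent(path):
    """Load a pickle file and return a short summary of its top-level object.

    Hypothetical helper; makes no assumption about Loci's actual file layout.
    """
    with Path(path).open("rb") as handle:
        obj = pickle.load(handle)  # unpickling can run code: trusted files only
    summary = {"type": type(obj).__name__}
    if isinstance(obj, dict):
        # For dict-like states, list the keys so the fields can be explored.
        summary["keys"] = sorted(str(key) for key in obj)
    elif isinstance(obj, (list, tuple)):
        summary["length"] = len(obj)
    return summary


if __name__ == "__main__":
    example = Path("latent-states/net2/latent-0000-07.pickle")
    if example.exists():
        print(describe_latent(example))
```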

Training

Training can be started with:

python -m model.main -train -cfg model/cater-stage1.json

Evaluation

A trained model can be evaluated with:

python -m model.main -eval -testset -cfg model/cater.json -load net1.pt

Images and latent states can be generated using:

python -m model.main -save -testset -cfg model/cater.json -load net1.pt
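Since five trained networks are provided, evaluation is likely to be repeated once per checkpoint. The sketch below rebuilds the `-eval` command from this section for a list of checkpoints and, if asked, runs each one. The file names `net3.pt` to `net5.pt` are an assumption extrapolated from `net1.pt` and `net2.pt`; `build_eval_command` and `evaluate_all` are hypothetical helpers. It uses `sys.executable` rather than a bare `python` so that the interpreter of the active conda environment is reused.

```python
import subprocess
import sys


def build_eval_command(checkpoint, config="model/cater.json"):
    """Return the argv list for the evaluation command from this README."""
    return [sys.executable, "-m", "model.main", "-eval", "-testset",
            "-cfg", config, "-load", checkpoint]


def evaluate_all(checkpoints, dry_run=True):
    """Build (and, unless dry_run, execute) one evaluation per checkpoint."""
    commands = [build_eval_command(ckpt) for ckpt in checkpoints]
    for command in commands:
        if dry_run:
            print(" ".join(command))
        else:
            # check=True raises CalledProcessError if an evaluation fails.
            subprocess.run(command, check=True)
    return commands


if __name__ == "__main__":
    # Assumed naming: net1.pt ... net5.pt
    evaluate_all([f"net{i}.pt" for i in range(1, 6)])
```

With `dry_run=True` (the default) the commands are only printed, which makes it easy to check them before starting a long evaluation.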

Contributors

manueltraub

loci's Issues

Moving MNIST

Hi, thanks for publishing the code.
I would like to train the model on a custom dataset, and for reference it would be helpful to be able to train it on moving MNIST as well.
Could you please provide the hyperparameters / config needed to reproduce the moving MNIST results reported in your paper? This would be much appreciated.

Preprocessed dataset and pretrained weights link expired

Thanks for your great work!

I tried to download the preprocessed dataset and pretrained weights provided in the repo, but the link shows:

"Sorry, the link has expired.
The link was set to expire after a certain amount of time. Please contact the person who shared this link with you."

Could you please kindly check the SharePoint link? Thanks a lot!
