The gradient with respect to the slots_mu and slots_sigma variables is zero. To learn the initialization of the slots, you could change line 40 of your model.py to slots = torch.distributions.Normal(mu, sigma).rsample(); with rsample() the gradients will flow through these variables.
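For reference, here is a minimal sketch of what that change could look like, assuming slots_mu and slots_log_sigma are learnable nn.Parameter tensors (the names, shapes, and module structure here are assumptions and may differ from the actual model.py):

```python
import torch
import torch.nn as nn

class SlotInit(nn.Module):
    """Hypothetical module illustrating differentiable slot initialization."""

    def __init__(self, num_slots, slot_dim):
        super().__init__()
        # Learnable initialization statistics for the slots.
        self.slots_mu = nn.Parameter(torch.zeros(1, num_slots, slot_dim))
        self.slots_log_sigma = nn.Parameter(torch.zeros(1, num_slots, slot_dim))

    def forward(self, batch_size):
        mu = self.slots_mu.expand(batch_size, -1, -1)
        sigma = self.slots_log_sigma.exp().expand(batch_size, -1, -1)
        # rsample() applies the reparameterization trick (mu + sigma * eps,
        # with eps ~ N(0, 1)), so gradients flow back into slots_mu and
        # slots_log_sigma, unlike a plain sample() or torch.normal() call.
        return torch.distributions.Normal(mu, sigma).rsample()
```

Parameterizing the scale as exp(slots_log_sigma) also keeps sigma strictly positive throughout training.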
Thanks for your implementation of the Slot Attention module. However, I found that the sampling operation (line 40 of model.py) prevents gradients from flowing during back-propagation. During training, the gradients of slot_mu and slot_sigma are zero, which means these two variables will never be updated. I think the reparameterization trick is needed to make the sampling operation differentiable.
Hello and thank you for the great work.
I think the eval script is missing the pre-trained model checkpoint ('./tmp/model3.ckpt'), as indicated in the error I received. Could you please show me how I can obtain it?
Thank you in advance.