Light

jsikyoon / visual-interaction-networks_tensorflow Goto Github PK

View Code? Open in Web Editor NEW

132.0 8.0 30.0 631 KB

Tensorflow Implementation of Visual Interaction Networks

Home Page: https://arxiv.org/abs/1706.01433

License: MIT License

Python 100.00%

tensorflow relational-reasoning computer-vision agi physics-engine deepmind interaction-nets

visual-interaction-networks_tensorflow's Introduction

Visual Interaction Networks

Tensorflow Implementation of Visual Interaction Networks from Google Deepmind.

Implementation is on Tensorflow r1.2.

https://arxiv.org/abs/1706.01433

"Another key part of relational reasoning involves predicting the future in a physical scene. From just a glance, humans can infer not only what objects are where, but also what will happen to them over the upcoming seconds, minutes and even longer in some cases. For example, if you kick a football against a wall, your brain predicts what will happen when the ball hits the wall and how their movements will be affected afterwards (the ball will ricochet at a speed proportional to the kick and - in most cases - the wall will remain where it is)." From an article of Deepmind

https://deepmind.com/blog/neural-approach-relational-reasoning/

N-objects Gravity Simulations

For changing configure values, please check constants script.

cat constracts.py

For generating images and data,

python physical_engines.py

For modeling Visual Interaction Networks

python gravity_vin.py

Data

The Data are gathered from my own implemented physics engine, which is same of interaction network git repo. https://github.com/jaesik817/Interaction-networks_tensorflow

One different thing from IN physics engines is time difference between frames, which was 0.0001 in IN repo and is 0.001 in this. Because 0.0001 secs frame cannot be recognized in 32 x 32 images.

Settings

Settings are written in constants.py and gravity_vin.py. The number of objects, frames on each simulations and rollout frames are 3, 50 and 20. The training max epoches are 80000. In physical_engines code, every frames are saved as image and coded data, and those things are used in gravity_vin script. Each image has background ones from CIFAR 10 training data set as the paper.

Results

The loss decreased as followed, which is summarized value of losses on near future 8 frames and encoding-decoding losses on input images.

The quilititive results are as followed.

True :

Modeling :

visual-interaction-networks_tensorflow's People

Contributors

Stargazers

Watchers

visual-interaction-networks_tensorflow's Issues

TypeError range object doesn’t support item assignment

def swap (sequence, i,j):
temp = sequence[i]
sequence[i] = sequence[j]
I am new in python and I would like to solve this issue. Thank you for your help !

TypeError: 'range' object does not support item assignment

Changed
np.random.shuffle(..._idx);
to
np.random.shuffle(list(..._idx));
for it to work for python3.

rel_num wrong. vim.py line 162. real_num=int(FLAGS.No*(FLAGS.N0-1))

updating code to TF 1.4

THIS IS NOT AGI

please remove the tag(s).

Implementation Difference

Hello,
Thank you for implementation. I was checking your code and i saw some difference at implementation between paper and your code. for example: according to the supplementary material you should first concatenate and apply a linear layer, but you are for applying a linear layer and then doing the concatenation op why do you need this difference?

Checking Functionality with larger img then current size

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.