jacobandreas / l3 Goto Github PK

View Code? Open in Web Editor NEW

50.0 50.0 4.0 8.97 MB

Learning with latent language

Home Page: https://arxiv.org/abs/1711.00482

License: Apache License 2.0

Python 96.87% Shell 3.13%

l3's People

Contributors

Stargazers

Watchers

Forkers

lim0606 jayelm catherinewong hughperkins

l3's Issues

Reproduce L3 PbD results

Hi Jacob,

Which exp/* directory reproduces the 80 Val, 76 Test L3 programming by demonstration results? Haven't been able to find the numbers in any of the train.outs.

Thanks!

Why is the size of the featurized images 4608, and not either 8192 or 2048?

Hi Jacob,

Working my way slowly through L3 :) It looks like the featurized inputs are vectors of length 4608?

In [1]: import numpy as np

In [2]: d = np.load('train/inputs.feats.npy')

In [3]: d.shape
Out[3]: (9000, 4608)

However, looking at VGG16, it seems to me to look like it consists of three types of layers:

max-pooling, which halves the width and the height
relu, which doesnt change any dimension
padded conv, which also doesnt change the width or the height; and fixes the channels to an absolute number

Depending on whether we use the maxpooling after the final conv, there are either 4 max-poolings, or 5, meaning that the width and height will each be divided either by 2^4 = 16 or 2^5 = 32? The number of channels in either case is set by the final conv, which is 512?

Then, given an input that is 3 x 64 x 64, the output will be either:

512 * 4 * 4 (if we ignore the pooling after the final conv), or
512 * 2 * 2 (if we include it)

Then, the flattened vector size in each case would be 8192 or 2048?

It looks like in order to obtain a size of 4608, if we assume there are 512 channels, we'd need a final output dimension of 512 x 3 x 3?

What am I missing in the above analysis?

Shapesworld link is not valid

Thanks for open sourcing this work! Is there an updated link for http://people.eecs.berkeley.edu/~jda/data/shapeworld.tar.gz ?

Code to generate additional shapeworld data?

Hi Jacob,

Really like your shapeworld dataset. Question: how can I go about creating additional data, potentially with tweaked characteristics, eg number of distractors etc?

Hugh

How to run this to reproduce Table 1 results?

Hi Jacob,

Awesome paper :)

Quite hard to figure out how to run this :)

it looks like cls.py is the entry point, and ClsModel is the model that corresponds to the L3 "Learning with latent language" paper Table 1, is this a fair impression?
I've tried running with:

python cls.py -train -n_epochs 10000

which gives outputs like:

[iter]    774
[loss]    10.2767
[trn_acc] 0.9700
[val_acc] 0.4980
[val_same_acc] 0.5280
[val_mean_acc] 0.5130

This looks like the results of training the interpretation model, is that right?

How can I train in addition the proposal model?
How can I train and run the full model, as in Table 1?

Model selection

Hi @jacobandreas, this is a very interesting work. I wanted to understand how you selected the model for reporting the numbers in the paper, specifically the few-shot classification task. Was it the best model on the validation set? How many epochs of training did you run?

License?

Hi Jacob,

Question: what license is this code provided under?

jacobandreas / l3 Goto Github PK

l3's People

Contributors

Stargazers

Watchers

Forkers

l3's Issues

Reproduce L3 PbD results

Why is the size of the featurized images 4608, and not either 8192 or 2048?

Shapesworld link is not valid

Code to generate additional shapeworld data?

How to run this to reproduce Table 1 results?

Model selection

License?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent