jacobandreas / l3 Goto Github PK
View Code? Open in Web Editor NEWLearning with latent language
Home Page: https://arxiv.org/abs/1711.00482
License: Apache License 2.0
Learning with latent language
Home Page: https://arxiv.org/abs/1711.00482
License: Apache License 2.0
Hi Jacob,
Which exp/*
directory reproduces the 80 Val, 76 Test L3 programming by demonstration results? Haven't been able to find the numbers in any of the train.out
s.
Thanks!
Hi Jacob,
Working my way slowly through L3 :) It looks like the featurized inputs are vectors of length 4608?
In [1]: import numpy as np
In [2]: d = np.load('train/inputs.feats.npy')
In [3]: d.shape
Out[3]: (9000, 4608)
However, looking at VGG16, it seems to me to look like it consists of three types of layers:
Depending on whether we use the maxpooling after the final conv, there are either 4 max-poolings, or 5, meaning that the width and height will each be divided either by 2^4 = 16 or 2^5 = 32? The number of channels in either case is set by the final conv, which is 512?
Then, given an input that is 3 x 64 x 64, the output will be either:
Then, the flattened vector size in each case would be 8192 or 2048?
It looks like in order to obtain a size of 4608, if we assume there are 512 channels, we'd need a final output dimension of 512 x 3 x 3?
What am I missing in the above analysis?
Thanks for open sourcing this work! Is there an updated link for http://people.eecs.berkeley.edu/~jda/data/shapeworld.tar.gz
?
Hi Jacob,
Really like your shapeworld dataset. Question: how can I go about creating additional data, potentially with tweaked characteristics, eg number of distractors etc?
Hugh
Hi Jacob,
Awesome paper :)
Quite hard to figure out how to run this :)
cls.py
is the entry point, and ClsModel
is the model that corresponds to the L3 "Learning with latent language" paper Table 1, is this a fair impression?python cls.py -train -n_epochs 10000
which gives outputs like:
[iter] 774
[loss] 10.2767
[trn_acc] 0.9700
[val_acc] 0.4980
[val_same_acc] 0.5280
[val_mean_acc] 0.5130
This looks like the results of training the interpretation model, is that right?
Hi @jacobandreas, this is a very interesting work. I wanted to understand how you selected the model for reporting the numbers in the paper, specifically the few-shot classification task. Was it the best model on the validation set? How many epochs of training did you run?
Hi Jacob,
Question: what license is this code provided under?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.