Code Monkey home page Code Monkey logo

grid-lstm's People

Contributors

coreylynch avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

grid-lstm's Issues

nngraph: Got 3 inputs instead of 5

Hi,
Training runs fine, but sampling gives the following error:

th ./sample.lua cv/lm_lstm_epoch7.01_1.3587.t7

nngraph/gmodule.lua:362: Got 3 inputs instead of 5

stack traceback:
	[C]: in function 'error'
	/home/user/torch/install/share/lua/5.1/nngraph/gmodule.lua:362: in function 'forward'
	./sample.lua:173: in main chunk
	[C]: in function 'dofile'
	/home/user/torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:150: in main chunk
	[C]: at 0x00406670

char-rnn on the same machine doesn't give error.
Updated the nngraph package:
luarocks install nngraph
The issue remains. Thank you for looking into it and for sharing.

attempt to index field 'rnn' (a nil value)

 th train.lua -model grid.lstm
using CUDA on GPU 0...
loading data files...
cutting off end of data so that the batches/sequences divide evenly
reshaping tensor...
data load done. Number of data batches in train: 423, val: 23, test: 0
vocab size: 65
creating an grid.lstm with 2 layers
number of parameters in the model: 0
cloning criterion
/usr/local/torch/install/bin/luajit: train.lua:315: attempt to index field 'rnn' (a nil value)
stack traceback:
    train.lua:315: in function 'opfunc'
    /usr/local/torch/install/share/lua/5.1/optim/adam.lua:33: in function 'adam'
    train.lua:381: in main chunk
    [C]: in function 'dofile'
    ...ocal/torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:145: in main chunk
    [C]: at 0x00405ea0

Hi, I configure as char-RNN's README and I could train char-rnn on my computer (so I suppose I set up the environment properly), but when it comes to grid-lstm, it failed. The error messages are as above.

Could you kindly tell me what's wrong with it?

Thanks!

No way to sample

I'd like to be able to sample the results of the checkpoints in order to see if the network has developed longer dependency/more creativity than the vanilla LSTM, but there's no sample.lua file, and the default sample.lua can't handle the grid-lstm model.

cudnn 7.5 installed but libcunn not recognized.

Hi:

I'm new to torch/lua but I'm familiar with running karpathy's original on a gpu with no problems. I've installed the latest cudnn 7.5. The latest version libcudnn.so.5 is but when I test your version I received the following errors referencing libcudnn.so.4 :

$ th train.lua -model grid_lstm
/home/aaron/torch/install/bin/luajit: /home/aaron/torch/install/share/lua/5.1/trepl/init.lua:384: /home/aaron/torch/install/share/lua/5.1/trepl/init.lua:384: /home/aaron/torch/install/share/lua/5.1/cudnn/ffi.lua:1278: 'libcudnn (R4) not found in library path.

Please install CuDNN from https://developer.nvidia.com/cuDNN

Then make sure files named as libcudnn.so.4 or libcudnn.4.dylib are placed in your library load path (for example /usr/local/lib , or manually add a path to LD_LIBRARY_PATH)

My paths are:

$ export CPATH=/usr/local/cuda/include:$CPATH
$ export LIBRARY_PATH=/usr/local/cuda/lib64:$LIBRARY_PATH
$ export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH

Is there a change I can make to the code to fix this problem?
Thanks,

Error when saving checkpoints.

This error occurs consistently, but only at seemingly random intervals. It also prevents sampling from or resuming training from that checkpoint.

Here's the traceback:
saving checkpoint to cv/lm_lstm_epoch0.10_1.2833.t7 /home/____/torch/install/bin/luajit: /home/____/torch/install/share/lua/5.1/torch/File.lua:210: write error: wrote 39776036 blocks instead of 41251350 at /tmp/luarocks_torch-scm-1-1419/torch7/lib/TH/THDiskFile.c:340 stack traceback: [C]: in function 'write' /home/____/torch/install/share/lua/5.1/torch/File.lua:210: in function </home/____/torch/install/share/lua/5.1/torch/File.lua:107> [C]: in function 'write' /home/____/torch/install/share/lua/5.1/torch/File.lua:210: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:235: in function 'writeObject' /home/____/torch/install/share/lua/5.1/nn/Module.lua:154: in function 'write' /home/____/torch/install/share/lua/5.1/torch/File.lua:210: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:235: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:235: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:220: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:228: in function 'writeObject' ... /home/____/torch/install/share/lua/5.1/torch/File.lua:235: in function 'writeObject' /home/____/torch/install/share/lua/5.1/nn/Module.lua:154: in function 'write' /home/____/torch/install/share/lua/5.1/torch/File.lua:210: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:235: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:235: in function 'writeObject' /home/____/torch/install/share/lua/5.1/torch/File.lua:388: in function 'save' train.lua:420: in main chunk [C]: in function 'dofile' ...than/torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:145: in main chunk [C]: at 0x00406670

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.