Code Monkey home page Code Monkey logo

matdl's Introduction

matDL

matDL icon
v0.42 BETA
A lightweight MATLAB deeplearning toolbox,based on gpuArray.
One of the fastest matlab's RNN libs.

Performance

model:A LSTM model has [1024,1024,1024] hidensizes and 10 timestep with a 256 dims input.
Device: i7-4710hq,GTX940m
matDL: 60sec/epoch Keras(1.2.2,Tensorflow backend,cudnn5.1): 29sec/epoch

Features

High parallel Implementation.

  • Concatance the weights of 4 gates to W and the values of x and h of every timesteps in a batch to a 3D tensor xh.Compute x*W for every timesteps of every samples in a batch at one time.
  • Compute the activated values of input,forget ,ouput gates at one time.

OOP style

  • Use struct type to define a layer class and a model class.Define ff, bp, optimize methods by using a FunctionHandle.

APIs

Model

  • A model is a set of layers,data and optimizer.
  • build
    • model=model_init(input_shape,configs ,flag,optimizer)
    • arguments:
      • input_shape : a vector,[input_dim,batchsize] or [input_dim,timestep,batchsize]
      • configs : cell ,configures of each layers
      • flag : bool ,0 is predict model,1 is trrain model
      • optimizer : struct ,keywords: opt(type of optimizer) ,learningrate
  • attributes :
    • model.input_shape
    • model.output_shape
    • model.batchsize
    • model.configs
    • model.flag
    • model.layers
    • model.optimizer (if flag)
    • model.loss
  • methods:
    • private:
      • model.eval_loss=@(outputlayer,y_true,flag)eval_loss(outputlayer,y_true,flag)
      • model.optimize=@(layer,optimizer,batch,epoch)layer_optimize(layer,optimizer,batch,epoch)
    • public:
      • model.train=@(model,x,y,nb_epoch,verbose,filename)model_train(model,x,y,nb_epoch,verbose,filename)
        • model=model.train(model,x,y,nb_epoch,verbose,filename)
          • arguments:
            • model : self
            • x:input,shape:[dim,timestep,nb_samples],or [dim,nb_samples]
            • y:targets
            • nb_epoch: how many epochs you want to train
            • verbose :0,1,2,3,0 means no waitbar an figure,1 means showing waitbar only,2 means showing waitbar and plotting figures every epoch,3 means showing waitbar and plotting figures every epoch an batch.
      • model.predict=@(model,x)model_predict(model,x)
        • y=model.predict(model,x)
      • model.evaluate=@(model,x,y_true)model_evaluate(model,x,y_true)
        • mean_loss=model.evaluate(model,x,y_true)
      • model.save=@(filename)model_save(model,filename)
        • model.save(filename)
        • Save layers weigths and configs to a.mat file.
  • reload:
    • model=model_load(minimodel,batch_size,flag,optimizer)
      • minimodel is the minimodel saved by model.save(),can be a struct variable or a string of filename.
  • example: x=rand(100,10,3200,'single','gpuArray');
    y=(zeros(512,10,3200'single','gpuArray'));
    y(1,:,:)=1;
    %% Define a model which has 2 lstm layers with 512 hiddenunits,and a timedistrbuted dense layer with 512 hiddenunits
    input_shape=[100,10,64];%input dim is 100,timestep is 10,batchsize is 64
    hiddensizes=[512,512,512];
    for l=1:length(hiddensize)
    configs{l}.type='lstm';
    configs{l}.hiddensize=hiddensize(l);
    configs{l}.return_sequence=1;
    end
    configs{l+1}.type='activation';
    configs{l+1}.act_fun='softmax';
    configs{l+1}.loss='categorical_cross_entropy';
    optimizer.learningrate=0.1;
    optimizer.momentum=0.2;
    optimizer.opt='sgd'; model=model_init(input_shape,configs,1,optimizer);
    %% Train the model
    model=model.train(model,x,y,nb_epoch,3,'example/minimodel_f.mat');
    or
    test_lstm(50,[512,512,512],256,10,64,5);

Layers

Layer class:

  • attributes:
    • type : string,type of the layer,available types:input,dense,lstm,activation
    • prelayer_type : string,type of the previous layer,available types:input,dense,lstm,activation
    • trainable : bool,is the layer trainable
    • flag : train model or predict model
    • configs :configures of the layer
    • input_shape : vector,[input_dim,batchsize] or [input_dim,timestep,batchsize]
    • output_shape : vector,[hiddensize,batchsize]or[hiddensize,timestep,batchsize]
    • batch : int,how many batches have been passed
    • epoch : same to batch
  • methods:
    • layer=**layer_init(prelayer,loss,kwgrs)
      • Built and init a layer.If the layer is a input layer,prelayer argument should be input_shape
    • layer=layer.ff(layer,prelayer)
    • layer=layer.bp(layer,nextlayer)
    LSTM layer(layer)
      * `layer=lstm_init_gpu(prelayer,hiddensize,return_sequence,flag,loss)`
      * A LSTM(**Long-Short Term Memory unit - Hochreiter 1997**) layer,see [there]:http://deeplearning.net/tutorial/lstm.html for a step-by-step description of the algorithm.
          * aviliable configures:
              * `config.hiddensize` : `int`(`double`),number of hidden units(output dim)
              * `config.return_sequence` :`bool`(`double`),return sequences or not.if `return_sequences`,output will be a 3D tensor with shape (hiddensize,timestep,batchsize). Else ,a 2D tensor with shape (hiddensize,batchsize). 
              * `config.loss` : `string`,type of loss function.Optional,only be used if the layer is an ouput layer.
              * **example**
    

matdl's People

Contributors

quantumliu avatar

Watchers

James Cloos avatar shuai.nie avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.