skku-eslab / cnn-on-flash Goto Github PK

View Code? Open in Web Editor NEW

23.0 23.0 16.0 92 KB

CNN functions for dense matrices resident in flash storage

License: MIT License

CMake 1.26% C++ 98.60% Shell 0.14%

blas cnn inference-engine machine-learning memory-management

cnn-on-flash's People

Contributors

Stargazers

Watchers

Forkers

leehayun redcarrottt gh-jo hoseung2 gleegend jungjeeyoon 7bvcxz choijinwoo01 sunghern podossiu wizehun sechakb jinseok103 nugabom asdfrv100 kimjoohyungsd

cnn-on-flash's Issues

Expand the coverage of the memory budget

Now the memory budget only includes submatrices allocated on memory.
For more general usage, it is necessary to include total memory usage of the process that executes gemm in the memory budget.

Add ARM GPU support

Currently, on-flash gemm only support ARM CPU.

GPU support is needed for usage in various environments and performance comparison between processors.

Introducing CNN-on-flash to ArmCL's layers

CNN-on-flash seems to support only GEMM operators.

However, real machine learning apps usually use layer-wise operators; such as convolution layer, fully-connected layer.

If you want to apply it to convolution layer, supporting for im2col is imperative.

Need partial im2col operation

We need to add partial im2col operation.

It is necessary to operate im2col on input tensor for convolution using gemm.
im2col should be done while a partial matrix is in memory and the other parts are in storage.

Unused-variable warnings ignore GLOG

While building this project, warnings like following example occur many times.

/home/odroid/Desktop/test-cof/include/pointers/allocator.h: In instantiation of ‘void flash::unmap_file(flash::flash_ptr<X>) [with T = float]’:
/home/odroid/Desktop/test-cof/drivers/gemm.cpp:72:26:   required from here
/home/odroid/Desktop/test-cof/include/pointers/allocator.h:49:9: warning: unused variable ‘ret’ [-Wunused-variable]
     int ret = munmap(

When I look into the files that make warnings, 'unused variables' are actually used in GLOG sentences but not counted as used.
This bothers us when we modify and debug the codes. It should be solved.

Support for other operations

Is it possible to apply CNN-on-flash for other operations like Winograd?

Infinite loop when budget is short

When I set PROGRAM_BUDGET less than the size of three submatrices, it never ends but continues the loop.

This is not proper behavior because failed execution should be informed explicitly and terminated.

So it is necessary to add the functionality to judge if the budget is not enough and if termination is needed.

Convolution implementation using gemm needs weight filters of a convolutional layer to be reshaped.
To reduce the maximum memory usage, reshape procedure should progress while partial matrices are loaded from storage and reshaped sequentially, not the whole matrix loaded at the same time.

So I propose to make partial reshape operation.

It will be meaningful to compare the execution time between in-memory and on-flash GEMM.

skku-eslab / cnn-on-flash Goto Github PK

cnn-on-flash's People

Contributors

Stargazers

Watchers

Forkers

cnn-on-flash's Issues

Expand the coverage of the memory budget

Add ARM GPU support

Introducing CNN-on-flash to ArmCL's layers

Need partial im2col operation

Unused-variable warnings ignore GLOG

Support for other operations

Infinite loop when budget is short

Partial reshape operation

Detailed documentation

Peak memory usage

Performance comparison to in-memory gemm

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent