The goal of this project to run Convolutional Neural Network layers for flash-resident-matrices. Now gemm using ARM CPU is implemented.
This project is implemented based on BLAS-on-flash
- BLAS-on-flash https://github.com/microsoft/BLAS-on-flash
- Arm Compute Library https://github.com/ARM-software/ComputeLibrary
- Ubuntu 16.04
- Arm Compute Library 19.02
- built with neon option turned on
git clone
vim CMakeLists.txt
- modify
set (ACL_ROOT [arm_compute_library_path])
- modify
mkdir bin && cd bin
cmake ..
make
cd ..
gemm execution
cd misc
chmod +x gemm.sh
./exec.sh [A_row] [B_row] [B_col]