
hands-on-gemm's Introduction

Hands-on-GEMM

A GEMM tutorial.

Performance

SGEMM performance comparison

Usage

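In the src/cuda folder, find the GEMM kernel whose performance you want to measure and note its name. Then return to the project root, run mkdir build, and then run make benchmark_xxx.

For example, to measure the performance of the matrix multiplication in double_buffer_yhs_refine_gemm.cu, run: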

make benchmark_double_buffer_yhs_refine

The binary will then appear in the bin folder.
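The example above suggests that a benchmark target is the kernel file name with the `_gemm.cu` suffix removed and `benchmark_` prepended. A minimal sketch of that mapping follows. It assumes every kernel in src/cuda follows the same naming pattern, which the README shows for only one file. The `benchmark_target` helper is hypothetical and not part of the project.

```shell
# Hypothetical helper: map a kernel source file to its benchmark target.
# Assumption: all kernels follow the <name>_gemm.cu -> benchmark_<name>
# pattern seen in the double_buffer_yhs_refine example.
benchmark_target() {
    file=${1##*/}                               # drop any leading directory, e.g. src/cuda/
    printf 'benchmark_%s\n' "${file%_gemm.cu}"  # strip the _gemm.cu suffix
}

benchmark_target src/cuda/double_buffer_yhs_refine_gemm.cu
# prints: benchmark_double_buffer_yhs_refine
```

From the project root, the result could then be passed to make, for instance `make "$(benchmark_target src/cuda/double_buffer_yhs_refine_gemm.cu)"`, provided the build folder already exists.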

Tutorial

Zhihu link: here

hands-on-gemm's People

Contributors

pzzzzz5142


hands-on-gemm's Issues

Interesting project

It seems you want to implement an int8 GEMM for NLP inference? I wonder why you chose to work directly with the WMMA API for int8 GEMM instead of using an open-source library like CUTLASS or a vendor library like cuBLAS. I have implemented some conflict-free int8 GEMM kernels, so maybe I can help you implement it.
