cuda_op_benchmark

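In real-world inference engine development, most of the work is implementing deep learning operators. This repository therefore uses efficient operator implementation as the entry point for introducing CUDA programming, CUDA optimization, and the related concepts. The goal is to learn CUDA programming through operator benchmarks. The framework is built with CMake, which also keeps the code easy to organize, so you can focus on optimizing the operators themselves.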
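To make the idea of an operator benchmark concrete, here is a minimal host-side sketch of a timing loop: a few untimed warm-up calls followed by an averaged timed run. It is an illustration under assumptions, not code from this repository; the name `benchmark_ms` and its parameters are hypothetical.

```cpp
#include <chrono>
#include <cstddef>
#include <functional>

// Hypothetical helper, not part of this repository: runs `op` a few times
// untimed, then returns the average wall-clock time per call in milliseconds.
double benchmark_ms(const std::function<void()>& op,
                    std::size_t warmup_iters,
                    std::size_t timed_iters) {
    // Warm-up calls are excluded from the measurement.
    for (std::size_t i = 0; i < warmup_iters; ++i) {
        op();
    }

    const auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < timed_iters; ++i) {
        op();
    }
    const auto stop = std::chrono::steady_clock::now();

    const std::chrono::duration<double, std::milli> elapsed = stop - start;
    // Avoid dividing by zero when no timed iterations were requested.
    return timed_iters == 0
               ? 0.0
               : elapsed.count() / static_cast<double>(timed_iters);
}
```

CUDA kernel launches are asynchronous with respect to the host, so when `op` launches a kernel it would also need to synchronize (for example with `cudaDeviceSynchronize`), or the timing would be done with CUDA events (`cudaEventRecord` and `cudaEventElapsedTime`) instead of a host clock.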

Development server environment

  • RTX4090

Operator status

| Op Name   | Naive             | Version 2  | Version 3      | Version 4  | Version 5 | Status |
| GEMM      | Wmma ✅           | mma-ptx ✅ | mma-ptx-opt ❌ | cutlass ❌ |           | Doing  |
| Attention | FlashAttention ❌ |            |                |            |           | Doing  |

Usage

1. Build and run

Run the following from the project root directory.

Build

bash scripts/build_linux.sh

Run

# Run the gemm op benchmark, timing the CUDA kernels only
./build/CUDABench op/gemm

# Run the gemm op benchmark, timing the CUDA kernels and also checking correctness
./build/CUDABench op/gemm 1

# Same as above, with all log output redirected to log.txt
./build/CUDABench op/gemm 1 > log.txt 2>&1
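The commands above take an operator path and an optional `1` that turns on the correctness check. The README does not show how `CUDABench` handles these internally, so the following is only a sketch of one way it could work: parse the two arguments, compute a CPU reference GEMM, and compare a kernel's output against it within a tolerance. `BenchOptions`, `parse_options`, `gemm_reference`, and `all_close` are hypothetical names, and the row-major `float` layout is an assumption.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical option struct mirroring the command shapes shown above.
struct BenchOptions {
    std::string op_path;             // e.g. "op/gemm"
    bool check_correctness = false;  // true when the second argument is "1"
};

// Parses `<program> <op_path> [1]`. Returns false if the op path is missing.
bool parse_options(int argc, const char* const* argv, BenchOptions& out) {
    if (argc < 2) {
        return false;
    }
    out.op_path = argv[1];
    out.check_correctness = (argc >= 3 && std::string(argv[2]) == "1");
    return true;
}

// Row-major CPU reference: C (m x n) = A (m x k) * B (k x n).
std::vector<float> gemm_reference(const std::vector<float>& a,
                                  const std::vector<float>& b,
                                  std::size_t m, std::size_t n, std::size_t k) {
    assert(a.size() == m * k);
    assert(b.size() == k * n);
    std::vector<float> c(m * n, 0.0f);
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t p = 0; p < k; ++p) {
            const float a_ip = a[i * k + p];
            for (std::size_t j = 0; j < n; ++j) {
                c[i * n + j] += a_ip * b[p * n + j];
            }
        }
    }
    return c;
}

// True when every element of `got` is within abs_tol + rel_tol * |expected|.
bool all_close(const std::vector<float>& got,
               const std::vector<float>& expected,
               float abs_tol, float rel_tol) {
    if (got.size() != expected.size()) {
        return false;
    }
    for (std::size_t i = 0; i < got.size(); ++i) {
        const float diff = std::fabs(got[i] - expected[i]);
        // Written as a negated <= so that NaN values are rejected too.
        if (!(diff <= abs_tol + rel_tol * std::fabs(expected[i]))) {
            return false;
        }
    }
    return true;
}
```

In a setup like this, the kernel output would be copied back to the host and passed to `all_close` together with the result of `gemm_reference`. Tensor-core paths that use half-precision inputs generally need a looser tolerance than a pure `float` comparison.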
