Code Monkey home page Code Monkey logo

Hi there, 👋

I'm BBuf, Working At SkyworkAI Now.

BBuf's github stats

My side projects

  1. Learning note on CUDA, AIComplier, LLM:

    how-to-optim-algorithm-in-cuda

    https://github.com/BBuf/tvm_mlir_learn

    https://github.com/BBuf/how-to-learn-deep-learning-framework

  2. Traditional image processing algorithm sharing:

    Image-processing-algorithm

    Note: Currently, this project only includes explain blog in Chinese.

  3. Deep learning technology sharing:

    Keras-Semantic-Segmentation

    DarkNet Source Code Analysis

  4. Algorithm optimization:

    Image-processing-algorithm-Speed

    ArmNeonOptimization

  5. Contibutions To Open Source Projects:

    I have made some contributions to some famous open source projects such as pytorchoneflow, TVM etc.

  6. Keep writing and publishing articles about tech comm (technical writing, stories, self-improvement, etc.) in Chinese:

Xiaoyu Zhang's Projects

chatrwkv icon chatrwkv

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

codeforces-go icon codeforces-go

Golang 算法竞赛模板库 | Solutions to Codeforces by Go 💭💡🎈

cpp_related_tips icon cpp_related_tips

📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.

cpufp icon cpufp

A CPU tool for benchmarking the peak of floating points

fastllm icon fastllm

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

how_to_optimize_in_gpu icon how_to_optimize_in_gpu

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

kineto icon kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

llama-factory icon llama-factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.