Hamdy Abdelkhalik's Projects
CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels
gpgpu-sim in a docker image (Ubuntu 20.04, CUDA Toolkit 11)
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.
A collection of GPGPUs apps and benchmarks for architecture and HPC research
Config files for my GitHub profile.
Inference code for LLaMA models
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
NPBench - A Benchmarking Suite for High-Performance NumPy