Code Monkey home page Code Monkey logo

Kareem Omar's Projects

arduinophysics icon arduinophysics

Realtime 2D physics and collision detection on an Arduino with 60 fps output to a Sharp memory LCD.

bheap icon bheap

Lightweight binary heap that greatly outperforms std::priority_queue and other commonly available heap implementations

bitops icon bitops

Basic, efficient, header-only bit ops and bit array primitives for modern x86. Tests provided.

boids_sdl icon boids_sdl

Numerical simulation of flocking behavior using pure CPU and SDL.

boxblur icon boxblur

Fastest CPU (AVX/SSE) Horizontal Box Blur for image processing and computer vision

clatch icon clatch

Insanely fast CUDA LATCH: fully scale- and rotation-invariant 512-bit binary descriptor for computer vision

crc icon crc

Fastest CRC32 for x86, Intel and AMD, + comprehensive derivation and discussion of various approaches

cudaboids icon cudaboids

Numerical simulation of flocking behavior using CUDA and OpenGL

cudaflerp icon cudaflerp

Fast CUDA (GPU) Bilinear and Nearest-Neighbor Interpolation at high accuracy - float32 data

cudahammingmean icon cudahammingmean

Fastest GPU implementation of a brute-force Hamming-weight matrix sum/mean for 512-bit binary descriptors.

cudak2nn icon cudak2nn

Insanely fast CUDA 2NN 512-bit binary descriptor matcher for computer vision

cudakfnn icon cudakfnn

Fastest CUDA SIFT or other 128-float vector matcher for computer vision

cudakfnn_packed icon cudakfnn_packed

Fastest CUDA SIFT or other 128-float *packed as uint8_t* vector matcher for computer vision

cudalerp icon cudalerp

Fast CUDA (GPU) Bilinear and Nearest-Neighbor Interpolation at high accuracy - uint8_t data

cudargb2y icon cudargb2y

Fastest CUDA RGB to grayscale: 5-30x faster than OpenCV. For image processing/computer vision.

factorization-primality icon factorization-primality

Extremely fast, single-file factorization and primality testing for 32-bit and 64-bit integers on x86.

fastarrayops icon fastarrayops

Extremely fast x86 / AVX2 assembly implementations of common operations for linear arrays: checking whether array contains element, finding index of element, finding min/max element, finding index of min/max element.

fastdivide icon fastdivide

Divide 64-bit integers faster than hardware. Or precompute for a given denom and quickly divide repeatedly.

fastdivide128 icon fastdivide128

Getting __udivti3 or __umodti3 errors? Just want faster division/modulo for 128-bit ints on Clang? Look no further.

fastintegersqrt icon fastintegersqrt

Fastest implementations of 32-bit and 64-bit integer square roots for x86-64

featureangle icon featureangle

Extremely fast SSE gradient (angle of rotation) computation of grayscale features in an image, for image processing and computer vision.

k2nn icon k2nn

Fast bruteforce and Multi-Index Hash (MIH) accelerated 2NN matchers for 512-bit binary descriptors for computer vision

kfast icon kfast

Implementation of FAST feature detector for computer vision (Rosten 2006) using AVX2 to outperform canonical implementation by up to 600%.

kfnn icon kfnn

Fastest CPU (AVX/SSE) SIFT or other 128-float vector matcher for computer vision

khalf icon khalf

Optimized special-case bilinear interpolation, halving the width and not changing the height, for computer vision dual-frame display.

klerp icon klerp

Fastest CPU (AVX2) Bilinear and Nearest-Neighbor Interpolation: 25-100% faster than OpenCV. For computer vision / image processing.

knes icon knes

Complete, lightweight NES emulator in C++, speedcoded in 3 days.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.