Code Monkey home page Code Monkey logo

uoft-ecosystem's Projects

apex icon apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

bots icon bots

Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour under certain circumstances: task tiedness, throttle and cut-offs mechanisms, single/multiple task generators, etc.

bppsa-open icon bppsa-open

The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".

brax icon brax

Massively parallel rigidbody physics simulation on accelerator hardware.

cache-trace icon cache-trace

A collection of Twitter's anonymized production cache traces.

dlrm icon dlrm

An implementation of a deep learning recommendation model (DLRM)

gpgpu-sim_distribution icon gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.

gpgpu-sim_simulations icon gpgpu-sim_simulations

A repository that compliments gpgpu-sim, providing automated regression scripts, simulation launching utilities and the code + arguments for simulations that complete in a reasonable amount of time on GPGPU-Sim.

halide icon halide

a language for fast, portable data-parallel computation

hfta icon hfta

Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion

incubator-mxnet icon incubator-mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

inference icon inference

Reference implementations of inference benchmarks

minuet icon minuet

[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs

moil icon moil

MoIL: Enabling Efficient Incremental Training on Edge Devices

nnvm icon nnvm

move to https://github.com/dmlc/tvm/

pytorch icon pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

rlscope icon rlscope

RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.