Code Monkey home page Code Monkey logo

Yan Wang's Projects

clipper icon clipper

A low-latency prediction-serving system

cocktail icon cocktail

Cocktail: A Multidimensional Optimization for Model Serving in Cloud (NSDI'22)

deeprecsys icon deeprecsys

http://vlsiarch.eecs.harvard.edu/research/recommendation/

early-bird-tickets icon early-bird-tickets

[ICLR 2020] Drawing Early-Bird Tickets: Toward More Efficient Training of Deep Networks

freetickets icon freetickets

[ICLR 2022] "Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity" by Shiwei Liu, Tianlong Chen, Zahra Atashgahi, Xiaohan Chen, Ghada Sokar, Elena Mocanu, Mykola Pechenizkiy, Zhangyang Wang, Decebal Constantin Mocanu

gavel icon gavel

Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

gslice icon gslice

gSlice Slicing GPUs to Serve Heterogeneous Inference Requests

hack-sysml icon hack-sysml

The road to hack SysML and become an system expert

holmes icon holmes

HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)

igniter icon igniter

iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.

infmoe icon infmoe

Inference framework for MoE layers based on TensorRT with Python binding

llmperf-for-tiledarch icon llmperf-for-tiledarch

Analytical Performance Model for Tiled Accelerators/Dies in Spatial Architecture Running Large Language Models (LLMs)

mark-project icon mark-project

Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving

mixture-of-experts icon mixture-of-experts

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

muri icon muri

Artifacts for our SIGCOMM'22 paper Muri

nnif_adv_defense icon nnif_adv_defense

Detection of adversarial examples using influence functions and nearest neighbors

openvino_notebooks icon openvino_notebooks

📚 A collection of Jupyter notebooks for learning and experimenting with OpenVINO 👓

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.