Code Monkey home page Code Monkey logo

Hongyi Guo's Projects

alpaca_eval icon alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

baselines icon baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

cluster_optimization icon cluster_optimization

Course project of SJTU EE357: Computer Network, advised by Prof. Na Ruan. We implemented and improved "A Hierarchical Framework of Cloud Resource Allocation and Power Management using Deep Reinforcement Learning" and achieve a good trade-off between power usage and job latency.

commnet-lua icon commnet-lua

Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736

copier icon copier

Co-training for Policy Learning

csp icon csp

CCF计算机软件能力认证往年真题

face_recognization_detection icon face_recognization_detection

Course project for SJTU CS385: Machine Learning, advised by Prof. Quanshi Zhang, where I implemented many algorithms from scratch for face recognization and detection.

image_segmentation icon image_segmentation

Course project for SJTU CS385: Machine Learning, advised by Prof. Quanshi Zhang, where I compared SegNet and FCN on image segmentation task with VOC2012 dataset.

l_dmi icon l_dmi

Code for NeurIPS 2019 Paper, "L_DMI: An Information-theoretic Noise-robust Loss Function"

multiagent-particle-envs icon multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

overcookedgpt icon overcookedgpt

An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic multi-agent settings.

peer_bc_ct icon peer_bc_ct

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

peerloss icon peerloss

Learning with Noisy Labels by adopting a peer prediction loss function.

pymarl icon pymarl

Beta code release for Python Multi-Agent Reinforcement Learning framework

pytorch-unet icon pytorch-unet

Pytorch implementation of the U-Net for image semantic segmentation, with dense CRF post-processing

rain icon rain

Official implementation of [RAIN: Your Language Models Can Align Themselves without Finetuning]

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.