gohsyi Goto Github PK

followers: 32 following: 33 repos: 49 gists: 3

Name: Hongyi Guo

Type: User

Company: Northwestern University

Bio: Ph.D. at Northwestern University.

Location: Evanston

Hongyi Guo's Projects

alignment-handbook

Robust recipes to align language models with human and AI preferences

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Course project for SJTU EE357: Computer Network, advised by Prof. Na Ruan. We implemented and improved "A Hierarchical Framework of Cloud Resource Allocation and Power Management using Deep Reinforcement Learning" and achieved a good trade-off between power usage and job latency.

commnet

An implementation of CommNet

commnet-lua

Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736

copier

Co-training for Policy Learning

cs214-algorithm-and-complexity

Homework for SJTU CS214: Algorithm and Complexity, advised by Prof. Xiaofeng Gao. All assignments were graded above A level.

cs356-operating-system-project

Course project for SJTU CS356: Operating System, advised by Prof. Fan Wu. Received 100 points.

csp

Past exam problems from the CCF Computer Software Proficiency Certification (CSP)

data-stucture-2017

DS2017 coursework, deque and map

end-to-end-negotiator

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

exploration-by-disagreement

[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement

face_recognization_detection

Course project for SJTU CS385: Machine Learning, advised by Prof. Quanshi Zhang, where I implemented many algorithms from scratch for face recognition and detection.

gohsyi.github.io

hyperparallel_machine_learning

Course repo for IV-J

image_segmentation

Course project for SJTU CS385: Machine Learning, advised by Prof. Quanshi Zhang, where I compared SegNet and FCN on an image segmentation task with the VOC2012 dataset.

l_dmi

Code for the NeurIPS 2019 paper "L_DMI: An Information-theoretic Noise-robust Loss Function"

lightzero

look_for_words

Looking for words? Try me.

machine-learning-coursera

Exercises from Coursera

multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

overcookedgpt

An OpenAI Gym environment to evaluate the ability of LLMs (e.g., GPT-4, Claude) in long-horizon reasoning and task planning in dynamic multi-agent settings.

peer_bc_ct

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

peerloss

Learning with Noisy Labels by adopting a peer prediction loss function.

pymarl

Beta code release for Python Multi-Agent Reinforcement Learning framework

pytorch-unet

PyTorch implementation of the U-Net for image semantic segmentation, with dense CRF post-processing
