Code Monkey home page Code Monkey logo

Boyuan Chen's Projects

accelerate icon accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

aligner icon aligner

Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction

awesome-rlhf icon awesome-rlhf

A curated list of reinforcement learning with human feedback resources (continually updated)

cby-pku.github.io icon cby-pku.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

cookbook icon cookbook

🎉🎉🎉JAVA高级架构师技术栈==任何技能通过 “刻意练习” 都可以达到融会贯通的境界,就像烹饪一样,这里有一份JAVA开发技术手册,只需要增加自己练习的次数。🏃🏃🏃

data_process icon data_process

Practical data processing python files that may be used in research

fastchat icon fastchat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

gpt4_eval icon gpt4_eval

GPT-4 evaluation prompt, accelerated with ray.

malib icon malib

A parallel framework for population-based multi-agent reinforcement learning.

markdown-emoji icon markdown-emoji

Markdown语法支持添加 emoji表情,输入不同的符号码(两个冒号包围的字符)可以显示出不同的表情

marllib icon marllib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

oi-wiki icon oi-wiki

:star2: Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)

omnisafe icon omnisafe

OmniSafe is an infrastructural framework for accelerating SafeRL research.

pku_dsa icon pku_dsa

A review of my code practice when learning pku : data structure and algorithm

pku_ics icon pku_ics

A review of my code lab when learning pku : ICS

ppoxfamily icon ppoxfamily

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

safe-rlhf icon safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

transformers icon transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

trlx icon trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.