cby-pku Goto Github PK
Name: Boyuan Chen
Type: User
Company: Peking University
Bio: Sophomore undergrad at Peking University📚 Focus on Scalable Oversight / AI Safety / AI Alignment
Location: Beijing
Name: Boyuan Chen
Type: User
Company: Peking University
Bio: Sophomore undergrad at Peking University📚 Focus on Scalable Oversight / AI Safety / AI Alignment
Location: Beijing
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction
A curated list of reinforcement learning with human feedback resources (continually updated)
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
🎉🎉🎉JAVA高级架构师技术栈==任何技能通过 “刻意练习” 都可以达到融会贯通的境界,就像烹饪一样,这里有一份JAVA开发技术手册,只需要增加自己练习的次数。🏃🏃🏃
Practical data processing python files that may be used in research
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
GPT-4 evaluation prompt, accelerated with ray.
A parallel framework for population-based multi-agent reinforcement learning.
Markdown语法支持添加 emoji表情,输入不同的符号码(两个冒号包围的字符)可以显示出不同的表情
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
:star2: Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)
OmniSafe is an infrastructural framework for accelerating SafeRL research.
reproduction of overthinking_the_truth
PKU 2023 -12 Machine Learning Labs
The basic demo of the ai_basic_learing_2023_spring_pku
Code for PKU AI Social Sciences
A review of my code practice when learning pku : data structure and algorithm
A review of my code lab when learning pku : ICS
A review of my code when learning PKU: programming-algorithm
the demo of jiangzehan_modeling
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
This is a benchmark repository for safe reinforcement learning algorithms
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.