zjc664656505

followers: 6 following: 7 repos: 26 gists: 0

Name: Junchen(Kevin) Zhao

Type: User

Company: University of California, Irvine

Bio: Computer Science Ph.D. student at UC Irvine.

Location: Irvine

Junchen(Kevin) Zhao's Projects

applied_scientist_interview_prep

automatic-code-generation-from-natural-language-descriptions-using-neural-machine-translation-with-b

bq_project

Apply a Transformer-based model for financial data prediction.

co416-machine-learning-for-medical-imaging

co490-natural-language-processing

evaluating-visual-fidelity-of-image-description-based-on-fluency-based-word-mover-s-distance

fundus

A very simple news crawler with a funny name

imperial-college-london-computing-meng-individual-project-multi-agent-apprenticeship-learning

With the rapid development of Artificial Intelligence, Reinforcement Learning has proven to be a powerful technique that allows autonomous agents to learn optimal behaviors (called policies) in unknown and complex environments through models of rewards and penalties. However, for Reinforcement Learning to work correctly, the reward function, which gives the learning agent feedback on whether it is behaving correctly, needs to be thoroughly specified. As a result, in complex real-world environments such as autonomous driving, specifying a correct reward function can be one of the hardest tasks for Reinforcement Learning model designers. To this end, Apprenticeship Learning techniques such as Inverse Reinforcement Learning, in which the agent infers a reward function from expert behaviors, are of high interest because they can produce a well-specified reward function efficiently. However, for critical tasks such as autonomous driving, safety must be considered carefully, so we need techniques that automatically check and ensure that the inferred reward functions and the resulting policies fulfill the safety requirements of those tasks. To obtain a well-designed Reinforcement Learning system that generates a well-specified reward function while respecting these safety considerations, we apply the Apprenticeship Learning technique mentioned in the background section, which is introduced in detail in a later section.
The objective of this project is to extend the technique from the paper "Safety-aware Apprenticeship Learning" by Weichao Zhou and Wenchao Li [3], improving the utility and efficiency of the existing Reinforcement Learning system by moving from a single-agent learning setting to a multi-agent framework.

junchenzhao.github.io

A beautiful, simple, clean, and responsive Jekyll theme for academics

kaggler

keras-example

kvquant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

lingualinked-inference

Distributed On-Device LLM Inference

my-data-science-projects

This repository collects my past data science projects as a showcase of my professional skills.

myself

onnxruntime-training-examples

Examples for using ONNX Runtime for model training.

openvaccine-covid-19-mrna-vaccine-degradation-

We are designing a model using CNN+GCN to predict the likely degradation rates at each base of an RNA molecule, trained on a subset of an Eterna dataset comprising over 3000 RNA molecules (which span a panoply of sequences and structures) and their degradation rates at each position. We will then score our models on a second generation of RNA sequences that have just been devised by Eterna players for COVID-19 mRNA vaccines.
