Junchen (Kevin) Zhao's Projects

bq_project

Apply a Transformer-based model for financial data prediction.

fundus

A very simple news crawler with a funny name

imperial-college-london-computing-meng-individual-project-multi-agent-apprenticeship-learning

With the rapid development of Artificial Intelligence, Reinforcement Learning has proven to be a powerful technique that allows autonomous agents to learn optimal behaviors (called policies) in unknown and complex environments through models of rewards and penalties. However, for Reinforcement Learning to work correctly, the reward function, which tells the learning agent whether or not it is behaving correctly, needs to be thoroughly specified. As a result, in complex real-world environments such as autonomous driving, specifying a correct reward function can be one of the hard tasks facing Reinforcement Learning model designers. To this end, Apprenticeship Learning techniques such as Inverse Reinforcement Learning, in which the agent infers a reward function from expert behaviors, are of high interest because they can produce a highly specified reward function efficiently. However, for critical tasks such as autonomous driving, safety must be considered carefully, so we need techniques that automatically check and ensure that the inferred reward functions and the policies produced by the Reinforcement Learning system fulfill the safety requirements of those tasks. To build a well-designed Reinforcement Learning system that generates a highly specified reward function and satisfies the safety requirements, we apply the technique called Apprenticeship Learning, which is mentioned in the background section and introduced in detail in a later section.
The objective of this project is to extend the technique from the paper "Safety-aware Apprenticeship Learning" by Weichao Zhou and Wenchao Li [3], improving the utility and efficiency of the existing Reinforcement Learning system by moving from a single-agent learning setting to a multi-agent framework.

kvquant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

my-data-science-projects

This repository collects my past data science projects as a showcase of my professional skills.

openvaccine-covid-19-mrna-vaccine-degradation-

We are designing a model using CNN+GCN to predict the likely degradation rates at each base of an RNA molecule, trained on a subset of an Eterna dataset comprising over 3000 RNA molecules (which span a panoply of sequences and structures) and their degradation rates at each position. We will then score our models on a second generation of RNA sequences that have just been devised by Eterna players for COVID-19 mRNA vaccines.

wizardlm

Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
