Code Monkey home page Code Monkey logo

Pankayaraj's Projects

aaai_2023_hierarchical-constrained-rl icon aaai_2023_hierarchical-constrained-rl

"Constrained Reinforcement Learning in Hard Exploration Problems" Pathmanathan Pankayaraj, Pradeep Varakantham. AAAI Conference on Artificial Intelligence 2022

cepdnaclk.github.io icon cepdnaclk.github.io

Github pages website for Department of Computer Engineering, University of Peradeniya

ecc_20_mamab icon ecc_20_mamab

"A Decentralized Communication Policy for Multi Agent Multi Armed Bandit Problems" P Pankayaraj, DHS Maithripala

icml_2024_rlhfpoisoning icon icml_2024_rlhfpoisoning

"Is poisoning a real threat to LLM alignment? Maybe more so than you think" Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang. ICML 2024 Workshop MHFAIA

llm_next_word_prediction icon llm_next_word_prediction

Code for next word prediction training based on the BookMIA dataset. This is part of the code for tests done of the work "Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?"

multi-arm-bandit-library icon multi-arm-bandit-library

A python based library which includes multi_arm_bandit and Bayesian_optimization_algorithms. The PYPI repository can be found as mabandit 1.3

sitnshop icon sitnshop

An web application(+mobile application) to advertise any kinds of shops and more additional features

soft-actor-critic icon soft-actor-critic

Implementation of the paper Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

softlearning icon softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.