pankayaraj,Pankayaraj,github

aaai_2023_hierarchical-constrained-rl

"Constrained Reinforcement Learning in Hard Exploration Problems" Pathmanathan Pankayaraj, Pradeep Varakantham. AAAI Conference on Artificial Intelligence 2022

cepdnaclk.github.io

Github pages website for Department of Computer Engineering, University of Peradeniya

cognitive_computation-2023_continual-learning-with-curiosity

"Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning" Pankayaraj Pathmanathan, Natalia Díaz-Rodríguez, Javier Del Ser. Cognitive Computation journal 2023

django_server_sleep_apnea

Central server made on django and rest framework to facilitate the detection of sleep apnea problem

ecc_20_mamab

"A Decentralized Communication Policy for Multi Agent Multi Armed Bandit Problems" P Pankayaraj, DHS Maithripala

fisher_discriminant_analysis_and_pca

icml_2024_rlhfpoisoning

"Is poisoning a real threat to LLM alignment? Maybe more so than you think" Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang. ICML 2024 Workshop MHFAIA

Code for next word prediction training based on the BookMIA dataset. This is part of the code for tests done of the work "Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?"

machine_learning_algorithms_numpy

Implementation of Machine Algorithms including NN in numpy

models_2024

multi-arm-bandit-library

A python based library which includes multi_arm_bandit and Bayesian_optimization_algorithms. The PYPI repository can be found as mabandit 1.3

neuralnetworkproject

Application of various neural networks on MNIST data set(On going)

pankayaraj.github.io

Personal We Page

pca_image_argumentation

Just an example for changing the lighting of images using PCA as mentioned in the AlexNet paper

programming_algorithms

Small library of personal programming algorithm implementations

reinforcement_learning

Reinforcement learning algorithms taught by David Silver on youtube to small scale problems

rl_pretraining

rlhf_poisoning

sac

Soft Actor-Critic

sitnshop

An web application(+mobile application) to advertise any kinds of shops and more additional features

sleep_apnea_detection-1

Non intrusive method for detecting sleep apnea in infants.

soft-actor-critic

Implementation of the paper Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

pankayaraj Goto Github PK

Pankayaraj's Projects

Recommend Projects

Recommend Topics

Recommend Org