Topic: offline-reinforcement-learning Goto Github
Some thing interesting about offline-reinforcement-learning
Some thing interesting about offline-reinforcement-learning
offline-reinforcement-learning,📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
User: allenpandas
offline-reinforcement-learning,PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
User: by571
offline-reinforcement-learning,A Japanese (Riichi) Mahjong AI Framework
User: cryolite
offline-reinforcement-learning,[FL-ICML 2023] Code for Federated Ensemble-Directed Offline Reinforcement Learning
User: desikrengarajan
offline-reinforcement-learning,Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
User: dhdev0
offline-reinforcement-learning,"S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning" (NeurIPS 2022)
User: dsshim0125
offline-reinforcement-learning,Summarising the research of Offline RL in Federated Setting.
User: elated-sawyer
offline-reinforcement-learning,Original implementations of the VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" by Jeen et. al (2024).
User: enjeeneer
Home Page: https://enjeeneer.io/projects/zero-shot-rl/
offline-reinforcement-learning,The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
User: facebear-ljx
offline-reinforcement-learning,Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"
User: ganjiro
offline-reinforcement-learning,Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
User: holarissun
Home Page: https://sites.google.com/view/rewardshaping
offline-reinforcement-learning,Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
User: howuhh
offline-reinforcement-learning,The First Open-Sourced Building Batch Reinforcement Learning Dataset
User: hydesmondliu
offline-reinforcement-learning,JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
User: ikostrikov
offline-reinforcement-learning,:battery: Datasets with baselines for offline multi-agent reinforcement learning.
Organization: instadeepai
Home Page: https://instadeepai.github.io/og-marl/
offline-reinforcement-learning,Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
User: kschweig
offline-reinforcement-learning,Code for FOCAL Paper Published at ICLR 2021
User: lanqingli1993
offline-reinforcement-learning,A Production Tool for Embodied AI
Organization: loopmind-ai
Home Page: https://www.loopquest.ai/
offline-reinforcement-learning,Robust Offline Reinforcement Learning with Heavy-Tailed Rewards
User: mamba413
offline-reinforcement-learning,Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
User: manchery
offline-reinforcement-learning,
User: mohan-zhang-u
offline-reinforcement-learning,Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
User: nikhilbarhate99
offline-reinforcement-learning,Clean single-file implementation of offline RL algorithms in JAX
User: nissymori
offline-reinforcement-learning, Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets
Organization: polixir
Home Page: http://polixir.ai/research/neorl
offline-reinforcement-learning,A collection of offline reinforcement learning algorithms.
Organization: polixir
offline-reinforcement-learning,Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
User: reinholdm
offline-reinforcement-learning,Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
User: ryanxhr
offline-reinforcement-learning,[AAAI 2022] The official implementation of CPQ in "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning"
User: ryanxhr
offline-reinforcement-learning, Implementation of CQL in "Conservative Q-Learning for Offline Reinforcement Learning" based on BRAC family.
User: ryanxhr
offline-reinforcement-learning,[AAAI 2022] The official implementation of "DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning"
User: ryanxhr
offline-reinforcement-learning,[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
User: ryanxhr
offline-reinforcement-learning,[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
User: ryanxhr
offline-reinforcement-learning,Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
Organization: sail-sg
Home Page: https://arxiv.org/abs/2210.05980
offline-reinforcement-learning,Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
Organization: snu-mllab
offline-reinforcement-learning,Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
Organization: snu-mllab
offline-reinforcement-learning,Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, Offline RL Workshop
Organization: tinkoff-ai
offline-reinforcement-learning,High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Organization: tinkoff-ai
Home Page: https://arxiv.org/abs/2210.07105
offline-reinforcement-learning,Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop
Organization: tinkoff-ai
offline-reinforcement-learning,Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
Organization: tinkoff-ai
offline-reinforcement-learning,Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
Organization: tinkoff-ai
offline-reinforcement-learning,
User: weichengtseng
offline-reinforcement-learning,The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
User: xionghuichen
Home Page: https://ieeexplore.ieee.org/document/10255284
offline-reinforcement-learning, Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
User: yangrui2015
offline-reinforcement-learning,Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
User: yangrui2015
offline-reinforcement-learning,An elegant PyTorch offline reinforcement learning library for researchers.
User: yihaosun1124
offline-reinforcement-learning,Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
User: yudasong
offline-reinforcement-learning,Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
User: zaiyan-x
offline-reinforcement-learning,Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
User: zhengyaojiang
Home Page: https://sites.google.com/view/latentplan
offline-reinforcement-learning,[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
User: zhengyinan-air
Home Page: https://zhengyinan-air.github.io/FISOR/
offline-reinforcement-learning,[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"
User: zhengyinan-air
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.