Code Monkey home page Code Monkey logo

ai_lib's Introduction

Jidi (及第) AiLib

License: MIT Release Version PRs Welcome

This repo provides a reinforcement learning library designed for our online submission evaluation platform Jidi(及第) which aims to offer fair benchmarking over various RL environments and host AI competitions on problems worth exploring.

Version 2.0 (branch V2) New version of Jidi AiLib which supports Population-Based Training! Still under construction and stay tuned. PRs are welcome!

Version 1.0 (branch master): In the version V1.0, the repo contains the code of all the benchmark RL environment we included on Jidi(及第) and simple-to-use training examples covering some classic baseline RL algorithms.

平台地址 (Our Website)

jidi_ai

快速开始 (Quick Start)

You can install Jidi AiLib on your own personal computer or workstation. To install it, please follow the instruction below.

Clone the repo

git clone https://github.com/jidiai/ai_lib.git
cd ailib

Build virtual environment

python3 -m venv ailib-venv
source ailib-venv/bin/activate
python3 -m pip install -e .

or

conda create -n ailib-venv python==3.7.5
conda activate ailib-venv

Install necessary dependencies

pip install -r requirements.txt

Now have a go

python examples/main.py --scenario classic_CartPole-v0 --algo dqn --reload_config 

训练例子 (Training Examples)

We provide implementations and tuned training configurations of vairous baseline reinforcement learning algorithms. More details can be found in ./examples/. Feel free to try it yourself.

We currently support the following benchmarking experiments:

Algo CartPole-v0 MountainCar-v0 Pendulum-v0 gridworld
RANDOM
Q-learning - - -
Sarsa - - -
DQN - -
DDQN - -
Duelingq - -
SAC -
PPO - - -
PG - - -
AC - - -
DDPG - - -
TD3 - - -

额外项目依赖 (Extra Dependencies)

Apart from the necessary dependency in requirements.txt, some environments require extra dependencies and we list all of them here.

目录结构 (Code structure)

|-- platform_lib
	|-- README.md
	|-- run_log.py		// 本地调试运行环境
	|-- examples	// 提交运行文件示例	需包含 my_controller 函数输出policy
	    |-- random.py  // 随机策略 需根据环境动作是否连续 调整 is_act_continuous 的值
	|-- replay		// render工具,用于非gym环境,打开replay.html上传run_log 存储的.json文件 
	|-- env		// 游戏环境 
	|	|-- simulators		// 模拟器
	|	|	|-- game.py
	|	|	|-- gridgame.py // 网格类模拟器接口
	|	|-- obs_interfaces		// observation 观测类接口
	|	|	|-- observation.py		// 目前支持Grid Vector
	|	|-- config.ini		// 相关配置文件
	|	|-- chooseenv.py 
	|	|-- snakes.py
	|	|-- gobang.py
	|	|-- reversi.py
	|	|-- sokoban.py
	|	|-- ccgame.py

平台提交说明

  1. 填写算法名称或描述,选择提交环境
  2. 上传一个或多个文件。
  • 其中必须包含一个运行文件,运行文件需包含my_controller 函数的一个submission.py文件。
  • 附属文件支持.pth .py类型文件。大小不超过100M,个数不超过5个。

ai_lib's People

Contributors

ht-zhou avatar jidiai avatar lc102197 avatar miracle1207 avatar smoothsmooth avatar wangcechen avatar yansong97 avatar yutongamber avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

ai_lib's Issues

样例跑不通,代码陈旧

现在都是python3.8,弄个3.7的实在跑不动
另外训练部分的代码基本上是空的,只会测试不会训练,没法用

样例就跑不通

python main.py --scenario classic_CartPole-v0 --algo dqn就各种报错

跑不通PPO_CNN MiniWorld_OneRoom-v0示例

我尝试了master、alog、old_config等多个分支,运行example里面main.py函数中PPOCNN MiniWorld_OneRoom的例子,可以运行成功,但是每轮训练回报始终为-25.50
如果作者能解决该示例的问题,万分感谢

代码库跑不通

好像跑不通,希望可以按照所提供的代码运行流程验证下。对于我来说可读性有点差,所以不解决了,交给作者了。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.