Shiva

Shiva is built to be simulation-engine agnostic. Its framework is abstracted to support various types of observation and action spaces, with different environment settings and numbers of agents. Additionally, Shiva is designed to support distributed processing across a large number of servers, so that it can handle learning in complex environments with large observation or action spaces where multiple agents need to converge to a team policy. At the moment, Shiva supports popular reinforcement and imitation learning algorithms such as Deep Q-Network (DQN), Deep Deterministic Policy Gradient (DDPG), Proximal Policy Optimization (PPO), Multi-Agent Deep Deterministic Policy Gradient (MADDPG), and Dataset Aggregation (DAgger), in addition to a few customized and hybrid model-based algorithms that leverage the dynamics of the environment to converge faster. The framework is built to enable researchers to design and experiment with new algorithms and to test them at scale in different environments and scenarios with minimal infrastructure setup.

Get started with the Installation section, then go through the Quickstart to see how to run a session. The Tutorial section goes into more detail about Shiva so that you can familiarize yourself with its components and then extend it with new algorithms.

Table of Contents

Requirements and Installation
Quickstart
Tutorial
- Project Layout
- Classes
  - Algorithm
  - Agent
  - MetaLearner
  - Learner
  - Environment
  - Network
  - Buffer
  - Admin
- Configuration files
- Example Environments
How to extend Shiva
- UnitTests
- Creating a new algorithm
- Creating a new environment wrapper

Benchmarks

You can use these benchmarks to check whether changes made to Shiva are improvements.

Restrictions

If you would like to contribute to Shiva, please do so by providing your own implementations of the abstract modules; this helps keep the existing code stable. If you have difficulties with any of the existing modules, please raise an issue on the repository.
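As a rough illustration of what "providing your own implementation of an abstract module" can look like, here is a minimal, self-contained sketch. It does not import Shiva, and every name in it (`AbstractAlgorithm`, `RandomDiscreteAlgorithm`, `get_action`, `update`) is a hypothetical stand-in chosen for this example, not Shiva's actual class or method names; see the Tutorial section for the real abstract classes.

```python
from abc import ABC, abstractmethod
import random


class AbstractAlgorithm(ABC):
    """Hypothetical stand-in for an abstract module; NOT Shiva's real base class."""

    @abstractmethod
    def get_action(self, observation):
        """Return an action for the given observation."""

    @abstractmethod
    def update(self, batch):
        """Consume a batch of experience and return a loss value."""


class RandomDiscreteAlgorithm(AbstractAlgorithm):
    """Illustrative implementation: picks uniformly among n discrete actions."""

    def __init__(self, num_actions, seed=0):
        self.num_actions = num_actions
        self._rng = random.Random(seed)
        self.updates = 0

    def get_action(self, observation):
        # The observation is ignored here; a real algorithm would feed it to a network.
        return self._rng.randrange(self.num_actions)

    def update(self, batch):
        # Nothing is learned; the call is only counted so the sketch stays runnable.
        self.updates += 1
        return 0.0


algo = RandomDiscreteAlgorithm(num_actions=4)
action = algo.get_action(observation=[0.0, 1.0])
loss = algo.update(batch=[])
```

The point of the pattern is that new behavior lives in a subclass while the abstract interface stays untouched, so existing modules that depend on the interface keep working.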

Credits

nFlux AI
- Seyed Sajjadi
- Andrew Miller
- Ezequiel Donovan
- Jorge Martinez
- Travis Hamm
- Daniel Tellier
- Joshua Kristanto
University of Southern California, Institute for Creative Technologies (USC ICT)
- Volkan Ustun
- Rajay Kumar
CSUN NSF Grant # 1842386
- Carol Shubin

License

Apache License 2.0
