Code Monkey home page Code Monkey logo

mdps-and-em-algorithms's Introduction

Markov Decision Processes and Expectation Maximization Algorithm Project ๐Ÿ“š

๐Ÿ“– Table of Contents

  • About the Project
  • Getting Started
  • Usage
  • Contributing
  • References

๐Ÿง About

Markov Decision Processes (MDPs) are mathematical frameworks for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker. MDPs are useful for studying optimization problems solved via dynamic programming and reinforcement learning.

This project discusses:

  • What are MDPs
  • Solving MDPs using Value Iteration algorithm
  • Example of MDP - Recycling Robot
  • Expectation Maximization (EM) algorithm
  • Implementing EM algorithm using reaction networks

๐Ÿ Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes:

Prerequisites

  • Python 3.x
  • Knowledge of Markov Decision Processes and Reinforcement Learning
  • Basic understanding of Expectation Maximization algorithm

๐ŸŽˆ Usage

To use this project, follow the steps below:

  • Run main.py to learn about MDPs and solve using Value Iteration algorithm
  • Check recycling_robot.py for an example MDP - Recycling Robot
  • Refer project report to understand EM algorithm

๐Ÿค Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

  • Fork the Project
  • Create your Feature Branch
  • Commit your Changes
  • Push to the Branch
  • Open a Pull Request

๐Ÿ“– References

  • Richard S. Sutton and Andrew G. Barto, Reinforcement Learning:An Introduction.
  • Abhinav, Masters Thesis, Molecular Algorithms and Schemes for their Implementation using DNA.
  • LIHONG LI, A Unifying Framework For Computational Reinforcement Learning Theory.
  • Muppirala Viswa Virinchi, Abhishek Behera, Manoj GopalKrishnan, A reaction network scheme which implements the EM algorithm

mdps-and-em-algorithms's People

Contributors

shashankatthaluri avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.