Code Monkey home page Code Monkey logo

reply's People

reply's Issues

Add support for storing the learned policy

Add some way to persist a policy across experiments. The idea is being able
to interrupt the experiment without losing all already learned knowledge.

This would also allow for later on loading the learned policy and

- continue learning from an advanced starting point
- just evaluate the policy, to perform a learned behaviour

Original issue reported on code.google.com by [email protected] on 29 Jun 2009 at 12:01

Replace encoders by mappings

Change the encoder concept by a more general mapping concept. The storage
takes two mappings as optional parameters: of for the observation space and
one for the action space.

A Mapping translates between a domain space and an image space.

Original issue reported on code.google.com by [email protected] on 23 Jul 2009 at 6:52

Add support for loading a policy

Allow for some way of loading a previously learned policy to use either as
a starting point for continuing learning, or just for evaluation.

Original issue reported on code.google.com by [email protected] on 29 Jun 2009 at 12:02

Add support for external space definition

Spaces should be used to define the model. Agent and Environment should use
the same spaces. 

If the space is given, the agent/environment are considered to be
reply-compatible, otherwise, they are rlglue-compatible.

If an agent/environment is reply-compatible, it's attributes can be
accessed by name, otherwise they can only be accessed by position.

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 4:38

Don't require rlglue dependency

Currently rl-glue is a hard dependency. Make this not be like this. rl-glue
should be available if installed, but not required.

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:03

Fix rl-glue spaces

When using a rlglue environment, task spec has not the extra parameter
giving the attribute names. Currently in those cases, the spaces ignore all
attributes. 

Make those attributes to be referenceable by index, at least.

Original issue reported on code.google.com by [email protected] on 16 Jun 2009 at 1:05

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.