The reply from luciotorre

Implement hanoi sample

Implement the hanoi sample with support for rl-glue

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:08

Test coding standards compliance

Make sure all code is PEP 8 compatible and passes all lint tests.

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:14

Write documentation

Add docstrings for generating documentation using Sphinx.

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:13

Implement the rock_paper_scissors sample

Implement the rock_paper_scissors sample with support for rl-glue

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:09

Add support for storing the learned policy

Add some way to persist a policy across experiments. The idea is being able
to interrupt the experiment without losing all already learned knowledge.

This would also allow for later on loading the learned policy and

- continue learning from an advanced starting point
- just evaluate the policy, to perform a learned behaviour

Original issue reported on code.google.com by [email protected] on 29 Jun 2009 at 12:01

Make sure all the code has full test coverage

Before the release, make sure all the code has 100% test coverage.

Original issue reported on code.google.com by [email protected] on 29 Jun 2009 at 2:15

Implement gambler sample

Implement the gambler sample with support for rl-glue

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:07

Replace encoders by mappings

Change the encoder concept by a more general mapping concept. The storage
takes two mappings as optional parameters: of for the observation space and
one for the action space.

A Mapping translates between a domain space and an image space.

Original issue reported on code.google.com by [email protected] on 23 Jul 2009 at 6:52

Add support for loading a policy

Allow for some way of loading a previously learned policy to use either as
a starting point for continuing learning, or just for evaluation.

Original issue reported on code.google.com by [email protected] on 29 Jun 2009 at 12:02

Move encoding/decoding into the storage

Move all encoding/decoding related code into the storage.

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:11

Implement gridworld sample

Implement the gridworld sample with support for rl-glue

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:08

Implement SarsaLearner

Implement the Sarsa learner along with its corresponding unit tests

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:10

Implement function aproximation for storage

Implement a storage that does function aproximation. 
Maybe tiling and kanerva and rbf

Original issue reported on code.google.com by [email protected] on 26 Jun 2009 at 2:10

Implement Q(lambda)

Implement eligibility traces for Q (maybe for sarsa too)

Original issue reported on code.google.com by [email protected] on 26 Jun 2009 at 2:09

Implement state_value sample

Implement the state_value sample with support for rl-glue

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:07

Add support for external space definition

Spaces should be used to define the model. Agent and Environment should use
the same spaces. 

If the space is given, the agent/environment are considered to be
reply-compatible, otherwise, they are rlglue-compatible.

If an agent/environment is reply-compatible, it's attributes can be
accessed by name, otherwise they can only be accessed by position.

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 4:38

Don't require rlglue dependency

Currently rl-glue is a hard dependency. Make this not be like this. rl-glue
should be available if installed, but not required.

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:03

Fix rl-glue spaces

When using a rlglue environment, task spec has not the extra parameter
giving the attribute names. Currently in those cases, the spaces ignore all
attributes. 

Make those attributes to be referenceable by index, at least.

Original issue reported on code.google.com by [email protected] on 16 Jun 2009 at 1:05

Add support for retrieving info about experiment progress

Add some way to see/measure how the experiment is going.

Show some measure of performance/error.

Original issue reported on code.google.com by [email protected] on 28 Jun 2009 at 11:59

Add complete set of unit tests for learner

Add a complete set of unit tests for the learner.py module

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:10

Implement the blackjack sample

Implement the blackjack sample with support for rl-glue

Original issue reported on code.google.com by [email protected] on 15 Jun 2009 at 11:09

luciotorre / reply Goto Github PK

reply's People

reply's Issues

Recommend Projects

Recommend Topics

Recommend Org