luciotorre / reply Goto Github PK
View Code? Open in Web Editor NEWAutomatically exported from code.google.com/p/reply
Automatically exported from code.google.com/p/reply
Implement the hanoi sample with support for rl-glue
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:08
Make sure all code is PEP 8 compatible and passes all lint tests.
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:14
Add docstrings for generating documentation using Sphinx.
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:13
Implement the rock_paper_scissors sample with support for rl-glue
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:09
Add some way to persist a policy across experiments. The idea is being able
to interrupt the experiment without losing all already learned knowledge.
This would also allow for later on loading the learned policy and
- continue learning from an advanced starting point
- just evaluate the policy, to perform a learned behaviour
Original issue reported on code.google.com by [email protected]
on 29 Jun 2009 at 12:01
Before the release, make sure all the code has 100% test coverage.
Original issue reported on code.google.com by [email protected]
on 29 Jun 2009 at 2:15
Implement the gambler sample with support for rl-glue
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:07
Change the encoder concept by a more general mapping concept. The storage
takes two mappings as optional parameters: of for the observation space and
one for the action space.
A Mapping translates between a domain space and an image space.
Original issue reported on code.google.com by [email protected]
on 23 Jul 2009 at 6:52
Allow for some way of loading a previously learned policy to use either as
a starting point for continuing learning, or just for evaluation.
Original issue reported on code.google.com by [email protected]
on 29 Jun 2009 at 12:02
Move all encoding/decoding related code into the storage.
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:11
Implement the gridworld sample with support for rl-glue
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:08
Implement the Sarsa learner along with its corresponding unit tests
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:10
Implement a storage that does function aproximation.
Maybe tiling and kanerva and rbf
Original issue reported on code.google.com by [email protected]
on 26 Jun 2009 at 2:10
Implement eligibility traces for Q (maybe for sarsa too)
Original issue reported on code.google.com by [email protected]
on 26 Jun 2009 at 2:09
Implement the state_value sample with support for rl-glue
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:07
Spaces should be used to define the model. Agent and Environment should use
the same spaces.
If the space is given, the agent/environment are considered to be
reply-compatible, otherwise, they are rlglue-compatible.
If an agent/environment is reply-compatible, it's attributes can be
accessed by name, otherwise they can only be accessed by position.
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 4:38
Currently rl-glue is a hard dependency. Make this not be like this. rl-glue
should be available if installed, but not required.
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:03
When using a rlglue environment, task spec has not the extra parameter
giving the attribute names. Currently in those cases, the spaces ignore all
attributes.
Make those attributes to be referenceable by index, at least.
Original issue reported on code.google.com by [email protected]
on 16 Jun 2009 at 1:05
Add some way to see/measure how the experiment is going.
Show some measure of performance/error.
Original issue reported on code.google.com by [email protected]
on 28 Jun 2009 at 11:59
Add a complete set of unit tests for the learner.py module
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:10
Implement the blackjack sample with support for rl-glue
Original issue reported on code.google.com by [email protected]
on 15 Jun 2009 at 11:09
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.