A Temporal Difference Learning Agent Powered by Farnama Gymnasium
This project aims to explore the potentials of bootstrapped Temporal Difference learning (TD) using Farnama, basing the experiments primarily off CartPole-v1 although the code is flexible and can be used in other deterministic games with discrete action space.
For more information on the utilisation and code specification, refer to specification.md