Script that compares the Upper Confidence Bound (UCB) and Thompson Sampling algorithms on simulated data. The data sample, provided by Hadelin de Ponteves, records which ad each of ten thousand users would click on.
Both are reinforcement learning algorithms designed to make real-time decisions as new data arrives.
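
For orientation, here is a minimal sketch of how the two selection rules could look. It is not the code in UCB_thompson.py: the function names (simulate_rewards, ucb, thompson), the randomly generated click table, and the exploration constant 1.5 are all assumptions made for illustration, and the real script reads the provided data sample instead.

```python
import math
import random


def simulate_rewards(rounds, ads, seed=0):
    """Hypothetical stand-in for the data sample: one row per user,
    one 0/1 column per ad (1 means the user would click that ad)."""
    rng = random.Random(seed)
    # Assumed click probabilities, drawn once per ad.
    probs = [rng.uniform(0.05, 0.30) for _ in range(ads)]
    return [[1 if rng.random() < p else 0 for p in probs]
            for _ in range(rounds)]


def ucb(data):
    """Each round, show the ad with the highest upper confidence bound."""
    ads = len(data[0])
    counts = [0] * ads   # times each ad was shown
    sums = [0] * ads     # clicks each ad collected
    total = 0
    for n, row in enumerate(data):
        best_ad, best_bound = 0, -1.0
        for i in range(ads):
            if counts[i] == 0:
                bound = float("inf")  # force one trial of every ad
            else:
                mean = float(sums[i]) / counts[i]
                # 1.5 is an assumed exploration constant.
                bound = mean + math.sqrt(1.5 * math.log(n + 1) / counts[i])
            if bound > best_bound:
                best_ad, best_bound = i, bound
        reward = row[best_ad]
        counts[best_ad] += 1
        sums[best_ad] += reward
        total += reward
    return total, counts


def thompson(data, seed=0):
    """Each round, draw from a Beta posterior per ad and show the best draw."""
    rng = random.Random(seed)
    ads = len(data[0])
    wins = [0] * ads
    losses = [0] * ads
    total = 0
    for row in data:
        draws = [rng.betavariate(wins[i] + 1, losses[i] + 1)
                 for i in range(ads)]
        best_ad = draws.index(max(draws))
        reward = row[best_ad]
        if reward:
            wins[best_ad] += 1
        else:
            losses[best_ad] += 1
        total += reward
    counts = [wins[i] + losses[i] for i in range(ads)]
    return total, counts


if __name__ == "__main__":
    data = simulate_rewards(10000, 10)
    print("UCB total reward: %d" % ucb(data)[0])
    print("Thompson total reward: %d" % thompson(data)[0])
```

The sketch uses only the standard library and avoids version-specific syntax, so it should run under both Python 2 and 3. Comparing the two total rewards on the same click table is the kind of comparison the script is meant to make.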
Works with Python 2 and 3.
Usage: UCB_thompson.py rounds ads
Example: UCB_thompson.py 10000 10
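
As a rough illustration of the command line above, the two positional arguments could be read as shown here. The helper name parse_args and its error messages are hypothetical and are not taken from the script itself.

```python
import sys


def parse_args(argv):
    """Hypothetical helper: read `rounds` and `ads` from the command line."""
    if len(argv) != 3:
        raise SystemExit("Usage: UCB_thompson.py rounds ads")
    rounds, ads = int(argv[1]), int(argv[2])
    if rounds <= 0 or ads <= 0:
        raise SystemExit("rounds and ads must be positive integers")
    return rounds, ads


if __name__ == "__main__":
    rounds, ads = parse_args(sys.argv)
    print("rounds=%d ads=%d" % (rounds, ads))
```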