Thompson Sampling

Thompson Sampling

Thompson Sampling is an algorithm that performs well on Multi Armed Bandit-problems. Sometimes known as Bayesian Bandit.